Gene Hhal_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1471 
Symbol 
ID4710044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1587967 
End bp1589103 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content71% 
IMG OID639855938 
Productsuccinyl-diaminopimelate desuccinylase 
Protein accessionYP_001003040 
Protein GI121998253 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01246] succinyl-diaminopimelate desuccinylase, proteobacterial clade 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCA CGCTCGAGCT CGCCCGGGAG CTGATCCAGC GCCCGTCGGT AACTCCCGAG 
GACGCCGGAT GCCAGACCCT CGTCGCCGAG CGCCTGGCCG CGGCCGGGTT CGGCGCCGAG
TGGCTGAACG CCGCCGGGGT CACCAACCTG TGGGCGCAGC GCGGCACCGA GCGCCCCCTG
TTCTGCTTTC TCGGCCACAC CGACGTGGTC CCCAGCGGTC CAGAGTCGGC CTGGCAACAC
CCGCCGTTCC AGCCCATCGT CGAGAACGGC TGTCTCTATG GCCGAGGCGC GGCCGACATG
AAGGGCAGTG TGGCGGCCTT CGTCGCTGCG GTGGAGCGCT TCGTCGCCCG CCACCCGGAC
CACGCGGGCG CCATCGCCGT GCTGCTGACC AGCGACGAGG AAGGCCCCGC GGTGGATGGC
ACCCGACGCG TGGTCGAGAC CCTGGCAGCG CGGGGGGCGG CCATCGACTA CTGCCTGGTG
GGCGAACCCA GCAGCCAGGC ACGGCTCGGC GACGAGTACA AGGTCGGCCG CCGCGGGTCC
CTAACGGGGC ACCTCACCGT GCACGGCGAA CAGGGGCACG TCGCCTACCC GCACCAGGCG
GACAATCCCA TCCACGCGTT CGCCCCGGCA CTCCAGGAGC TGGTCGCCAC CGAGTGGGAC
CAGGGCGATG CCGACTTCCC GCCGACGAGC TTCCAGATCT CCAACATCCA GGCGGGCACC
GGCGCCGACA ACGTCATCCC CGGAGCCATG GAGGTCGTGT TCAACCTGCG CTACGCCCCG
GCGGTCTCCG CCGAGGAGCT TCAGGAACGG ATCGAATCCA TCCTGCACCG TCACGGGGTG
CACCACACCC TGCACTGGCG GCACTCCGGC GCCCCCTTCG CCACCCGCGA GGGCGCACTC
ATCGATGCCG TTGAACAGGC AGTCACAGCG CACACCGGGC AGTGTCCACG ACGATCGACC
TCCGGCGGCA CCTCCGATGG CCGTTTCATG GGTCCGACCG GGGCGCAGGT GGTCGAGCTT
GGTCCGCTGA ACGCCACCAT CCACAAGGCC AACGAGCACG TCGCGGTCGC CGACCTGGAG
GCCCTGGAGG CGATCTACTT CGACATCCTG CAGCACCTGC TGGCCCCGGC CGACTGA
 
Protein sequence
MSATLELARE LIQRPSVTPE DAGCQTLVAE RLAAAGFGAE WLNAAGVTNL WAQRGTERPL 
FCFLGHTDVV PSGPESAWQH PPFQPIVENG CLYGRGAADM KGSVAAFVAA VERFVARHPD
HAGAIAVLLT SDEEGPAVDG TRRVVETLAA RGAAIDYCLV GEPSSQARLG DEYKVGRRGS
LTGHLTVHGE QGHVAYPHQA DNPIHAFAPA LQELVATEWD QGDADFPPTS FQISNIQAGT
GADNVIPGAM EVVFNLRYAP AVSAEELQER IESILHRHGV HHTLHWRHSG APFATREGAL
IDAVEQAVTA HTGQCPRRST SGGTSDGRFM GPTGAQVVEL GPLNATIHKA NEHVAVADLE
ALEAIYFDIL QHLLAPAD