Gene Mvan_4876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4876 
Symbol 
ID4643854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5222083 
End bp5223531 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content70% 
IMG OID639808347 
Productsuccinate-semialdehyde dehydrogenase (NAD(P)(+)) 
Protein accessionYP_955655 
Protein GI120405826 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.892375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.621517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACCT CCGCGCTGCT GAAATCCGTG CCGACCGGTC TGTGGATCGG CGGTGAGGAA 
CGCCAGGGCA CGTCGACGTT CAGCGTTCTC GACCCCAGCG ACGACGAAGT GCTGGCGACG
GTGGCCGACG CCACGGCCGA CGACGCCCGC GACGCGCTCG ACGCCGCGTG CGCGGTGCAG
GGCGAGTGGG CCGCCACCCC GGCGCGCAAG CGCGGCGAGA TCCTGCGGTC GGTGTTCGAG
ACGATCACCG CGCGCGCCGA CGACATCGCC GCGCTGATGA CTCTGGAGAT GGGCAAGGTT
TTCGCAGAGA GCAAGGGCGA GGTCACCTAC GGCGCCGAGT TCTTCCGCTG GTTCGCCGAG
GAGGCCGTGC GCATCGAAGG CCGCTACACC CCGGCTCCCG CCGGCACCGG CCGGATCCTG
GTGACCAAGC AGGCCGTCGG CCCCTGCTAC GCGATCACTC CGTGGAACTT CCCGCTGGCG
ATGGGCACCC GCAAGATGGG CCCGGCGTTC GCCGCCGGCT GCACGATGAT CGTCAAGCCC
GCCCAGGAGA CCCCGCTGAC CATGCTGCTG CTGGCCAAGC TGATGGCCGA GGCGGGCCTG
CCCAAGGGCG TGCTGTCGGT GCTGCCCACC AGCAATCCGG GCGCGGTGAC CGAGGCGCTG
ATCAACGACG GCCGGCTGCG CAAGCTGACG TTCACCGGAT CGACCGGTGT GGGCAAGGCG
CTGGTGAAGC AGTCGGCCGA CAAACTGCTG CGCACCTCGA TGGAGCTGGG CGGCAACGCG
CCGTTCATCG TCTTCGACGA CGCCGACGTC GACGCCGCCG TGGACGGGGC GATCCTGGCC
AAGATGCGCA ACGGCGGCGA GGCCTGCACC GCGGCCAACC GCATCCACGT CGCCAACGCG
GTGCGCGAGG AGTTCACCGA GAAGTTCGTG AAACGGATGA GCGAGTTCAC CCTCGGCAAG
GGCCTCGACG AGAAGTCGAC GCTGGGTCCG CTGATCAACG CCAAGCAGGT CGCCACCGTC
ACCGAACTGG TGTCCGACGC GGTGTCGCGC GGCGCGACCG TCGCGGTCGG CGGTGTCGCA
CCGGGCGGGC CCGGCAACTT CTACCCGGCC ACCGTGCTGA CCGACGTGCC CGCCGACGCG
CGCATCCTCA AGGAAGAGGT GTTCGGGCCC GTCGCGCCGA TCGCCGGGTT CGACACCGAG
GAGGAGGGCA TCGCCGCGGC CAACGACACC GAGTACGGCC TGGCCGCCTA CGTGTACACC
CAGTCGCTGG ACCGCGCGCT GCGGGTGGCC GAGAACATCG AGTCCGGAAT GGTCGGCGTC
AACCGCGGGG TGATCTCCGA CGCCGCCGCG CCGTTCGGCG GGATCAAGGA ATCCGGCTTC
GGCCGCGAGG GCGGCACCGA GGGCATCGAG GAGTACCTCG ACACGAAGTA CATCGCACTG
ACCAGATAG
 
Protein sequence
MDTSALLKSV PTGLWIGGEE RQGTSTFSVL DPSDDEVLAT VADATADDAR DALDAACAVQ 
GEWAATPARK RGEILRSVFE TITARADDIA ALMTLEMGKV FAESKGEVTY GAEFFRWFAE
EAVRIEGRYT PAPAGTGRIL VTKQAVGPCY AITPWNFPLA MGTRKMGPAF AAGCTMIVKP
AQETPLTMLL LAKLMAEAGL PKGVLSVLPT SNPGAVTEAL INDGRLRKLT FTGSTGVGKA
LVKQSADKLL RTSMELGGNA PFIVFDDADV DAAVDGAILA KMRNGGEACT AANRIHVANA
VREEFTEKFV KRMSEFTLGK GLDEKSTLGP LINAKQVATV TELVSDAVSR GATVAVGGVA
PGGPGNFYPA TVLTDVPADA RILKEEVFGP VAPIAGFDTE EEGIAAANDT EYGLAAYVYT
QSLDRALRVA ENIESGMVGV NRGVISDAAA PFGGIKESGF GREGGTEGIE EYLDTKYIAL
TR