Gene Mvan_3953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3953 
Symbol 
ID4646244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4225104 
End bp4226369 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content72% 
IMG OID639807415 
Productsaccharopine dehydrogenase 
Protein accessionYP_954736 
Protein GI120404907 
COG category[S] Function unknown 
COG ID[COG3268] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0682928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAA ACGAAGCACA CGACCGGGAG CACGACATCG TTGTCTACGG CGCCACCGGG 
TTCGTCGGGA AGCTGACGGC GCAGTATCTC GCCGCCGCGG GAGCCGGCGC CCGTATCGCG
CTGGCGGGCC GCTCCACCGA TCGCCTGCTG GCGGTGCGGG AATCGCTGGG GGAGGCCGCG
CAGGACTGGC CGCTGCTCGT CGCGGACGCG TCGCAGCCCT CGACGCTCAA CGCGATGGCG
GCCAGCACCC GCGTGGTGAT CACCACGGTC GGTCCCTACC TGCGCTACGG GCTGCCGCTT
GTCGCGGCGT GCGCGGCCGC GGGCACCGAC TACGCGGATC TCACCGGGGA GACGCTGTTC
GTGCGCGAGT GCATCGACCT GTACCACAAG CAGGCCGCCG ACACGGGAGC GCGCATCGTG
CACGCGTGCG GGTTCGACTC CATCCCGTCG GATATGACCG TGTTCGCGCT GTACCGGGCC
GCCGAACGCG ACCGCACCGG TGAGCTCGGC GACACCAATT TCGTCGTCCG CTCCTTCGCC
GGCGGGGTAT CCGGCGGCAC GGTGGCGTCG ATGACCGAGC TGGCTCGCCA GGCATCGCAG
GACCCCGAGG CCCGGCGCCT GCTCAACGAC CCGTACACGC TCACCCCCGA CCGTGCGGCC
GAGCCCGAAC TCGGAGCCCA GCCCGACGCG CGGTGGCGGC GGGGCCGCGA GATCGCCCCG
GAACTGGACG GCTACTGGGT CGGCGCGTTC GCGATGGCGC TGCCCAACAC CCGCGTCGTC
CGGCGCAGCA ACGCGCTTCT GGGCTACGCG TACGGCAGGC GGTTCGAATA CGCCGAACAG
ATGAGCACCG GCCGTTCCGT GGGCGCGCCG CTGGTCGCCG CCATGGCCAC GGCGGGCAAC
GTCGCGACGA TGGAGCTCAG CAGTCGCTTC CTGGACCGGG TGCCGCGGGG CGCGCTCGAG
CGCATCCTCC CCAAGGTTGG TACCGGGCCC AGCGAACAGA CCCGTGAACG CGGGCACTAC
ACCGTCGAGA CCTACACCAC GACGTCGACC GGCGCCCGCT ACCTCGCCCG GATGTCCCAG
CAGGGCGACC CCGGCTACAA GGCCACGTCG GTGCTGCTCG GCGAGAGCGG CCTGGCCCTT
GCGCTGGACC GCGACAAGCT GTCCGACCTG CGCGGGATCC TGACCCCGGC CGCCGCCATG
GGGGATGCGC TGCTGGCCCG GTTCCCGGCC GCGGGGGTGT CGCTGGACGT GTCCAGGCTG
AACTGA
 
Protein sequence
MSANEAHDRE HDIVVYGATG FVGKLTAQYL AAAGAGARIA LAGRSTDRLL AVRESLGEAA 
QDWPLLVADA SQPSTLNAMA ASTRVVITTV GPYLRYGLPL VAACAAAGTD YADLTGETLF
VRECIDLYHK QAADTGARIV HACGFDSIPS DMTVFALYRA AERDRTGELG DTNFVVRSFA
GGVSGGTVAS MTELARQASQ DPEARRLLND PYTLTPDRAA EPELGAQPDA RWRRGREIAP
ELDGYWVGAF AMALPNTRVV RRSNALLGYA YGRRFEYAEQ MSTGRSVGAP LVAAMATAGN
VATMELSSRF LDRVPRGALE RILPKVGTGP SEQTRERGHY TVETYTTTST GARYLARMSQ
QGDPGYKATS VLLGESGLAL ALDRDKLSDL RGILTPAAAM GDALLARFPA AGVSLDVSRL
N