Gene Mvan_5043 details


Gene Information

Locus tag: Mvan_5043
Symbol:
ID: 4644780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5398058
End bp: 5399089
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 66%
IMG OID: 639808514
Product: putative glutathione S-transferase
Protein accession: YP_955821
Protein GI: 120405992
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0435] Predicted glutathione S-transferase
TIGRFAM ID:
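
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 5398058
end_bp = 5399089

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1032 343
```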


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.939861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTACG TCGCCGACCC GTCGAGCTCT GGCGGGGAGT TCAACCGGGA CACCGAATAC 
ATCTCCACCC GGATCACCGC CGACGGGCGG GACGGCTATC CCGTCGAGCC GGGCCGCTAT
CGGCTCATCG TTGCCCGAGC GTGCCCATGG GCCAACCGCA CCATCATCGT GCGGCGGCTG
CTCGGGCTGG AAGATGTTCT GTCCATAGGC TTTTGCGGCC CGACCCATGA TGAGCGCAGC
TGGACGTTCG ACCTCGATCC CGGTGGTGTC GACCCGGTGC TGGGCATTCA CTTCCTGCGC
GACGCCTACA ACAAACGTGT GCGCGACTAC CCCAAGGGTG TCACCGTCCC GGCCGTCGTG
GAGGTCGCGA CGGGAGAGGT CGTCACCAAC GACTTCGCGC AGATCACCCT GGACTTCTCC
ACCGAGTGGA CCGCCTACCA CCGCGACGGC GCACCGCAGC TCTATCCCGA ACCGCTGCGC
GACGAGATCG ACGAGGTCGC CCAGCGCGTC TACACCGAGG TCAACAACGG CGTCTACCGG
TGCGGTTTCG CGGGGTCCCA GCGGGCCTAC GAGAAGGCAT ACGACCGGTT GTTCACCGCG
CTGGACTGGC TGTCCGAGCG GCTGTCGCGG CAGCGCTTCC TGGTGGGCGA CACCATCACC
GAGGCAGACG TACGACTATT CACCACACTG GCTCGATTCG ACCCCGTGTA TCACGGCCAC
TTCAAGACCA ATCGCAGCAA GCTCTCCGAG ATGCCGGTGC TGTGGGCATA CGCACGCGAC
CTGTTTCAGA CGCCGGGGTT CGGTGACACC ATCGACTTCG TGCAGATCAA GCAGCACTAC
TACATCGTTC ACTCCGACAT CAATCCCACC GGCATCGTCC CGAAGGGGCC GGAGCTGTCG
AACTGGCTGA CGCCGCACGG TCGAGAAGCG TTGGGCGGCA GACCGTTCGG TGACGGAACC
GCCCCCGGGC CGACGCGGGA CACCGAGCGC GTGCCCGAGG GTCACACAGC CGGCGACTCG
CAACCCGGAT GA
 
Protein sequence
MTYVADPSSS GGEFNRDTEY ISTRITADGR DGYPVEPGRY RLIVARACPW ANRTIIVRRL 
LGLEDVLSIG FCGPTHDERS WTFDLDPGGV DPVLGIHFLR DAYNKRVRDY PKGVTVPAVV
EVATGEVVTN DFAQITLDFS TEWTAYHRDG APQLYPEPLR DEIDEVAQRV YTEVNNGVYR
CGFAGSQRAY EKAYDRLFTA LDWLSERLSR QRFLVGDTIT EADVRLFTTL ARFDPVYHGH
FKTNRSKLSE MPVLWAYARD LFQTPGFGDT IDFVQIKQHY YIVHSDINPT GIVPKGPELS
NWLTPHGREA LGGRPFGDGT APGPTRDTER VPEGHTAGDS QPG
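
The gene and protein sequences can be related with a minimal sketch like the one below. It is not the database's own pipeline: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). Only the first row of the gene sequence is used in the demonstration; pasting in the full sequence works the same way, since the helpers strip the spacing between 10-base groups.

```python
# Standard amino acid assignment, codons ordered by base in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence as printed above.
first_row = "ATGACCTACG TCGCCGACCC GTCGAGCTCT GGCGGGGAGT TCAACCGGGA CACCGAATAC"
print(translate(first_row))  # prints: MTYVADPSSSGGEFNRDTEY
print(round(gc_content(first_row)))
```

The translated prefix matches the first two groups of the protein sequence, MTYVADPSSS GGEFNRDTEY.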