Gene Mmcs_3173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3173 
Symbol 
ID4112005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3363807 
End bp3365291 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content64% 
IMG OID638032304 
ProductUBA/THIF-type NAD/FAD binding fold 
Protein accessionYP_640336 
Protein GI108800139 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCGCGG GTCAACGACT ATGCATCTAT TTGGATCCTT CACGGGAATG GCAGCCGACG 
CTTGGTGTGG CCGGCCTACT CACCCGCCTT TGGGACTGGC TCGTCGACGC GGCCGCCGGA
AACTTCGACG CCGCCACCGC CATGTACCAC GCTGTTGGCG GAGTGCCGCA TCAGGCACAT
GACACACCGA CGATCGTTAC CCGAGAACCC GGACCGGCGA AGCGCCACCA AACGGCTCAC
CTGATCGCCC GGTCAACGCA CCGATACGAC CTGACGTACT CGCCTGGAGC TGCCGGGCAT
CGCGTACCGG TAATTACCCT GGCCACCGCG CTGCCGTTCG GTGCCGCATC CACATTCGCG
CTACTGCTTG CTCTCCTGGA CGACCCCTAC CTTGACCGCC TCGAAGGACG GGCTCCCCGG
ATCGCACCGC AATCGCCGGC GTTCCTCACC GCCCTCCTGG CGAGTGCGTT ACGAAATCAC
CACGACGCCG AGCAATACTT CGTCCTCGCC GTGCCGCACC CCGCTGGAGG CCCACCCCAC
CTCTTGGGCG GACGGCTCCC CACCCCAACG GCGAATGCGC TCCGCGAGGT CGCGCAGCAA
CGGGGTGTGG GGGTTGTTCT CGACCCCGCG AAGATCAACA CTGAAATCCC GATTGAGTGG
TGCAGGATGT CCGACGAACG ACCCGAAGTG ACAACCCGCC GCGACGACGG CCGCCCCGTG
AACGGATTTC AACGAAAGAC TGTCCACATC TGGGGCTGCG GCGGGCTCGG ATCATGGATC
GCCGAATTCA TCGCTCGCGC AGGAGCATCG GAGATCACCG TGTGCGACCC TGGCATCGTC
ACCGGCGGCT TGCTCGTCCG ACAAAACTAC GTCGAAGACG ACATTGGCCG TTCCAAAGCC
GAGGCACTCG CTGGACGGCT CCGCGCGATC CGTGATGACC TGACGGTCAC CGTCGCAGAA
GGGCACCTCC CAGAAGACCA CACGTCATGC CTGGCAGCGG ATCTCATCAT CGACGCCACA
GTGAACAACG GCATCACGAG CTGTCTCGAT GCGTTGGCAA CTGCGCCGAC GCGAAAGGCA
TTGATCGCTC AGGTCGCCAC AGACGCTCGC TCTGGCACGC TCGGCCTAGC CGTGCTGTGC
GCCGCAAGCG CAACAGCGAC AGTTTCCAGC ATCGATCAAG ACGCTGGCCG AACAATCCAG
GGCGACAGCG GACTTGAGCT CTACCACACG CTGTGGCAAG AACCCAGCGA TGACGAACTT
ATACCAACCA GGGGCTGCTC GGTCCCCACA TTCCACGGCT CGGCAGCCGA CCTCGTAGCG
GTCGCAGCCA CACTCGTCAA CCTGATCGGA AGCCACCTCC AACAACCGGA CTCCGCGGTT
TCGGGCACAC ACCTCATCGC TCTGCCGCAC GCGGCCAGCG GCCCCCGACA CCACTTCCTC
CCCGGTGTAA CGCACCCCAT GGATCACACA GCAGGGACAG AATGA
 
Protein sequence
MLAGQRLCIY LDPSREWQPT LGVAGLLTRL WDWLVDAAAG NFDAATAMYH AVGGVPHQAH 
DTPTIVTREP GPAKRHQTAH LIARSTHRYD LTYSPGAAGH RVPVITLATA LPFGAASTFA
LLLALLDDPY LDRLEGRAPR IAPQSPAFLT ALLASALRNH HDAEQYFVLA VPHPAGGPPH
LLGGRLPTPT ANALREVAQQ RGVGVVLDPA KINTEIPIEW CRMSDERPEV TTRRDDGRPV
NGFQRKTVHI WGCGGLGSWI AEFIARAGAS EITVCDPGIV TGGLLVRQNY VEDDIGRSKA
EALAGRLRAI RDDLTVTVAE GHLPEDHTSC LAADLIIDAT VNNGITSCLD ALATAPTRKA
LIAQVATDAR SGTLGLAVLC AASATATVSS IDQDAGRTIQ GDSGLELYHT LWQEPSDDEL
IPTRGCSVPT FHGSAADLVA VAATLVNLIG SHLQQPDSAV SGTHLIALPH AASGPRHHFL
PGVTHPMDHT AGTE