Gene Mmcs_3235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3235 
Symbol 
ID4112067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3429619 
End bp3431169 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content68% 
IMG OID638032367 
Producthypothetical protein 
Protein accessionYP_640398 
Protein GI108800201 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATT TCGACTGGCT GATGCAGGGC TTCGCGGAGG CCGCGACACC GATGAACCTG 
CTCTACGCGA TCATCGGCGT GCTGCTGGGC ACCGCGGTCG GTGTGCTGCC GGGGATCGGC
CCCGCGATGA CGGTGGCGCT GCTGCTGCCG GTCACCTACA ACGTCAGCCC GAGCGCGGCG
TTCATCATGT TCGCCGGCAT CTTCTACGGC GGCATGTACG GCGGATCGAC CACCTCGATC
CTGCTGAACA CCCCCGGTGA ATCGTCGTCG GTGATCACCG CGATCGAGGG CAACAAGATG
GCCAAGGCCG GCCGGGCCGC CCAGGCGCTG GCCACCGCCG CGATCGGCTC GTTCGTCGCC
GGTGCGATCG GCACTGCGCT GCTCGCGGCC TTCGCACCCC CGATCAGCAG GTTCGCGGTC
ACGCTCGGCG CGCCGTCGTA CCTGGCGATC ATGGTGTTCG CGCTGGTCGC GGTCACCGCG
GTGCTCGGCG CCTCGAAGCT GCGCGGGGCG ATCTCGCTGT TTCTCGGCCT GGCCATCGGG
GTGGTGGGCA TCGACTTCCT CACCGGCCAA CCGCGGGCCA CCTTCGGGCT ACCGCAGCTG
TCCGACGGTA TCGACATCGT GGTGATCGCC GTCGCCGTGT TCGCCGTCGG CGAGGCGTTG
TGGGTGGCCG CCCACCTGCG GCGGCGCCCC GCGGAGGTGA TCCCGGTGGG CCGGCCGTGG
ATGGGTCGCG ACGACTTTCG CCGGTCATGG AAGCCCTGGC TGCGCGGCAC CGCCTACGGC
TTCCCGTTCG GTGCGCTGCC CGCCGGCGGC GCCGAACTGC CGACGTTCCT GTCCTACATC
ACCGAGAAGA AGCTCGCGAA ACGCACGGGG CACGATGTGG AGTTCGGCAA GGGCGCGATC
GAGGGCGTGG CCGGACCGGA GGCGGCCAAC AACGCGTCGG CGGCGGGCAC GCTGGTGCCG
ATGCTGTCGC TCGGCCTGCC CACCAACGCC ACCGCGGCGG TCATCCTGAC CGCCTTCGTG
TCCTACGGAA TCCAGCCCGG TCCAACGCTT TTCGAGAAGG AGCCGTTGCT GATCTGGACG
CTGATCGCCA GCCTGTTCAT CGGCAACCTG CTGCTGTTGG TGCTCAACCT GCCGCTGGCC
CCGCTGTGGG CGAAACTGCT GCGCACACCG CGGCCGTACC TGTACGCCGG CATCCTGTTC
TTCGCCACGC TGGGTGCTCT GGCCGTCAAC ATCCAACCGC TGGACCTGGC GCTGCTGTTG
GTGTTCGGAC TGCTCGGGTT GATGATGCGC CGCTTCGGCC TCCCGGTGCT GCCGTTGATC
ATCGGGGTCA TCCTCGGGCC GCGGATCGAA CGCCAACTGC GGCAGAGCCT TCAACTCGGC
GGCGGCGACT GGACGAGCCT GTTCACCGAA CCGGTCGCGA TCGTCGTCTA CGTGTTGATG
GCGCTGTTAC TGCTGGCCCC CTTGGTGCTC AAGCTCTTTC ACCGTAGTGA GGACACTCTG
CTCATCGTGG AGGACGATGT GGACCAACAG GAGAAGGCGG CACGGACATG A
 
Protein sequence
MNNFDWLMQG FAEAATPMNL LYAIIGVLLG TAVGVLPGIG PAMTVALLLP VTYNVSPSAA 
FIMFAGIFYG GMYGGSTTSI LLNTPGESSS VITAIEGNKM AKAGRAAQAL ATAAIGSFVA
GAIGTALLAA FAPPISRFAV TLGAPSYLAI MVFALVAVTA VLGASKLRGA ISLFLGLAIG
VVGIDFLTGQ PRATFGLPQL SDGIDIVVIA VAVFAVGEAL WVAAHLRRRP AEVIPVGRPW
MGRDDFRRSW KPWLRGTAYG FPFGALPAGG AELPTFLSYI TEKKLAKRTG HDVEFGKGAI
EGVAGPEAAN NASAAGTLVP MLSLGLPTNA TAAVILTAFV SYGIQPGPTL FEKEPLLIWT
LIASLFIGNL LLLVLNLPLA PLWAKLLRTP RPYLYAGILF FATLGALAVN IQPLDLALLL
VFGLLGLMMR RFGLPVLPLI IGVILGPRIE RQLRQSLQLG GGDWTSLFTE PVAIVVYVLM
ALLLLAPLVL KLFHRSEDTL LIVEDDVDQQ EKAART