Gene Mmcs_3454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3454 
Symbol 
ID4112286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3664503 
End bp3665813 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content69% 
IMG OID638032589 
Producthypothetical protein 
Protein accessionYP_640617 
Protein GI108800420 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0757862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGAC TTCCCCAACT CATCGGAGTG ATCGCGCTCG TCGCGTTCGG TGGGCTGTTC 
GCGGCTATCG ACGCCGCGCT GAGCACGGTG TCCATGGCCC GGGTCGAGGA ACTCGTACGC
GAGGAACGGC CGGGAGCCGT GCGGTTGCAG CGGGTGATGC ACGAACGGCC CCGCTACATC
AACCTCATCG TGCTGCTGCG GATCGCCTGC GAGGTGACCG CGACTGTGCT GCTCGCCGCC
TACCTGGACG GCCACCTCGG CGTGAGCTGG GGACTGACCG CGGCCGCGGC CATCATGGTG
GTCGCCAGCT TCGTCGCCGT CGGCGTCGGG CCGCGCACCG TCGGGCGCCA GAACGCCTAT
CCCATCGCGC TGTACACCGC GCTTCCGCTG CAGGCCATCT CGGTGCTGCT CACCCCGATC
AGCCGCCTGC TGGTGTTGAT CGGCAACGCG CTGACCCCCG GCCGCGGATT CCGCAACGGG
CCGTTCGCCT CGGAGATCGA ACTGCGTGAG GTCGTCGACC TGGCGCAGCA GCGCGGCGTG
GTGGCCGACG ACGAGCGCCG GATGATCCAG TCGGTGTTCG AACTCGGCGA CACCGCGGCC
CGCGAGGTGA TGGTGCCGCG CACCGAGATG GTGTGGATCG AAAGTGACAA GACAGCCGGC
CAGGCCACCT CACTCGCGGT CCGCAGCGGA CACTCCCGCA TCCCCGTCAT CGGGGAGAAC
GTCGACGACG TGGTCGGCGT GGTGTACCTG AAAGACCTCG TCCAGCAGAC GTATTACTCG
GTCAACGGCG GCCGCGACAC CACCGTCGCG CAGGTCATGC GCGATCCGGT GTTCGTGCCG
GACTCCAAAC CGCTCGACGA ACTGCTGCGT GAGATGCAGC GCGACCGCTA CCACATGGCG
CTGCTGGTCG ACGAGTACGG CGCCATCGCC GGGCTGGTCA CCATCGAGGA CGTCCTCGAG
GAGATCGTGG GTGAGATCGC CGACGAGTAC GACACCGACG AGGTGGCGCC GGTCGAAGAA
CTCGGCAACC GGGAGTACCG GGTGTCCGCG CGGCTGCCCA TCGAGGACCT CGGCGAGCTC
TACGACATCG AGTTCGGTGA GGATCTCGAC GTCGACACCG TCGGCGGTCT GGTCGCCTTC
GAACTCGGGC GCGTACCGCT GCCCGGCGCC GAGATCACCT GGGACGGCCT GCGGCTCAAG
GCCGAAGGCG GCCCCGACCA TCGCGGCCGG GTGCGCATCG GCACCGTCCT GGTCAGCCCC
ACCGAGCCAG AGCACGACGA CGAGACCGAA CCCGAGGAGC GCGGTGACTG A
 
Protein sequence
MSGLPQLIGV IALVAFGGLF AAIDAALSTV SMARVEELVR EERPGAVRLQ RVMHERPRYI 
NLIVLLRIAC EVTATVLLAA YLDGHLGVSW GLTAAAAIMV VASFVAVGVG PRTVGRQNAY
PIALYTALPL QAISVLLTPI SRLLVLIGNA LTPGRGFRNG PFASEIELRE VVDLAQQRGV
VADDERRMIQ SVFELGDTAA REVMVPRTEM VWIESDKTAG QATSLAVRSG HSRIPVIGEN
VDDVVGVVYL KDLVQQTYYS VNGGRDTTVA QVMRDPVFVP DSKPLDELLR EMQRDRYHMA
LLVDEYGAIA GLVTIEDVLE EIVGEIADEY DTDEVAPVEE LGNREYRVSA RLPIEDLGEL
YDIEFGEDLD VDTVGGLVAF ELGRVPLPGA EITWDGLRLK AEGGPDHRGR VRIGTVLVSP
TEPEHDDETE PEERGD