Gene Mmcs_2814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2814 
Symbol 
ID4111646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2960103 
End bp2961518 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content70% 
IMG OID638031938 
Producthypothetical protein 
Protein accessionYP_639977 
Protein GI108799780 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCG CATACACCCT GCTCTCGCTG TTGGCGATCG TGGTGTTGAC CGCGGGTACC 
GCGGTGTTCG TCGCCGCCGA GTTCTCGTTG ACGGCGCTGG AGCGCAGCAC CGTCGAGGCC
AATGCCCGTA GCGGGCACCG GCGCGACCAG CTCGTGCGCC GCGCCCACCG CACGCTGTCC
TTCCAGCTCT CGGGCGCCCA GGTGGGCATC TCGATCACCA CGCTGGCCAC CGGCTACCTG
GCCGAACCCG TGGTCGCCCG GCTGCTGCAG CCGGGTCTGG ACGCGATCGG ACTGCCGGAA
CAGGCCGCGA GCGGCGTCGC CCTGTTCCTC GCGATCCTGA TCGCCACCTC CCTGTCGATG
GTCTTCGGCG AACTCGTGCC GAAGAACCTC GCGGTGGCCC GCCCGGCGCC GACCGCCCGC
GCAGCCGCAC CTCCGCAGCT GCTCTTCTCG ACGATCTTCA CGCCGCTGAT CCGGCTCACC
AACGGCACCG CGAACATGAT CCTGCGCCGG CTGGGCATCG AACCCGCCGA GGAACTGCGC
TCCGCGCGGT CGGTGCAGGA GCTGATCTCG CTGGTGCGCA ACTCCGCGCG CAGCGGTTCA
CTCGACCCGG TGACCGCGGT GCTGGTGGAC AGGTCACTGC AGTTCGGCGA GCGCACCGCC
GAGGAACTGA TGACCCCCCG CACCGAGATC GAGGCCCTGC AGGCCGACGA CACCGTCGCC
GACCTCATCG CCGCGGCGAT CGAAACGGGG TATTCGCGCT TCCCGATCGT CGAGGGTGAC
CTCGACGAGA CCATCGGCGT CGTCCACGTC AAACAGGTGT TCTCGGTACC GCGCGACGAC
CGCGACCGCA CCCGCCTCGC GGCAATCGCG ATCCCGGTGG CCACCGTGCC CTCGACGCTG
GACGGGGACG CGGTGATGAC CCAGATCCGC GCCAACGGGC TGCAGACCGC GCTGGTGGTC
GACGAGTACG GCGGCACCGC CGGCATGGTG ACCGTCGAGG ATCTGATCGA GGAGATCGTC
GGCGACGTCC GCGACGAACA CGATGACGCC ACCCCCGACG TGGTCGCCGC CGGCGACGGC
TGGCAGGTGT CGGGCCTGCT GCGGATCGAC GAGGTGGCCA CCGGGACCGG TTTCCGCGCC
CCCGAGGGCG AGTACGAGAC CATCGGCGGG CTGGTGCTGC AGGAGCTCGG ACACATCCCG
GAAGTGGGCG ACTCGGTCGA GCTGACCGCG TTCGATCCGG ACGGGCCGCT CGACGATCCG
ATCCGCTGGC AGGCCAAGGT CGTGCAGATG GACGGTCGCC GGATCGACCT TCTGGAGTTG
GTCGAACTCG GGCGCCGCGG CGACACCGAC GACGACCACA TCGACAACGA CGACCACCAC
AACAAAGACG GCGCCGCGCC GGAGGAGGAC CGCTGA
 
Protein sequence
MSVAYTLLSL LAIVVLTAGT AVFVAAEFSL TALERSTVEA NARSGHRRDQ LVRRAHRTLS 
FQLSGAQVGI SITTLATGYL AEPVVARLLQ PGLDAIGLPE QAASGVALFL AILIATSLSM
VFGELVPKNL AVARPAPTAR AAAPPQLLFS TIFTPLIRLT NGTANMILRR LGIEPAEELR
SARSVQELIS LVRNSARSGS LDPVTAVLVD RSLQFGERTA EELMTPRTEI EALQADDTVA
DLIAAAIETG YSRFPIVEGD LDETIGVVHV KQVFSVPRDD RDRTRLAAIA IPVATVPSTL
DGDAVMTQIR ANGLQTALVV DEYGGTAGMV TVEDLIEEIV GDVRDEHDDA TPDVVAAGDG
WQVSGLLRID EVATGTGFRA PEGEYETIGG LVLQELGHIP EVGDSVELTA FDPDGPLDDP
IRWQAKVVQM DGRRIDLLEL VELGRRGDTD DDHIDNDDHH NKDGAAPEED R