Gene Mmcs_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1121 
Symbol 
ID4109959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1218185 
End bp1219276 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID638030243 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_638290 
Protein GI108798093 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTACAG AGCGTACGAG TGTGGGACTG GACGTGCACG CACGTTCGGT AGCAGCAGCT 
GCCATCGACG GCGTGACCGG TGAGGTTCAG CAGACTCGCC TGACCCCATC CCATGAGCAC
ATCCGGTCGT GGATATCGGG GCTGGCGGGC CCGGTGGCGG TGGCCTACGA GGCTGGTCCC
ACCGGTTTCG GCTTGCAGCG GTCGTTGACG GAGGCCGGGA TCCGCTGCGT CGTGGTGGCG
CCGTCGAAAC TGCAGAAGCC CGCTGGAGAT CGAGTGAAGA CCGATGCCCG CGACGCCCTG
CACCTGTGCC GGTTGTTGCG GCTGGATGAG ATCACGTCGG TGTCGATTCC GAGCGTGGCT
CAGGAAGCGG CTCGTGACTT GGTGCGTGCC CGCGAGGACT GCCGCGGCGA CCTGATGCGG
GCTCGGCATC GCCTGTCCAA GCTGCTGTTG CGCCACGGCA TCGTGTACTA CGGCGGGCAG
GCCTGGACCG GTGCCCATGA TCAGTGGCTG CGCACCGTCG CCGCGCCGCA GCTCATGGCG
CCGGCGACGC GGATGGCCTT TGACGCCGAC TATGACCACG TGTTGACAAT GCAGGCCCGG
CGGCGACGGC TGGACGCAGC GATCGAGGAG AGGGCCGCCG ATAGTGAGTT CACCGCGATC
GTGCGGCGGG TGTCGTGTCT GCGGGGGGTG AACACGTTGA CCGGGTTTGC GTTGGCAGTC
GAAATCGGTG ATTGGAACCG GTTCACCGGC AACACGATTG GTTCCTTCGT CGGGCTGGTT
CCCTCGGAGT TTTCGTCGGG CTCCTCGCGG GCTCAAGGTC CGATCACCAA GACCGGCAAC
ACCCATGTCC GGCGGCTGCT GGTCGAGGCG GCGTGGCATC ACAAGCCGCG ATATCGGGTC
GGTACGGTGA TGCGTTCGCG GTGGGATCGG GCATCTGCGG CGGCCCGCGC CCGCGGGGAC
GAAGGCAACC GCCGCCTGCA TGGCAGGTGG GTGGGCTTCC TGGAGCGACG CAAACGACCC
GTGACGGCCA ATGTCGCGGT CGCGCGTGAG CTGGCCGGCT GGTGCTGGTC GCTGGCCGTC
ATGGACGACT GA
 
Protein sequence
MFTERTSVGL DVHARSVAAA AIDGVTGEVQ QTRLTPSHEH IRSWISGLAG PVAVAYEAGP 
TGFGLQRSLT EAGIRCVVVA PSKLQKPAGD RVKTDARDAL HLCRLLRLDE ITSVSIPSVA
QEAARDLVRA REDCRGDLMR ARHRLSKLLL RHGIVYYGGQ AWTGAHDQWL RTVAAPQLMA
PATRMAFDAD YDHVLTMQAR RRRLDAAIEE RAADSEFTAI VRRVSCLRGV NTLTGFALAV
EIGDWNRFTG NTIGSFVGLV PSEFSSGSSR AQGPITKTGN THVRRLLVEA AWHHKPRYRV
GTVMRSRWDR ASAAARARGD EGNRRLHGRW VGFLERRKRP VTANVAVARE LAGWCWSLAV
MDD