Gene MmarC5_0140 details

Gene Information

Locus tag: MmarC5_0140
Symbol:
ID: 4927792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcus maripaludis C5
Kingdom: Archaea
Replicon accession: NC_009135
Strand:
Start bp: 118052
End bp: 119170
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 36%
IMG OID: 640165640
Product: Allergen V5/Tpx-1 family protein
Protein accession: YP_001096672
Protein GI: 134045186
COG category: [S] Function unknown
COG ID: [COG2340] Uncharacterized protein with SCP/PR1 domains
TIGRFAM ID:
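
The start, end, gene length, and protein length fields in this record agree with each other, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single stop codon that is not counted in the protein length (both are assumptions, not stated in the record). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(118052, 119170)
print(length_bp, protein_length(length_bp))  # 1119 372
```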


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.697458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTGGC CGTTTAAAGC TTTTACATTA ACGTTAATTA CACTATCCTT ATTAGTAGCC 
GCTAGCGCAA ATTGTGTCTT CGATGGATGT GAACCAACTT TTTCAAACGT AACTTACCAA
AATTTTAACG AAAACTATTC TAATTTATTT GATAACAACC ATAATCTATT CTTCGAATTA
AACAGCCTTT TTAAAAATAA TTTGGACTAC AAGACTTTTG AAGTTAAAGC ATATGCTCCC
CTGAAAAAGA CGTCTTCAAA AATAGATTTT CTTGAATCAA GATATGTTCC AGTGACTACG
GTAGACACGT CTTCTGAAGA TAATACCGAC ACGTCTTCTG AAGATAATAC CGACACGTCT
TCTGAAGATA ATACCGACAC GTCTTCTGAA GATAATACCG ACACGTCTTC TGAAGATAAT
ACCGACACGT CTTCTGAAGA TAATACCGAC ACGTCTTCTG AAGATAATAC CGACACGTCT
TCTGAAGATT ACGTTTATTT ACCTTCAAAA ATTACTCAAT CTCCAAAAAC ATCGCTATAT
ATCATTAAAA CTACACAAGA ACCAGTAGTA GAAGAACCAG TAGTAGAAGA ACCAGTAGTA
GAAGAACCAG TAGTAGAAGA ACCAGTAGTA GAAGAACCAG TAGTAGAAGA ACCAGTAGTA
GATAAAAACT CATTGATTGA ACAATATATA TTAGACTATA CCAATATAGA ACGCTCCTCA
TATGGACTCG ATGAGTTAAT ATTAGATAGT AAGTTAAGTC AAATTTCACA AGCTCATAGT
GATGACATGG TGGAAAATGA TTATTTTTCC CATGTAAACT TAGATGGAGA AACTCCTACC
GATAGGGCCA TTGCAGCAGA TTATAACGTT GTAAAATACC TAGGAGACGG ATATTACGCT
ACAGGAATTG GCGAAAATAT TGCAAAAATG CCTACTGGCA ATGTAATTGG AATTGGATAT
GTTTCAGACG ATGCTGAAAG TATTGCAAAA GCTATCGTGG ATGCCTGGAT GGATAGTCCC
GGCCACAGGG CAAATATTCT AAACTCCCAA TACACCAATA TGGGCATAGG CGTATCTTTT
GATGGTACGT ATTATGTTGC TACCCAAAAT TTCTATTAA
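
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before length or GC content can be computed. The following is a minimal Python sketch of that step. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed GC fraction describes that line and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from the 10-base block layout and uppercase."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only, for illustration.
sample = clean_sequence(
    "ATGGATTGGC CGTTTAAAGC TTTTACATTA ACGTTAATTA CACTATCCTT ATTAGTAGCC"
)
print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full gene sequence through the same two functions gives the figures to compare with the Gene Length and GC content fields.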
 
Protein sequence
MDWPFKAFTL TLITLSLLVA ASANCVFDGC EPTFSNVTYQ NFNENYSNLF DNNHNLFFEL 
NSLFKNNLDY KTFEVKAYAP LKKTSSKIDF LESRYVPVTT VDTSSEDNTD TSSEDNTDTS
SEDNTDTSSE DNTDTSSEDN TDTSSEDNTD TSSEDNTDTS SEDYVYLPSK ITQSPKTSLY
IIKTTQEPVV EEPVVEEPVV EEPVVEEPVV EEPVVEEPVV DKNSLIEQYI LDYTNIERSS
YGLDELILDS KLSQISQAHS DDMVENDYFS HVNLDGETPT DRAIAADYNV VKYLGDGYYA
TGIGENIAKM PTGNVIGIGY VSDDAESIAK AIVDAWMDSP GHRANILNSQ YTNMGIGVSF
DGTYYVATQN FY
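
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal Python sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which codons may act as start codons, which this sketch does not model). `CODON_TABLE` and `translate` are illustrative names, not part of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATTGGC CGTTTAAAGC TTTTACATTA"))  # MDWPFKAFTL
print(translate("TTCTATTAA"))  # FY
```

The two calls use the first 30 bases and the last 9 bases of the gene sequence, and their output matches the start and end of the protein sequence listed in this record.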