Gene MCA1032 details


Gene Information

Locus tag: MCA1032
Symbol:
ID: 3102226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1083865
End bp: 1086072
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 67%
IMG OID: 637170217
Product: S1 RNA-binding domain-containing protein
Protein accession: YP_113508
Protein GI: 53804824
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
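
The start and end coordinates, gene length, and protein length listed above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a protein length that excludes the single terminal stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1083865, 1086072)
print(length)                  # 2208
print(protein_length(length))  # 735
```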


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.013008
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTTTCA TCGCGCGCTA CCGCAAGGAA GTGACCGGGG GTCTGGATGA TACCCAATTG 
CGCACCCTGG AAGTCCGCCT CGGCTATCTG CGCGAGCTGA ATGACCGACG CGAAGTCATT
CTGGCCAGCA TCGCCGAACA GGGCAAGCTG ACCACGGAGC TGGAGCAGGC CATCCGTGAG
GCCGATACCA AGACCGGGCT CGAAGACCTG TATCTTCCCT ACCGGCCGAG GCGGCGGACC
AAGGCACAGA TCGCCCGAGA GGCGGGTTTG GAGCCTCTGG CCGATGCGTT GCTGGCCGAA
CCGGCGCTCG AACCTCTGGC GACGGCGGCC GGCTATGTCG ATGCCGGCAA AGGCGTGGCC
GATGCCGGCG CAGCCTTGGA CGGTGCCCGT CAAATCCTGA TGGAGCGCTT CGCCGAGGAG
GCGACCTTGG CGGGCGAGTT GCGTGAAAAG GTGTGGAGCG AAGGGCTTCT GGTTTCGTCC
CTGGTGGATG GCAAGGCAGA GGGAGGAGCG AAGTTCAAGG ATTATTTCGA CTATGAGGAG
GCGCTGGCCG GGATTCCTTC GCACCGCGCC CTGGCCCTGC TGCGCGGGCG CAATGAAGGC
GTCCTGTCGC TGGTTCTGCG GGTGGCCAAG GACGAGGAAG CCGGCCACCG GTTCGCCGAA
CACCGCATCG CCAGCGCTTT CGCCATCGCC GACCGCGGAC GTCCGGCGGA CTCCTGGTTG
TTGGAAACCG TGCGCCTGGC CTGGAGGGTG AAGCTTCTCA CCCGCATCGA ACTCGATCTG
ATGCAGCGCC TGCGGGAGAC GGCGGAGGCC GAGGCCATCC GGGTGTTCGC CAGCAACCTG
AAGGATTTGC TGCTGGCCGC GCCGGCCGGG CAGCGCGCCA CCCTGGGGTT GGATCCGGGG
TTGCGCACCG GGGTCAAGGT CGCCGTGGTC GATGGCACCG GCAAACTGGT GGCCACCGAT
ACGATCTATC CGCATGCCCC CAGGAACCAG TGGGATCAGT CCATCGCCAC GCTGGCCGCG
CTGATCGCGC GCCACGGCGT CAGCCTGGTC AGCATTGGCA ACGGCACCGC TTCGCGCGAG
ACCGACCAAT TGGTGGCCGA TCTGATGCAG CGCCATCCGG AACTGGGCGT GACCCGCGTG
GTGGTGTCGG AGGCCGGGGC CTCGGTGTAT TCGGCCTCGG AGCTGGCAGC GCAGGAATTT
CCCGAGCTGG ACGTATCGTT GCGCGGCGCG GTGTCCATCG CCCGCCGCCT ACAGGATCCG
CTGGCGGAGC TGGTCAAGAT CGATCCCAAG TCGATAGGCG TCGGCCAGTA CCAGCACGAC
GTCAACCAGG CCCAACTCGG CCGTACCCTC GATGCGGTGG TGGAGGACTG CGTGAACGCG
GTGGGGGTGG ACGTCAACAC CGCTTCGGTC TCGCTGCTGC GTTACGTCTC CGGCCTGTCG
CCGAGTCTGG GCCGGAATAT CGTGGAATAC CGCAACCAGC ATGGGCCGTT CGCCAATCGG
GAGCAGCTGA GGCAGGTCTC GCGGCTCGGG CCGAAGGCCT TCGAGCAGGC CGCAGGCTTC
CTGCGCATCG CCAATGGCGA ACATCCGTTG GATGCGTCCG CCGTGCATCC GGAAGCCTAT
CCGGTGGTGG AGAAGATCGT CCGGCGCACT GGCAAGGATA TCCGTGAGCT GATCGGCAAC
GTGGCCTTCC TGCGCAGCGT GAATGCCGAG CAGTTCACCG ACGAGCGCTT CGGCCTGCCC
ACCGTCACCG ACATCCTGCG CGAACTGGAG AAGCCCGGTC GTGATCCGCG GCCGGAGTTC
AAGACCGCCC GCTTCAAGGA GGGGGTGACC GAGCCGAAGG ACCTCGAGCC GGGCATGCGG
CTGGAAGGGG TGGTGACCAA CGTCACCAAT TTCGGCGCCT TCGTCGATGT CGGCGTGCAT
CAGGACGGTC TGGTCCACAT CTCCCACCTG GCGGACAGGT TCGTGCGCGA CCCGCGCGAA
GTGGTCAAGG CCGGGGATGT GGTCCAAGTG AAGGTGCTGG AGGTCGATAT CCCGCGCAGG
CGGATCGCCC TGTCGATGCG CAGCGACGCG GCGAAGGGCG AAGCGGAGCC GCCGCGCCGG
GAAACGGGAG AGACGGCCGG CAAGCCGCGC CGCAGAGCCG CTCCCCAGCC GGTACGGCAA
GGCGCGATGG CCGAGGCGTT GGCGCAGGCC TTGAAGCGCG GGCGCTGA
 
Protein sequence
MPFIARYRKE VTGGLDDTQL RTLEVRLGYL RELNDRREVI LASIAEQGKL TTELEQAIRE 
ADTKTGLEDL YLPYRPRRRT KAQIAREAGL EPLADALLAE PALEPLATAA GYVDAGKGVA
DAGAALDGAR QILMERFAEE ATLAGELREK VWSEGLLVSS LVDGKAEGGA KFKDYFDYEE
ALAGIPSHRA LALLRGRNEG VLSLVLRVAK DEEAGHRFAE HRIASAFAIA DRGRPADSWL
LETVRLAWRV KLLTRIELDL MQRLRETAEA EAIRVFASNL KDLLLAAPAG QRATLGLDPG
LRTGVKVAVV DGTGKLVATD TIYPHAPRNQ WDQSIATLAA LIARHGVSLV SIGNGTASRE
TDQLVADLMQ RHPELGVTRV VVSEAGASVY SASELAAQEF PELDVSLRGA VSIARRLQDP
LAELVKIDPK SIGVGQYQHD VNQAQLGRTL DAVVEDCVNA VGVDVNTASV SLLRYVSGLS
PSLGRNIVEY RNQHGPFANR EQLRQVSRLG PKAFEQAAGF LRIANGEHPL DASAVHPEAY
PVVEKIVRRT GKDIRELIGN VAFLRSVNAE QFTDERFGLP TVTDILRELE KPGRDPRPEF
KTARFKEGVT EPKDLEPGMR LEGVVTNVTN FGAFVDVGVH QDGLVHISHL ADRFVRDPRE
VVKAGDVVQV KVLEVDIPRR RIALSMRSDA AKGEAEPPRR ETGETAGKPR RRAAPQPVRQ
GAMAEALAQA LKRGR
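
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence under translation table 11. The sketch below builds the codon table from the standard NCBI ordering of bases (TCAG) and treats an initial GTG as methionine, as table 11 permits for start codons. The `translate` helper and its handling of whitespace are assumptions made for illustration, not the pipeline used to produce this record.

```python
from itertools import product

# Codon table for NCBI translation table 11 (same amino acid assignments
# as the standard code), listed in the conventional TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "GTGCCTTTCA TCGCGCGCTA CCGCAAGGAA GTGACCGGGG GTCTGGATGA TACCCAATTG"
print(translate(first_row))  # MPFIARYRKEVTGGLDDTQL
```

The output matches the first 20 residues of the protein sequence above; note that the internal GTG codon (residue 11) is read as valine, while the initial GTG is read as methionine.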