Gene TM1040_0703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0703 
Symbol 
ID4077370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp752089 
End bp754533 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content60% 
IMG OID638006000 
ProductRNA binding S1 
Protein accessionYP_612698 
Protein GI99080544 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.509283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.104907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCGGA CCGCTAAGAC GGTACTCCGG CCTCAGTGGT TGCGCATCAT CGTGGCCGAC 
GCCAAAGCAA AGGAGCAGCA GGTTTTGGAT ACGACCCTTC GAATTGCGCG GAAGATTTCC
GAAGAAATCG GATCATCGGT ACAGCAGGTG AACGCAGCCG TGGGGCTCTT GGACGAGGGG
GCGACTGTAC CTTTTGTCGC GCGCTACCGC AAAGAGGTGA CCGGGGGCCT TGATGACACG
CAATTGCGGA CCTTGTCGGA GCGGCTGGAA TACTTGCGCG AGCTGGAAAA GCGACGGGCC
GCGATCCTGG AGACGATTAC GCAGCAAGAC AAGTTGACCG ATGATCTGGC GGCATCGATT
GCCAAGGCGG AGACCAAGGC GCAGCTCGAA GACATCTATC TCCCCTTCAA ACCCAAGCGT
CGCACCAAGG CGATGATCGC GCGCGAAAAC GGGCTTGAGC CTCTTCTGGA GGCTATTCTC
GCGGATCGCT CTGCTGATCC CGAGGTTTTG GCCGCGAACT ACCTCTCCGG GGAAGTGGCA
GACACAAAGA CCGCTTTGAA CGGTGCACGT GATATTCTCA CCGAGCGTCT CACCGAGAAT
GCCGAGCTTT TGGGGCGGCT ACGTAATTTC ATGCAGTCTG AAGCCATTTT GCGATCCAAA
GTGGTCGAAG GACAGGAGCA GGCGGGTGCC AAATTCTCGG ATTACTTCGA TCATTCCGAG
GCTTGGAAAA CCACCCCGTC GCATCGGGCT CTGGCGATGC TGCGCGGCTC CAACGAGGGC
GTGCTGACGC TCGATATTGC CCCACCGGGC GAAGAGGGGG TCACGCGAGC CGAAGAAATG
GTCGCCGCTG TTCTGGAGGT GCGAAGCTCG GCTCCGGGCG ACAAGTGGCT GCGCAAAGCG
GCTGGTTGGA CCTGGCGGGT CAAACTGTCG CTGTCGATGA TGCTGGAGCT GATGGGCGAT
CTGCGCAGCC GGGCGCAGGA CGAAGCTATT TCGGTTTTTG CGCGCAATCT CAAGGATTTG
CTCTTCGCTG CCCCAGCCGG TGCGCGGCCG ACCCTCGGGC TGGACCCGGG CATTCGCACC
GGCGTCAAGG CCGCAGTGGT TGATGCGACC GGGAAATTGG TGGCCACAGA GACCCTCTAT
CCGTTCCAGC CGAAAAACGA TCTGCGTGGT GCGCAGGTCT CCATTCTGAA GCTCATCGCC
GAGCACGGGG TTGAGCTGAT CGCGATCGGC AATGGTACGG CCAGCCGCGA GACCGAGCGC
ATGGTCGCCG AAGTTCTGAA AAACCTGCCC GCCAAGGTCA AAGCGCCGAC CAAGGTGGTG
GTTTCTGAGG CGGGCGCCTC GGTCTATTCC GCCTCCGAAC TGGCTGCGCG TGAATTCCCG
GATCTTGACG TCAGCTTGCG TGGTGCGGTT TCGATTGCGC GGCGCCTTCA GGATCCGCTA
GCGGAACTGG TCAAGATTGA GCCCAAGAGC ATTGGGGTCG GACAATATCA GCATGACGTG
GACCAGCATA AGCTTTCCAA GTCACTCGAA GCCGTGATCG AAGATGTGGT GAACGCGGTT
GGTGTTGATC TCAACATGGC GTCCGCACCG CTTTTGGCAC ATGTCTCGGG TCTTGGTCCC
GGCCTTGCCG AAGCCATCGT GGCGCATCGC GACCTCAACG GCGCTTTCAA GACCCGCAAA
GAACTCTTGA AGGTTGCGCG TCTTGGCCCC AAGGCGTTTG AGCAATGTGC AGGTTTCCTG
CGTATCCAAG GCGGCAAAGA GCCTCTGGAC GCCTCTGCGG TCCACCCCGA AAGCTATGAT
GTCGCGCGCA AGATCGTGAG CGCCTGCGGC CGCGATATTC GCGAGATCAT GGGGGACGCC
ACGGCGCTGA AATCCCTCCG GGCTGAGCAA TTCGTCTCTG GTGAGGTCGG CCTGCCCACC
GTACATGACA TCTTTGCGGA GCTGGAAAAA CCGGGGCGTG ATCCACGTCC CTCTTTCGTC
ACCGCAAGCT TTACCGATGG CGTGGAGGAG ATCACTGATC TCAAACCCGG TATGGTTCTT
GAGGGCACAG TGACCAATGT TGCGGCCTTC GGGGCCTTCG TGGACATTGG TGTGCACCAA
GACGGTCTTG TGCATGTGAG CCAGCTGGCC GACAAATTCG TCAAGGACCC GCATGAGATT
GTGAAAACCG GTCAGGTGGT CAAGGTCACG GTGGTCGAGG TGGACGTGCC GCGCAAACGC
ATCGGCCTGA CCATGAAAAA GGACGGTGGC GCATCTGCGC GAGAGGATCG CGCAGGCCGT
GGGCCCTCCC AGAATGCAGC GCGCGGTGGT GCCAAAGGGC GCAATTCCTC CTCACCCAAA
CGCCACGGCT CTGCCGCGCC CAGAGGCAAA TCCGCCTCGG ATAACGGCGC AACGGGCAAT
GGCGCATTGG GGGCCGCCTT GATGGATGCC TTCAAGAAGA AATAG
 
Protein sequence
MFRTAKTVLR PQWLRIIVAD AKAKEQQVLD TTLRIARKIS EEIGSSVQQV NAAVGLLDEG 
ATVPFVARYR KEVTGGLDDT QLRTLSERLE YLRELEKRRA AILETITQQD KLTDDLAASI
AKAETKAQLE DIYLPFKPKR RTKAMIAREN GLEPLLEAIL ADRSADPEVL AANYLSGEVA
DTKTALNGAR DILTERLTEN AELLGRLRNF MQSEAILRSK VVEGQEQAGA KFSDYFDHSE
AWKTTPSHRA LAMLRGSNEG VLTLDIAPPG EEGVTRAEEM VAAVLEVRSS APGDKWLRKA
AGWTWRVKLS LSMMLELMGD LRSRAQDEAI SVFARNLKDL LFAAPAGARP TLGLDPGIRT
GVKAAVVDAT GKLVATETLY PFQPKNDLRG AQVSILKLIA EHGVELIAIG NGTASRETER
MVAEVLKNLP AKVKAPTKVV VSEAGASVYS ASELAAREFP DLDVSLRGAV SIARRLQDPL
AELVKIEPKS IGVGQYQHDV DQHKLSKSLE AVIEDVVNAV GVDLNMASAP LLAHVSGLGP
GLAEAIVAHR DLNGAFKTRK ELLKVARLGP KAFEQCAGFL RIQGGKEPLD ASAVHPESYD
VARKIVSACG RDIREIMGDA TALKSLRAEQ FVSGEVGLPT VHDIFAELEK PGRDPRPSFV
TASFTDGVEE ITDLKPGMVL EGTVTNVAAF GAFVDIGVHQ DGLVHVSQLA DKFVKDPHEI
VKTGQVVKVT VVEVDVPRKR IGLTMKKDGG ASAREDRAGR GPSQNAARGG AKGRNSSSPK
RHGSAAPRGK SASDNGATGN GALGAALMDA FKKK