Gene Information |
Locus tag | TM1040_2845 |
Symbol | |
ID | 4076664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3012733 |
End bp | 3015666 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638008174 |
Product | hypothetical protein |
Protein accession | YP_614839 |
Protein GI | 99082685 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3893] Inactivated superfamily I helicase |
TIGRFAM ID | [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type |
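
The length fields in the record above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which matches the reported gene length, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3012733, 3015666)
print(length)                  # 2934
print(protein_length(length))  # 977
```

Both values agree with the Gene Length and Protein Length fields of this record.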
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.786725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGATC CGTCTCAAAC CAAACCCAGG CTGTTTGCGG TCCCTTGTGG TGTAGATTTC CCACGCGCAC TTTATGACGG ACTCACTGCC CGTTTTGCCG ATGCCCTCCC GGAAGAGTTG GCACGGGTCG AGTTGATCCT CAACACGGAA CGCATGCAAC GCCGCGTGCG GCAGTTGTTT GACGCAGGAC CAGCGCGTCT CTTGCCTCGC ATCTCGCTCC TGTCGGGATT GAACAAAGAA GCAAATCTAC GTGGCCTGCC ACCTGCACTG CCTCCGCTGC GCCGCCGTCT GGAGCTTTCG CAGTTGATCG CAAAGCTACT GGATGCGCAG CCCGATTTGG CTGCTCGCGC CTCGCTTTAC GATCTTTCCG ACAGCCTGGC CGAATTGATC GACGAGATGC AGAGTGAAGG CGTGAGCACC GATCAGATCC GCGCGCTTGA TGTGTCTGAT ATGTCCGGGC ACTGGAAACG GGCGCAGGAC TTCATCGGCA TCGCGGATCA ATTTGTGGAC ATGCACGAAG GTGCGCTCGA CGTGAATGCC CGGCAGCGGC AGGTTGTGAT GGATCTGATC GCAGAATGGG AACAGACGCC ACCAACGCAT CCCATCATTT TGGCAGGCTC AACCGGGTCT CGGGGCACGA CGCTTCTTCT GATGGAGGCG ATTGCTCGCC TACCACAGGG CGCGGTCGTC TTGCCGGGAT TTGATTTTGA CCAGCCAGAA CCCGTCTGGG AGAGCCTGAC AGATGGGTTG GTTGCGGAAG ATCATCCGCA ATACCGGTTT CACAAACTGA TGCGCGACCT TGATCTACGC CCCTCGGATA TTTGCCGCTG GGTTGATACA ACGCCCCAAT CTCCTGCTCG CAATCGGTTG ATTTCTCTCG CACTTCGCCC AGCTCCGGTG ACGGATGCCT GGATGAGTGA AGGCCCCTAC CTCAAAGATC TGGACCAAGC CACAGAGGCG CTGACGCTGG TGGAGGCCGC AAATCCCCGA AGCGAAGCCT TGGCTATCGC TTTGCGCCTG CGTCAGGCAG CCGAGGACGG TCAGACCGCA GCCCTGATCA CGCCCGACCG CATGCTGACA CGCCAGGTCT CTGCCGCGCT GGATCGCTGG GACATTCTGC CAGACGATTC TGCGGGATTG CCGCTTCAAT TGTCGCCCCC TGGACGGTTT CTGCGACATG TGGCCGATCT GTTTTGTCGC CCTCTTCAGG CCGATATGTT GCTCACGCTT CTCAAGCATC CGCTCACCCA TTCCGGCGCG GAGCGAGGTC TGCATTTGCT GCACACCCGT GATCTGGAGC TCCATATGCG CCGCAACGGA CCGCCCTTCC CGGACTGCGA GAGCCTCAGT GCCTTTGGGA GCACGCGCGA TTTGCAGCCG GGTTGGTCAG ATTGGCTAGC GCAGAGCTTT GCCGGTCGAG ATCGGACGGG CACGCGCGCC CTGAGTGACT GGGTGTCAGA TTTGCGTGAG ACCGCCGAAG GGATCGCAGC AGGATCGCAG GTCGGCGGCA CGGGCGAGCT CTGGGACAAA AAAGCAGGGA ACGCAGCGCG CGACGTTCTT GAGGAATTGC AGGCACAATC ACCTCACGGT GGCGACATGA CCGCGCGCGA TTTTGCAGAC CTCCTGGGCG CACTCTTGTC GCAGGGCGAG GTGCGGGATC GCGATGCGCC ATATGGATCC ATTATGATCT GGGGCACGCT CGAGGCCCGC GTGCAAGGCG CCGATCTTGT TATCCTCGGT GGTCTCAACG AGGGGAGCTG GCCAGAAGCC GCGCGTCCTG ATCCCTGGCT CAATCGTAAA 
TTGCGCCACG AGGCAGGCCT TTTGCTGCCA GAGCGGCGCA TTGGCCTATC CGCACATGAT TTTCAGCAGG CTGTCGCAGC GCCCGAAGTC TGGCTCACGC GTGCTGTCCG CTCCGAAGAG GCCGACACGG TCCCGTCGCG CTGGCTCAAC CGTATGACTA ACCTTCTGGG AGGTCTACCG GATCAGGGCG GACAAGCCGC GCTGTCGGCC ATGCGAATGC GCGGCCAGGT CTGGCTGGAT TGGGCAAGCG TTCTTGATGC GCCAGTGCCA ACCCCTCTGA GCCCACGCCC CTCGCCGCGC CCGCCAGTTG CAGCACGGCC CCGTAGATTG ACGGTGACAG AAATTCCAAA ACTGATCCGT GATCCTTATG CGATCTACGC CAAACATGTA CTGCGCCTGA AACCCGTGGA TCCGCTGCTG CAGGAACCTG ATGCACTGCT GCGCGGCACA ATCATCCACA AGGTTTTGGA AGACTTTATA AAATCCGCTC AAGAGCAGCC GGAAAATCTC TCGGCCCGTC ACTTCATCGA CCACGCTCGA CGCGTCCTCG AAACCGAGGT GCCCTGGCCC GTTGCGCGCA CGCTTTGGCT AACCCGCCTA AAGAAAGTGG CCGAAGATTT TGTCACCGGT GAACGACAAC GTCAGTCGCG GGCGCGCCCA TCAGGGTTTG AGAAATCTGG CCAGGTCCGT CTTGATCCAC TTGATTTCGA AATCGCGGCC AAGGCGGACC GGATCGATGT GGACGAGCGT GGCTTGCTCC ATCTGTATGA TTACAAGACG GGCGACCCGC CTTCGGAAAA ACAGCAGAAA TCCTTTGAAA AACAGCTTCT GATCGAGACC GCCATGGCCG AACAGGGCGC GTTTTCAGAC TATGGTGCGG CGCGGGTGGA GCGTGCGCTC TATATCGGGT TAAAGCCTCC GGTGAAGGAG GTTGCCGCAC CCATTCTGGA CGAGCCACCC GCAAAGGTGT GGGCGGAGTT GCGCAGTCTG GTTGAGGCGT ATTTCGACGC AGAGCAAGGC TTTAGCAGCC GCCGAATGGT ACACCGGGAT GATTTTGCCG GGGATTACGA TCACCTTGCC CGCTATGGCG AATGGGATCG CAGTTCGGAT CCGGTGCCGG AGGATTTGAC ATGA
|
Protein sequence | MFDPSQTKPR LFAVPCGVDF PRALYDGLTA RFADALPEEL ARVELILNTE RMQRRVRQLF DAGPARLLPR ISLLSGLNKE ANLRGLPPAL PPLRRRLELS QLIAKLLDAQ PDLAARASLY DLSDSLAELI DEMQSEGVST DQIRALDVSD MSGHWKRAQD FIGIADQFVD MHEGALDVNA RQRQVVMDLI AEWEQTPPTH PIILAGSTGS RGTTLLLMEA IARLPQGAVV LPGFDFDQPE PVWESLTDGL VAEDHPQYRF HKLMRDLDLR PSDICRWVDT TPQSPARNRL ISLALRPAPV TDAWMSEGPY LKDLDQATEA LTLVEAANPR SEALAIALRL RQAAEDGQTA ALITPDRMLT RQVSAALDRW DILPDDSAGL PLQLSPPGRF LRHVADLFCR PLQADMLLTL LKHPLTHSGA ERGLHLLHTR DLELHMRRNG PPFPDCESLS AFGSTRDLQP GWSDWLAQSF AGRDRTGTRA LSDWVSDLRE TAEGIAAGSQ VGGTGELWDK KAGNAARDVL EELQAQSPHG GDMTARDFAD LLGALLSQGE VRDRDAPYGS IMIWGTLEAR VQGADLVILG GLNEGSWPEA ARPDPWLNRK LRHEAGLLLP ERRIGLSAHD FQQAVAAPEV WLTRAVRSEE ADTVPSRWLN RMTNLLGGLP DQGGQAALSA MRMRGQVWLD WASVLDAPVP TPLSPRPSPR PPVAARPRRL TVTEIPKLIR DPYAIYAKHV LRLKPVDPLL QEPDALLRGT IIHKVLEDFI KSAQEQPENL SARHFIDHAR RVLETEVPWP VARTLWLTRL KKVAEDFVTG ERQRQSRARP SGFEKSGQVR LDPLDFEIAA KADRIDVDER GLLHLYDYKT GDPPSEKQQK SFEKQLLIET AMAEQGAFSD YGAARVERAL YIGLKPPVKE VAAPILDEPP AKVWAELRSL VEAYFDAEQG FSSRRMVHRD DFAGDYDHLA RYGEWDRSSD PVPEDLT
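
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be checked against the record. It assumes the gene sequence is given 5' to 3' on the coding strand and starts with ATG. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, so the standard codon table is used here.

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGTTTGATC CGTCTCAAAC CAAACCCAGG"))  # MFDPSQTKPR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields of the record.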