Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3494 |
Symbol | |
ID | 4075173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 527798 |
End bp | 531232 |
Gene Length | 3435 bp |
Protein Length | 1144 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638005009 |
Product | hypothetical protein |
Protein accession | YP_611728 |
Protein GI | 99078470 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.364107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCA AACGTCTGTC ACTGGATCGG TTTGGCCATT TCACCGATCA ACAGTTCGAT TTTGGCTCTG CCCACGATGG GCATGACTTT CATATCATCT ACGGGCGCAA CGAAGCCGGT AAGACCACCA CGATGGAAGC CGTCCTGCGG CTGTTTTACG GCTTTCCAAC GCGCGAGGCC TACGCGTTTC GCCATCCGCG CAACAATCTA CAGATCTCTG CGACGCTCGA TTTCAATGGT GAGCTGCGCC AGTTCACCCG CCTGCCCACG CGCAGTGGCG CGCTGGTGGA TGAGAGCGGC ACCCCACTGC CGGAGGCTGC GCTTTCGGCG CATCTCGCCG GGCTTTCGGA ACCGGATTAC CGCCGTCTTC TGTGTCTGGA TGACGAGACG ATTGAACGCG GCGGCGAAGA GATCGCCAAT GCTCAGGGCG ATATTGGCCG GCTGCTGTTC TCGGCCGCGG CGGGGGTTGC GGATCTGAGC CAGGTGCTTG ATGGCGTGCG CAGCCGGGCC GATGAGATTT GGAAGAAACG CGCCCGCAAC ACGCGGATGC GCGAGCTGAA ACGCGCGCTT GAGGAGCTCG ACAAGGAGAT CAAGGCCCGC GATGTCTCTG CAAACGCGTG GAAATCCCTC AAGCGGGATC TCACCAAGGC GCAACAGGCT GAAGAAGACG CGCGGACGCG TCGCGATGAT CTCAATACAA CCCGGGCCCG GACCGAGGCC GAACGGCGCG CCGTGCCGCT TCTTGCAGAA CTTCAGGAGT TGGAACAGGC GCTTGCGCCC TTTGCGGATT TTCCCGCACA GCTGGATTTC AACCCCGAGC GGCTCGTCGA GTTGCGCAGC GATTTAGGCA GCGCCACGCA GAACATCGCG CGCCTCAGCG ACGAACTTCA GACCCTTGAG GAGGAGCAAG CGGCGCTTGT ATTGGATCCC GCACTTGAGG CGCTTCCTGC GGCACTCGAG GCGCTTGAGG ATCTGAGTGC CCGCCACCGC ACCGCTGCGC TTGATCTGGA TCGGCGCAAG GCCGAGCAGC AGGAGGCGCT CGAGACAATG CGGCAGGCCG CCCGCGACCT TGGTCTCGTC GCAGAGAACG CCGACCTTGC GGCCTTGGTT CTTTCCGGCG CTGATATCAC CGCGCTTCAG GACGCAAGAG AGCAACTTCG CGCGGCCATC ACCCACGAAG GTCTAGAAGC GCGCGAGGTC GAGGATCTCG AGGAGCGCCT GCGCGTAGCG CAGGATGCTG TGCACGATTG TGCGCCGCTC GCTGCAGAGG CCGGGCAGTT GCAAGAGCTG CTCTTGAATT TTGATGCAGA GCGGCTGGCG ACGGCCCATG CAGCGGCAGC GGAGGCGATC ACAGCCGCCC AATCCCGGGC CGCCCGCAGC CTGGCAGCGC TGTCTGTGCG CCACGTGACT TTTGATACCC TGCCGCATTG CCCGAGCAGC TTGCGTCAGG CGCAGGAGCA CGCGGATCGT GACGCAGCGC TGGTTCAGGA CCTGCGCCGG GCGACTGAGA CGCGCAATCA GCATAAGGAG GACGCCAGCG CCCGCGCCGC GCAGGCAAAG GCCCTTGAAC GCGCTGCAAA TGTGGTCTCC GACAGTGAGG CCGAGGCCTT GCGCACCAAA CGCGACGCAC TCTGGGCCGT GCATGTAGAC AGTTTGGATG CCGAGAGCGC CGCCACGTTC CACGAAGCGC TGAGCCGCCA CGACACCGCT GCTGACGCGC GACTGACCCA ATCGCGAGAA CTGGCGCAGC TGCGAGAGAT TGCGCAATCA GAGGCTGCGT CGAAGGCCCG CGCCGCAGAG GCCGCCACCC GGATCACTGC GCTAGAAGAA CAGCGCGATG CCATTGCGGC AGAGGTCAAC GCCGCTGCCA CCCAGCTTGG CCTTGCGCCG CTTTCCCCGG CAGAGTGGCG CGATTGGGTT GAGCGTCATG AAACCGCGCG CGCGGATGCC GAGGCGCTTC GGGACACCAA GGACAGCCAT TCGGCCACCT TTGCGCGGGC GCAGGCGTTG ATGGACGCCC TTGCAGAACG TGTACCGTCT CTTCCGGCTG ATCTTGATCC GGCCCTTGCA CAGGCGCGCC GGATGGCAGA GGCGGCGCGC CAGACCTCCG AGGCGCGCGG CGCCGCAGAA AAGCTCTTGC GGCAGGTGGA GCGCGACCTG AATCGGCGAC AGGATCGCCA CAACGCCGCA CAAGAGGCCA AGAAAAAGGC CGAGGACACA TGGCGGGCCT TGGTGCAGGA ACTTCTGGCA GGGCAGGTCT CGCCAGAGTC GCTTATGGCC TCTTTGGAGC CACTGCGGGT GCTACGCGAG CATGACAAGA CCCGCAGCGC CGCCGCACGT CGGGTCCGGA TGATGGAGGC GGATCAGGCG CTTTTTGCTC AAGAGGTCAA CGCCTTGGCA CAGCTGCACG GGGTCGGGGT CAGTTCGGAC CCGGGCCAGA CCTATGGCGA CCTCAAATCC CTGGCCGAGA CGGCCCGCAT CGCGCGCGAC AAGGCCGCGC GTCTGGAGCG TGCAATTGCA ACGGCCACGG CAGAACGTGC GGACAACACC GCGCGGGTTG AGGCCATAGA TCAGGAGCTC CGGGCGATCG CTGCGAGCTT TCCCGAGCCC CCACCTGCAC AGGACATCGA CACGTTGCGC CAATGCGCCG CGCAGGCCCA GAAGGTCATT GCCGACCGCG CGGCCCGCGA CCGCCTGCGC CGTCAGATCC TCTCCGAGCT CGACCTGTCG GATCTGGAGT CGGCGCGCGC GCAGCTTGCC GAGACCTCGG TTGCCACGCT CAACGCGCGC CTGGAAAGCA CCCTCTCCGA TCTGACCCAC GCCGAAGACG ACCTCACGCA GGCGATCCAA CAGCGGGTCA ACGCAGAACA TGCGCTGGCA GAGATCAACG GCGACCGCAC GGCAGCGGCG CTGGTCGAGC AAAAAGCAAC GCTGGAACTG CAGCTCGAAG AGGCCGCGCT CGAACATCTG GAGCTGTCTT TGGGGCATCA CCTGGCCTCA GACGCAATCA AGCGCTACCG CGACAGCCAC CGCAGCGGCA TGCTGACCGC CACGGAACAG TGTTTTGCCG ATCTGACGCA GGGCGCCTAT CCCGCCCTCA GCACACAGAT CTACGGCGAC AGCGAGGTCC TTTTGGCGGT CGACAGAACC GGTGCCAGCA AACGGGCCGA CGAGATGTCC AAAGGCACCC GGTTCCAGCT CTATCTGGCG CTCCGGGCCG CAGCGCATGA ACAACTGGTG GCGCAGGGCA CCATATTGCC GTTCTTCTGC GATGACATCT TTGAGACCTT TGACGAGACC CGAACCAGTG CGGCCTGTCA GGTGATGGAA GCCATCGGCA CGCGAGGACA GGCGATCTAT CTCACCCATC ATCGCCATGT GGTTGAAATC GCCCAAAGCG TCTGTGCCAC CCCGCCAATC ATTCACGAGC TCTGA
|
Protein sequence | MRLKRLSLDR FGHFTDQQFD FGSAHDGHDF HIIYGRNEAG KTTTMEAVLR LFYGFPTREA YAFRHPRNNL QISATLDFNG ELRQFTRLPT RSGALVDESG TPLPEAALSA HLAGLSEPDY RRLLCLDDET IERGGEEIAN AQGDIGRLLF SAAAGVADLS QVLDGVRSRA DEIWKKRARN TRMRELKRAL EELDKEIKAR DVSANAWKSL KRDLTKAQQA EEDARTRRDD LNTTRARTEA ERRAVPLLAE LQELEQALAP FADFPAQLDF NPERLVELRS DLGSATQNIA RLSDELQTLE EEQAALVLDP ALEALPAALE ALEDLSARHR TAALDLDRRK AEQQEALETM RQAARDLGLV AENADLAALV LSGADITALQ DAREQLRAAI THEGLEAREV EDLEERLRVA QDAVHDCAPL AAEAGQLQEL LLNFDAERLA TAHAAAAEAI TAAQSRAARS LAALSVRHVT FDTLPHCPSS LRQAQEHADR DAALVQDLRR ATETRNQHKE DASARAAQAK ALERAANVVS DSEAEALRTK RDALWAVHVD SLDAESAATF HEALSRHDTA ADARLTQSRE LAQLREIAQS EAASKARAAE AATRITALEE QRDAIAAEVN AAATQLGLAP LSPAEWRDWV ERHETARADA EALRDTKDSH SATFARAQAL MDALAERVPS LPADLDPALA QARRMAEAAR QTSEARGAAE KLLRQVERDL NRRQDRHNAA QEAKKKAEDT WRALVQELLA GQVSPESLMA SLEPLRVLRE HDKTRSAAAR RVRMMEADQA LFAQEVNALA QLHGVGVSSD PGQTYGDLKS LAETARIARD KAARLERAIA TATAERADNT ARVEAIDQEL RAIAASFPEP PPAQDIDTLR QCAAQAQKVI ADRAARDRLR RQILSELDLS DLESARAQLA ETSVATLNAR LESTLSDLTH AEDDLTQAIQ QRVNAEHALA EINGDRTAAA LVEQKATLEL QLEEAALEHL ELSLGHHLAS DAIKRYRDSH RSGMLTATEQ CFADLTQGAY PALSTQIYGD SEVLLAVDRT GASKRADEMS KGTRFQLYLA LRAAAHEQLV AQGTILPFFC DDIFETFDET RTSAACQVME AIGTRGQAIY LTHHRHVVEI AQSVCATPPI IHEL
|
| |