Gene Information |
Locus tag | Dshi_1449 |
Symbol | |
ID | 5712626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1506778 |
End bp | 1509000 |
Gene Length | 2223 bp |
Protein Length | 740 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267362 |
Product | TonB-dependent receptor |
Protein accession | YP_001532792 |
Protein GI | 159043998 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
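
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows for Dshi_1449.
length = gene_length_bp(1506778, 1509000)
print(length, protein_length_aa(length))  # 2223 740
```

The strand (here `-`) does not affect this calculation, since the length depends only on the span of the coordinates.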
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.389596 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCA CAAGACCCCT GACCGCCACC GCGGTCGCCG CCCTGATGCC AACCCTCGCC CTCGGCCAAT CAGCCACGCC GGGCACCACC ATCGAACTGG AACCGATCGT CATCGACGGC ACCAGTGGCG CCCTGGCCAC CGCTGAGGAC CGCCAGCGCG CCACCCCGGG CGGGACCGAC CTGCTGCGAG GCGACAGTTA TCGCGACAGC GCGACCGTGA CCCTGTCCGA TGTGCTGGAC GGGGCGCCGG GTGTGGTGGT GCAGGACTTC TTCGGCGGGT TCGATCAACC CCGGGTACAA ATCCGCGGAT CGGGCCTGCA ACAGAACCCG GTCGAACGCG GTGTGCTGTT CCTGCAGGAC GGCCTTCCGC TCAACCAGGC GGATGGCTCC TATATCGTGG GCCTGTCCAA TCCGAGGGCC GCCGAGTTCG TCGAGGTCTA TCGCGGCTAC ACCGCGAACC GGCTCGGGGC GACGGTGCTG GGCGGGGCGC TGAACTTCGT CTCGCCCACC GGCTCCTCCG CCCCGGGCCT GAGCTTTGGC CTGACCGGGG GCAGTTTCGG CTATCTCGAA GGCGAGGCGA TCGCCGGATG GCAGGGCGAC GGGTACGATG CCCATCTGCG CTTCGAGGCC CTGACCCGCG ACGGCTTTCG GGACGACAAC AACGACTCCG AGCGGCAGGC CTTCAACGCA AACCTCGGGA TCGAGCTGAC CGACCAGATT TCCGCCCGGG TCTTCGCGGG CGTCACCGAC CTCTCCTTCG GCGTGCCCGG ACCCATCACC GCCGCGGCAC TGGACCGGGA CCCCGCCTCC GTCCATGCCG GGCCGGTCTT CACGCCGGGG ACGCCGCCGA GCGTCGCCAA TCCGGGGCCG AACGTGCCGC GGGACGACCC GGGGCGCGAA GCGACCCAGG CGCGCATCGG GGCCCGGATC ACCGGCGAGT TCGGCGCGAG CGTTGCGGAT CTCGCCTTCG GCTATGCCCG CACCGAAGAC AGCTTCCGCT TCCCTGTGTC CTCGGGGGTC CGCGAAACGG ACGGGGACGA TTTCACCTTC GTAGCACGCT ATGCCTATCG CCCCGACCCT GAGGCGGCCC TGCCGCTCTT CGAGGCCACC GCCTCCTACA TCACCGGCAC GGCGGACCGC GACTATTTCC TCAACGAGGC GGGCACGCGG GGCGCACGGT TCGGCCGCAA CCGTCTTTCG GCCGACACCC TGACGGTCTC GGCCATCGCC AATATCCCCC TTTCGGACCG GCTGGTCCTG TCCCCGGGCA TCGCGTGGTC CCATGCGACA CGGGACAATG ACGACCGGTT TGGCGCCGCC ACACGGCCCA CGCTGGCCTT CAACCCGGCG ATGCCCGATA TGGCCCTGCC CCCGGGCGTC GTCCCTTTCG AGGACACGAG CTATTCCCGC ACCTACGAGG GCTGGAGCCC CTCGCTCGCC CTCAGCTACC AGGTGAACGA CCGCAACCTC GTGTTCGGCG CGATCAGCCG GTCCTTCGAG CCGCCGACCC ATAACGACCT GCTCGCGACC ATCAACGGCA CCCCGAACAG CAGTGCGGGC CGCCCGCAAC CACCGAACCC GGCCTTTCCC GCCTCCGCCT TCGCCACTCC GGACCTGGAG GCGCAGACCG CGACCACCGT CGAGATCGGC TGGCGCGGCC AGGTCGGCGC GTTCGATGTC GATGCGGTGG TCTATCACGC CGCGATCGAA AACGAGCTGC TCTCGCTCCG CGACGTCACC GGGGCCTCGC TCGGCGCCGT CAATGCCGGA GAGACCACCC ACAAAGGGGT CGAGCTCGGG 
GTCTCCGCCG TGTTCGGCGA TGTCGCTGCC CGGCTCGCCT ACACCTATCA AGACTTCCGC TTCGACGACG ATCCCGTGCG GGGCAATAAC CGCCTCGCGG GCGCCACGCC CCATGTGATC GACCTCGCGC TTGATTGGGC CGCAACCGAC CGGCTCAACC TCGGGGGACG GCTTTACTGG CGGCCGGTCA AGACCCCGGT GGACAATCTC AATACGCTCC ACGCGGATCC GTTCGCGACG CTGGACCTCA ATATGCGCTA TGCCGTGACC GAAACCACCA CAGCGTTCTT CGAGATCCGC AACGCGACCG ACGAACGCTA TGCGGCCTCA ACACTGATCG TCGACCAGGC GCGCAACGAT CAGGCCGCCT TCATTCCCGG GGACGGCCGA TCCTTCTATA TCGGTCTGCG TTCAACATTC TGA
|
Protein sequence | MPFTRPLTAT AVAALMPTLA LGQSATPGTT IELEPIVIDG TSGALATAED RQRATPGGTD LLRGDSYRDS ATVTLSDVLD GAPGVVVQDF FGGFDQPRVQ IRGSGLQQNP VERGVLFLQD GLPLNQADGS YIVGLSNPRA AEFVEVYRGY TANRLGATVL GGALNFVSPT GSSAPGLSFG LTGGSFGYLE GEAIAGWQGD GYDAHLRFEA LTRDGFRDDN NDSERQAFNA NLGIELTDQI SARVFAGVTD LSFGVPGPIT AAALDRDPAS VHAGPVFTPG TPPSVANPGP NVPRDDPGRE ATQARIGARI TGEFGASVAD LAFGYARTED SFRFPVSSGV RETDGDDFTF VARYAYRPDP EAALPLFEAT ASYITGTADR DYFLNEAGTR GARFGRNRLS ADTLTVSAIA NIPLSDRLVL SPGIAWSHAT RDNDDRFGAA TRPTLAFNPA MPDMALPPGV VPFEDTSYSR TYEGWSPSLA LSYQVNDRNL VFGAISRSFE PPTHNDLLAT INGTPNSSAG RPQPPNPAFP ASAFATPDLE AQTATTVEIG WRGQVGAFDV DAVVYHAAIE NELLSLRDVT GASLGAVNAG ETTHKGVELG VSAVFGDVAA RLAYTYQDFR FDDDPVRGNN RLAGATPHVI DLALDWAATD RLNLGGRLYW RPVKTPVDNL NTLHADPFAT LDLNMRYAVT ETTTAFFEIR NATDERYAAS TLIVDQARND QAAFIPGDGR SFYIGLRSTF
|
| |
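
The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be cleaned, its GC fraction computed, and its start translated with genetic code 11 is given below. It uses only the first 30 bases of the gene sequence as input; the helper names are hypothetical. The codon table is built from the NCBI amino-acid string for table 11, and the sketch does not handle alternative start codons (for example GTG or TTG), which table 11 reads as methionine at initiation.

```python
from itertools import product

# Amino-acid assignments of NCBI genetic code 11, with codons enumerated
# over the bases T, C, A, G (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field.
fragment = clean_sequence("ATGCCGTTCA CAAGACCCCT GACCGCCACC")
print(len(fragment), translate(fragment))  # 30 MPFTRPLTAT
```

The translated fragment matches the first block of the Protein sequence field. Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content row, although that has not been computed here.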