Gene Information |
Locus tag | Hoch_2028 |
Symbol | |
ID | 8544410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2796992 |
End bp | 2799874 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646386731 |
Product | TonB family protein |
Protein accession | YP_003266466 |
Protein GI | 262195257 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
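The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state its convention) and that the final codon is an untranslated stop codon.

```python
# Values copied from the record above.
start_bp = 2796992
end_bp = 2799874

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2883 960
```

Under these assumptions the result agrees with the Gene Length (2883 bp) and Protein Length (960 aa) fields.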
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.38309 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCTCCGAG CTTCTCATCG CCTGGTTCAG CCAGCGCTGC TGTGCATCGC GGCGCTGTGC CTGAGCGGCA TCGCCCCGCC CTCTGAGGCA CGCGCGCAGG CCGCGGCCGC GCCCGGCGTC GACCAGGTGT CCGACGACAC GGATGCAGGC GCGGGCGCAG GAGCTGGCGT GGCTCCGGCT CCGGCTCCGG CAGCATCCGC AGCCGACGCC GTGCAGCCGC CGGTGCTCGT GCACTTCGTC GAGGCCGAGT ACCCGGCCGC CGCGCGCAGC GAAGGCCGCG CCGGCAGCGT CGATCTGCTG CTCAGCATCG ACGCGCGAGG GCGCGTCACC GCCGCGGAAA TCGCCGCCTC GGCCGGCCCC GACTTCGACA GCGCCGCGCG CGCCGCGGCC TTGGCGTTTC GCTTCGAACC CGCGCGCCGC GGCGGCGTGG CCATGCCCGC GCGCATCCGC TACCGCTACG CCTTCGCGCT GCCCGAGGCC CCGGACGCAG ACGCAGCCTC GGCAGACGCG GACGCCACCG CCGACGCTGC AGTGCCCGCA GCGCCCCCGG CCCGTGCCGC CGCCAGCGGC GCGCTCGTCG GCAGCCTGCG CATCCCGGGC CGCGCGCGCG AGCCCGTCGC CGGCGCCAGC ATCACGCTCA CCGCCGCCGA CGGCAGCACG CGCAGCGCCA CCAGCGACGA GCGCGGCGGC TTCCGCATCG ACGCGCTCGC GCCCGGCGCC TACGATGTCC GCGCCGCGGC GCCCGGCTTC GAACCCGGCC TGCTGCGCAC CGAAATTGTC CCCGGACGCG CGGCCGAAAT CGCGCTCGTA CTCGCGCCCA CGGTCGGCGG CGCCGCCGAC CAGGTCCTCG AGGTCACCGT GCGCGGCAGC CGCGCGGGCG AGCGCTTGCA CAACTCGGCG CAAGCCGTCC GCGTGGTCGA AACCGAGCAA GCCCAGCAGC AGAGCGCCGA CCTCGGCGAG GTGCTGGCGC GCACCGAAGG CGTGGACGTA CGCCGCAGCG GCGGCCTGGG CTCCGACGCG CGGCTGTCGC TCGGCGGACT CAGCGGCGAC CAGATCCGCA CCTTCCTCGA CGGCATCCCG CTCGCGCTCG CCGGCTATCC CCTGGGCATC GCCAACGTGC CCGTCAACCA GGTCGAGCGC ATCGAGATCT ACCGCGGCGT CGTGCCCGTG CGCTTCGGCA CCGACGCCCT CGGCGGCGCC ATCAACCTGG TCAGCGACGG CGCCGAGCGC CGCGCCGATG GCGCCTCGGC CTCGTACCAG ACCGGCGCCT TCGACACCCA CCGCCTCACG CTCGGCGCCC GCCGCCTGCT GCACCCGGCC AGCGGCCTGT TCGCGCGCGT CAACGGCTAC TACGACCGCG CCCGCAACGA CTATCCCATC GACGTCGACG TGCCCGACGC CAGCGGCCGG CTGGTCGCCG GACGCGCGCG CCGTTTTCAC GACGGCTACC GCGCGCTCGG TGCCGGCGCC GAGATCGGCG TCTCCCGCCA GCCCTGGGCC GAGCATCTGT CGCTGCGCGG CTTCGTCGGC GAGCACGCGC GCGACCTCCA GCACAACCTG CTGGCCACCG TGCCCTACGG CGAGGCCACC TACGGCAAGT ACAGCGCCGG CGCGCACCTG CGCTACGCCC AGCCGCTGTC CGAGCGCGCG CGCGTCGACG CCGTCGCCGG ATATTCGTAC ATCCGGACCA CCTTCCGCGA TCTCGCCACC TGCCGCTACG ACTGGTACGG CCGCTGCGTG GCCAACCTGC CCCAGGCCGG CGAGATCGAG GCCCTCGCCA GCGATCAGCG CGTGCGTCAG 
AACGTCCTCT TTGCCCACCT GGGCCTGACC TGGACGCCGG CCCCCACCCA CGCCCTGCGC GTCGCGCTGG CGCCCAGGCA CACCAGCCAA CGCGGCGAAA ATCGCGCCCT CCCGGCCGAG GACTACGATC CCCTGGCCGC CCGTCGCACC TTGAGCAGCG CGGTTTTCGG CGCTGAGTAC GAAATCGACG CCTGGGCCGA GCGCCTGACC AATATCGCCT TCATCAAGAG CTACCTGCAG CGCGCGCGCA GCGACGAGCG CTTGCCCAAT GGCACCATCC GCGAGCTCGA CCGCCTCACG CCGCGCCTGG GCGTGGGCGA CAGCCTGCGT CTGCAGCTCA CGGACGAGCT GAGCCTCAAA AGCTCGTACG AACTCGCCAC CCGCCTGCCC CGACCCGAAG AACTTTTTGG CAACGGCGGA CTCATCGTCG AAAACCTGCA CCTCGATCCC GAGACCAGTC ACAACCTCAA CCTCGACCTG CAATTCGAAC GCGAAGACCA GGCTCTTGGC CACATGCGCG TGCGCGTGGG CGGCTTCGGT CGCTTCACCC GCGACCTCAT CGTGCTGCTC GGCACCGGCA GCTACCAGCA GTACGAAAAC GTCTACGCCG CGCGCTCGCT CGGCGTCGAG ACCACGGCCG GTTGGCAATC GCCCGGCCAG CGCCTCGCGC TCGAGGGCGC GCTCACCTGG CAGGACTTCC GCAACGCGGC CGACGAGGAC TCGGACGACC CCTTCGCCGG CGACCGCATC CCCAACCATC CCCACCTGTT CGCCAGCGGC AGCGCGCGCG TGCAGTGGCC GGCGCTGCTA CAGCCCGGCG ACGCGCTCGC GCTCACCTGG CACACCCGCT ACGTGCACGC CTTCTTCCGC AGTTGGGAGA GCGCGGGCGC GGCCGACTCC AAGCTCCAGG TCCCCAGCCA GCTCATCCAC ACCCCCGCGC TCAGCTACCG CGTCGAGCGC GAGGGCCGGG CCGCGAGCTT CACCATCGAG CTGCAAAACC TCAGCGACGA GCGAGTCTTC GACTTTTTCG GCGTCCAGCG CCCCGGACGG GCCGCCTACG CAAAGCTGGT ACTCGACATG TGA
|
Protein sequence | MLRASHRLVQ PALLCIAALC LSGIAPPSEA RAQAAAAPGV DQVSDDTDAG AGAGAGVAPA PAPAASAADA VQPPVLVHFV EAEYPAAARS EGRAGSVDLL LSIDARGRVT AAEIAASAGP DFDSAARAAA LAFRFEPARR GGVAMPARIR YRYAFALPEA PDADAASADA DATADAAVPA APPARAAASG ALVGSLRIPG RAREPVAGAS ITLTAADGST RSATSDERGG FRIDALAPGA YDVRAAAPGF EPGLLRTEIV PGRAAEIALV LAPTVGGAAD QVLEVTVRGS RAGERLHNSA QAVRVVETEQ AQQQSADLGE VLARTEGVDV RRSGGLGSDA RLSLGGLSGD QIRTFLDGIP LALAGYPLGI ANVPVNQVER IEIYRGVVPV RFGTDALGGA INLVSDGAER RADGASASYQ TGAFDTHRLT LGARRLLHPA SGLFARVNGY YDRARNDYPI DVDVPDASGR LVAGRARRFH DGYRALGAGA EIGVSRQPWA EHLSLRGFVG EHARDLQHNL LATVPYGEAT YGKYSAGAHL RYAQPLSERA RVDAVAGYSY IRTTFRDLAT CRYDWYGRCV ANLPQAGEIE ALASDQRVRQ NVLFAHLGLT WTPAPTHALR VALAPRHTSQ RGENRALPAE DYDPLAARRT LSSAVFGAEY EIDAWAERLT NIAFIKSYLQ RARSDERLPN GTIRELDRLT PRLGVGDSLR LQLTDELSLK SSYELATRLP RPEELFGNGG LIVENLHLDP ETSHNLNLDL QFEREDQALG HMRVRVGGFG RFTRDLIVLL GTGSYQQYEN VYAARSLGVE TTAGWQSPGQ RLALEGALTW QDFRNAADED SDDPFAGDRI PNHPHLFASG SARVQWPALL QPGDALALTW HTRYVHAFFR SWESAGAADS KLQVPSQLIH TPALSYRVER EGRAASFTIE LQNLSDERVF DFFGVQRPGR AAYAKLVLDM
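The sequences in this record are printed in space-separated blocks of 10, so they need to be normalized before any analysis. The sketch below is illustrative only. The helper names `normalize`, `gc_fraction`, and `looks_like_cds` are hypothetical, and `COMMON_STARTS` is a deliberately partial list of bacterial start codons rather than the complete set for translation table 11. The demo uses only the first two and last three blocks of the gene sequence, copied from the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # assumption: frequent bacterial starts, not the full list


def normalize(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a common start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# Excerpts copied from the record: the first two blocks and the last three.
head = normalize("GTGCTCCGAG CTTCTCATCG")
tail = normalize("CAAAGCTGGT ACTCGACATG TGA")

print(head[:3], tail[-3:])  # start and stop codons of this gene: GTG TGA
print(gc_fraction(head))    # GC fraction of the 20-base excerpt only
```

The gene begins with GTG, an alternative start codon that is translated as methionine at the initiation position, which matches the leading M of the protein sequence. Run on the full Gene sequence field, `gc_fraction` would be expected to land near the 73% listed under GC content, though that figure has not been recomputed here.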