Gene Hoch_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2028 
Symbol 
ID8544410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2796992 
End bp2799874 
Gene Length2883 bp 
Protein Length960 aa 
Translation table11 
GC content73% 
IMG OID646386731 
ProductTonB family protein 
Protein accessionYP_003266466 
Protein GI262195257 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.38309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCGAG CTTCTCATCG CCTGGTTCAG CCAGCGCTGC TGTGCATCGC GGCGCTGTGC 
CTGAGCGGCA TCGCCCCGCC CTCTGAGGCA CGCGCGCAGG CCGCGGCCGC GCCCGGCGTC
GACCAGGTGT CCGACGACAC GGATGCAGGC GCGGGCGCAG GAGCTGGCGT GGCTCCGGCT
CCGGCTCCGG CAGCATCCGC AGCCGACGCC GTGCAGCCGC CGGTGCTCGT GCACTTCGTC
GAGGCCGAGT ACCCGGCCGC CGCGCGCAGC GAAGGCCGCG CCGGCAGCGT CGATCTGCTG
CTCAGCATCG ACGCGCGAGG GCGCGTCACC GCCGCGGAAA TCGCCGCCTC GGCCGGCCCC
GACTTCGACA GCGCCGCGCG CGCCGCGGCC TTGGCGTTTC GCTTCGAACC CGCGCGCCGC
GGCGGCGTGG CCATGCCCGC GCGCATCCGC TACCGCTACG CCTTCGCGCT GCCCGAGGCC
CCGGACGCAG ACGCAGCCTC GGCAGACGCG GACGCCACCG CCGACGCTGC AGTGCCCGCA
GCGCCCCCGG CCCGTGCCGC CGCCAGCGGC GCGCTCGTCG GCAGCCTGCG CATCCCGGGC
CGCGCGCGCG AGCCCGTCGC CGGCGCCAGC ATCACGCTCA CCGCCGCCGA CGGCAGCACG
CGCAGCGCCA CCAGCGACGA GCGCGGCGGC TTCCGCATCG ACGCGCTCGC GCCCGGCGCC
TACGATGTCC GCGCCGCGGC GCCCGGCTTC GAACCCGGCC TGCTGCGCAC CGAAATTGTC
CCCGGACGCG CGGCCGAAAT CGCGCTCGTA CTCGCGCCCA CGGTCGGCGG CGCCGCCGAC
CAGGTCCTCG AGGTCACCGT GCGCGGCAGC CGCGCGGGCG AGCGCTTGCA CAACTCGGCG
CAAGCCGTCC GCGTGGTCGA AACCGAGCAA GCCCAGCAGC AGAGCGCCGA CCTCGGCGAG
GTGCTGGCGC GCACCGAAGG CGTGGACGTA CGCCGCAGCG GCGGCCTGGG CTCCGACGCG
CGGCTGTCGC TCGGCGGACT CAGCGGCGAC CAGATCCGCA CCTTCCTCGA CGGCATCCCG
CTCGCGCTCG CCGGCTATCC CCTGGGCATC GCCAACGTGC CCGTCAACCA GGTCGAGCGC
ATCGAGATCT ACCGCGGCGT CGTGCCCGTG CGCTTCGGCA CCGACGCCCT CGGCGGCGCC
ATCAACCTGG TCAGCGACGG CGCCGAGCGC CGCGCCGATG GCGCCTCGGC CTCGTACCAG
ACCGGCGCCT TCGACACCCA CCGCCTCACG CTCGGCGCCC GCCGCCTGCT GCACCCGGCC
AGCGGCCTGT TCGCGCGCGT CAACGGCTAC TACGACCGCG CCCGCAACGA CTATCCCATC
GACGTCGACG TGCCCGACGC CAGCGGCCGG CTGGTCGCCG GACGCGCGCG CCGTTTTCAC
GACGGCTACC GCGCGCTCGG TGCCGGCGCC GAGATCGGCG TCTCCCGCCA GCCCTGGGCC
GAGCATCTGT CGCTGCGCGG CTTCGTCGGC GAGCACGCGC GCGACCTCCA GCACAACCTG
CTGGCCACCG TGCCCTACGG CGAGGCCACC TACGGCAAGT ACAGCGCCGG CGCGCACCTG
CGCTACGCCC AGCCGCTGTC CGAGCGCGCG CGCGTCGACG CCGTCGCCGG ATATTCGTAC
ATCCGGACCA CCTTCCGCGA TCTCGCCACC TGCCGCTACG ACTGGTACGG CCGCTGCGTG
GCCAACCTGC CCCAGGCCGG CGAGATCGAG GCCCTCGCCA GCGATCAGCG CGTGCGTCAG
AACGTCCTCT TTGCCCACCT GGGCCTGACC TGGACGCCGG CCCCCACCCA CGCCCTGCGC
GTCGCGCTGG CGCCCAGGCA CACCAGCCAA CGCGGCGAAA ATCGCGCCCT CCCGGCCGAG
GACTACGATC CCCTGGCCGC CCGTCGCACC TTGAGCAGCG CGGTTTTCGG CGCTGAGTAC
GAAATCGACG CCTGGGCCGA GCGCCTGACC AATATCGCCT TCATCAAGAG CTACCTGCAG
CGCGCGCGCA GCGACGAGCG CTTGCCCAAT GGCACCATCC GCGAGCTCGA CCGCCTCACG
CCGCGCCTGG GCGTGGGCGA CAGCCTGCGT CTGCAGCTCA CGGACGAGCT GAGCCTCAAA
AGCTCGTACG AACTCGCCAC CCGCCTGCCC CGACCCGAAG AACTTTTTGG CAACGGCGGA
CTCATCGTCG AAAACCTGCA CCTCGATCCC GAGACCAGTC ACAACCTCAA CCTCGACCTG
CAATTCGAAC GCGAAGACCA GGCTCTTGGC CACATGCGCG TGCGCGTGGG CGGCTTCGGT
CGCTTCACCC GCGACCTCAT CGTGCTGCTC GGCACCGGCA GCTACCAGCA GTACGAAAAC
GTCTACGCCG CGCGCTCGCT CGGCGTCGAG ACCACGGCCG GTTGGCAATC GCCCGGCCAG
CGCCTCGCGC TCGAGGGCGC GCTCACCTGG CAGGACTTCC GCAACGCGGC CGACGAGGAC
TCGGACGACC CCTTCGCCGG CGACCGCATC CCCAACCATC CCCACCTGTT CGCCAGCGGC
AGCGCGCGCG TGCAGTGGCC GGCGCTGCTA CAGCCCGGCG ACGCGCTCGC GCTCACCTGG
CACACCCGCT ACGTGCACGC CTTCTTCCGC AGTTGGGAGA GCGCGGGCGC GGCCGACTCC
AAGCTCCAGG TCCCCAGCCA GCTCATCCAC ACCCCCGCGC TCAGCTACCG CGTCGAGCGC
GAGGGCCGGG CCGCGAGCTT CACCATCGAG CTGCAAAACC TCAGCGACGA GCGAGTCTTC
GACTTTTTCG GCGTCCAGCG CCCCGGACGG GCCGCCTACG CAAAGCTGGT ACTCGACATG
TGA
 
Protein sequence
MLRASHRLVQ PALLCIAALC LSGIAPPSEA RAQAAAAPGV DQVSDDTDAG AGAGAGVAPA 
PAPAASAADA VQPPVLVHFV EAEYPAAARS EGRAGSVDLL LSIDARGRVT AAEIAASAGP
DFDSAARAAA LAFRFEPARR GGVAMPARIR YRYAFALPEA PDADAASADA DATADAAVPA
APPARAAASG ALVGSLRIPG RAREPVAGAS ITLTAADGST RSATSDERGG FRIDALAPGA
YDVRAAAPGF EPGLLRTEIV PGRAAEIALV LAPTVGGAAD QVLEVTVRGS RAGERLHNSA
QAVRVVETEQ AQQQSADLGE VLARTEGVDV RRSGGLGSDA RLSLGGLSGD QIRTFLDGIP
LALAGYPLGI ANVPVNQVER IEIYRGVVPV RFGTDALGGA INLVSDGAER RADGASASYQ
TGAFDTHRLT LGARRLLHPA SGLFARVNGY YDRARNDYPI DVDVPDASGR LVAGRARRFH
DGYRALGAGA EIGVSRQPWA EHLSLRGFVG EHARDLQHNL LATVPYGEAT YGKYSAGAHL
RYAQPLSERA RVDAVAGYSY IRTTFRDLAT CRYDWYGRCV ANLPQAGEIE ALASDQRVRQ
NVLFAHLGLT WTPAPTHALR VALAPRHTSQ RGENRALPAE DYDPLAARRT LSSAVFGAEY
EIDAWAERLT NIAFIKSYLQ RARSDERLPN GTIRELDRLT PRLGVGDSLR LQLTDELSLK
SSYELATRLP RPEELFGNGG LIVENLHLDP ETSHNLNLDL QFEREDQALG HMRVRVGGFG
RFTRDLIVLL GTGSYQQYEN VYAARSLGVE TTAGWQSPGQ RLALEGALTW QDFRNAADED
SDDPFAGDRI PNHPHLFASG SARVQWPALL QPGDALALTW HTRYVHAFFR SWESAGAADS
KLQVPSQLIH TPALSYRVER EGRAASFTIE LQNLSDERVF DFFGVQRPGR AAYAKLVLDM