Gene Hoch_5205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5205 
Symbol 
ID8547617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7158887 
End bp7160500 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content71% 
IMG OID646389880 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_003269584 
Protein GI262198375 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.057164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCCGA TCTTTGCAAC CCAGCTCGAG CGCCTGCCCG CGCTGCTCGG GGCGCACATC 
GTGCTCACGG TCATCGCCCT GGCGCTGGGC CTGGCCATCT CGCTGCCGGC CGCGTTCCTG
GGGCTGCGCC AGCGCGCCCT GCAGGGGCCG CTGCTGGCGG TGGCGAGCAT CATCCAGACC
ATCCCCAGCC TGGCGATTCT GGCGCTGATG GTGGCCGCGT TCGGGCTTTT CGGACAGCCG
GCCGCGCTCA TCGCGCTCAC CGCCTACAGC GTGCTGCCGA TCCTGCGCAA CACCATCACC
GGCATCGAGG GCGTGGACCC GGCCGCGGTC GAGGCCGCGC GCGGCATCGG CATGACCCGC
AACCAGATCC TGTGGCGGGT GCAGTTGCCG CTGGCCGCGC CCATCATCCT CGCCGGTATC
CGCACCGCCA CGGTGTGGGT GGTCGGCATC GCCACGCTGG CCACGCCCGT GGGCGCCGAC
TCGCTGGGCA GCTACATCTT CGGCGGCCTG CAGACGCGCA ACACCACGGC CGTGCTGTTC
GGCGTGGTAT CGGCGGCGGC GCTGGCCATC GCGCTGGACT CGCTGATCCA CCTGGGCGAG
GTCGCCGCGC GCAAGCGCTC GCGGCCGCTG GCGCTCATCA CCGCCGTGGG CCTGGCCCTC
ATCCTGGCCA TGGGCGTGTG GCCCAAGGGC GGCGACGAGC GCGTGATGGC GGCCGCGCCC
ACGGCCGCGG CGCAGGCCGA GGGCGTCGAG GCCGCGCCGA GGCGCACCGT GATGGTCGGC
GCCAAGACCT TCACCGAGAG CTATATCCTG GCCCGGCTCA TCCGCGCGCG GCTGAGCGAG
GCCGGCTATC CCGCCCAGCT CAAAGAGGGG CTGGGATCGG CCGTGGTCTT CGATGCGCTG
CGCCAGGGCG AGATCGACGT GTACGTCGAT TACTCGGGCA CCATCTGGGC CAACGCCATG
AAGCGCACCG AGACGCTGCC GCCGCAGGAG GTTCTCGACC AGATGTCCGA GTGGCTCGAG
CGCGAGCACG AGATGAAGAG TCTGGGCGCG CTCGGCTTCG AGAACGCCTA CGGCCTGGCC
CTGCGCGAGG ACGCGGCCGC CGAGCTCGGG GTCGATACCA TCTCCGAGCT GGTGCCGCAC
ACGCCCAAGC TGTCGCTGGG CTCGGACTTC GAGTTCTTCG ACCGGCCGGA GTGGACCAAG
CTGCGCGACA CCTACGGGCT GGCGTTCGAC GCCCAGCGCG CGTTCGACCC CACCCTGATG
TATCCGGCGG TCAAAGAGGG CGACGTCGAC GTGATCACGG CCTTCACCAC CGACGGCCGC
ATCGCGGCCT TCAACCTGCG CGTGCTGCCC GATGACAAAC ACGCCTTCCC GCCCTACGAC
GCGGTGCTGC TGCTCAGCCC CGAGGCGTCC AAGGATCCGG ATCTCATCGC CGCGCTCGCG
CCGCTCATCG GCGCCATCGA CAGCGACGCC ATGCGCACGG CCAACAAGCT GGTCGACGTC
GACCGCAAGG ATACGCAGTT CGCGGCCCAG TATCTGCTCG ACCGGATCGC GGCGGGCGAC
GACGCCCAAC CGGCCGCCGA CGACACCGCC GAGTCAGCCG CGGACGGCGA GTAG
 
Protein sequence
MSPIFATQLE RLPALLGAHI VLTVIALALG LAISLPAAFL GLRQRALQGP LLAVASIIQT 
IPSLAILALM VAAFGLFGQP AALIALTAYS VLPILRNTIT GIEGVDPAAV EAARGIGMTR
NQILWRVQLP LAAPIILAGI RTATVWVVGI ATLATPVGAD SLGSYIFGGL QTRNTTAVLF
GVVSAAALAI ALDSLIHLGE VAARKRSRPL ALITAVGLAL ILAMGVWPKG GDERVMAAAP
TAAAQAEGVE AAPRRTVMVG AKTFTESYIL ARLIRARLSE AGYPAQLKEG LGSAVVFDAL
RQGEIDVYVD YSGTIWANAM KRTETLPPQE VLDQMSEWLE REHEMKSLGA LGFENAYGLA
LREDAAAELG VDTISELVPH TPKLSLGSDF EFFDRPEWTK LRDTYGLAFD AQRAFDPTLM
YPAVKEGDVD VITAFTTDGR IAAFNLRVLP DDKHAFPPYD AVLLLSPEAS KDPDLIAALA
PLIGAIDSDA MRTANKLVDV DRKDTQFAAQ YLLDRIAAGD DAQPAADDTA ESAADGE