Gene Hoch_5464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5464 
Symbol 
ID8547877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7502160 
End bp7503098 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content63% 
IMG OID646390137 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_003269840 
Protein GI262198631 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.102731 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGTAC GTCATCTCGA GAAATTGGGA ATCGCCGCGC TCGTGGTCGG ATGCGGGCTC 
AGCGGTTGCA GCAAAGAGGC CGAGAAGCCG GCGACCGAGA CCACCGAAGG CACCGAGACC
GCCGAGACCG CTGAGACCGA GGAAGCTGGC GAGGAGAAAG TCGCCAACCT CGTGTACGTG
AACTGGGCCG AGGGCATCGC CTACACCCAC CTGGCCAAGG TCGTGCTCGA GGACAAGATG
GGCTACGAGG TCAAGCTCAC CGCCGCCGAC GTCGGCCCGG CCTACACCTC GGTGGCCCAG
GGCGACCAGG ACGCCTTCAT GGAGACCTGG CTGCCGACCC TGCACAAGGA CTACATCGAG
AAGTTCGAGG GCAAGCTGGT CGATCTCGGC CACGTGTACG AAGGCACGCA GAGCGGCCTG
GTCGTGCCCG CCTACGTGCC GATCACCAAG ATATCCGAGC TCAAGGATCA CAAGGACAAA
TTCGACGGCA AGATCACGGG TATCGACGCC GGCGCCGGCA TCATGAATAC CACCGAGGAG
GTCATCGCTT CCTACGACCT GGGCTTCACC CTGCTGCCCT CGAGCGGCCC GGCCATGACC
TCGGCGCTCA AGAACGCCAT CGACAAAGAG GAGTGGATCG TGGTCACCGG CTGGCGTCCG
CACTGGAAGT TCGGCCGCTG GGACCTCAAG TTCCTCGAGC AGGATGAGGA CAAGATGGTG
TGGAAGGAGG GCAACATCCA CATCACCGGC CGCGCCGGCA TCAAGGAAGA CAAGCCCACC
CTGGCCGCGT TCCTGAGCAG CATGATGCTC ACCGACGAGC AGCTCGGCGA CCTGATGATC
AAGGTGAACG AGAGCGACGG CAAGGACGTC GAGGACGTCG CCCGCCAGTG GATGGCCGAC
AACGAGGCTG TCATCACGGC GTGGGTGCCG GCCTCCTGA
 
Protein sequence
MFVRHLEKLG IAALVVGCGL SGCSKEAEKP ATETTEGTET AETAETEEAG EEKVANLVYV 
NWAEGIAYTH LAKVVLEDKM GYEVKLTAAD VGPAYTSVAQ GDQDAFMETW LPTLHKDYIE
KFEGKLVDLG HVYEGTQSGL VVPAYVPITK ISELKDHKDK FDGKITGIDA GAGIMNTTEE
VIASYDLGFT LLPSSGPAMT SALKNAIDKE EWIVVTGWRP HWKFGRWDLK FLEQDEDKMV
WKEGNIHITG RAGIKEDKPT LAAFLSSMML TDEQLGDLMI KVNESDGKDV EDVARQWMAD
NEAVITAWVP AS