Gene Hoch_5466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5466 
Symbol 
ID8547879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7504039 
End bp7505322 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content66% 
IMG OID646390139 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_003269842 
Protein GI262198633 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0707326 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CCGTCATCCA GGTCGAAAAC CTCTTCAAGA TCTTCGGTGC CAACCCGCAC 
CGGGTCTACC CCATGCTCGA GAAGGGCGAC TCGAAAGAGG AGATTCTCGA GAAAACCGGC
TGCGTGGTCG CCATCGACGA CGTGAGCTTC GAGGTCCAGC GGGGCGAATT CTTCGTCATC
ATGGGACTCT CGGGCAGCGG CAAATCGACC ATCATCCGCT GCATCAACCG CCTCATCGAG
CCCACCCGCG GCAAGATTTT GATCGGCGGT CAAGATGTGG TCCAGATGGA CGACAAAGCG
CTCATCGAGA CCCGGCGCAC CAAGATGTCG ATGGTGTTCC AGCACTTCGG CCTGCTGCCG
CACCGCACGG TGCTGGCCAA CGTCGAGTAC GGGCTGGAGA TCTCGGGCAT GGAGGTGGCC
GAGCGCCAGC AGCGCGCGCG CGCGACCATC GCCCAGGTCG GTCTCGAGGG CTACGAGGAC
AGCATGCCCT CGGAGCTGAG CGGCGGTATG CAGCAGCGCG TGGGCCTGGC TCGCGCGCTG
ACCAACGACC CCGACATCCT GCTCATGGAT GAGGCCTTTA GCGCCCTCGA CCCGCTGATC
CGCACGCAGA TGCAGGACGA GCTCATCGAC CTGCAGACGC GCATGCGCAA GACCATCCTG
TTCATCACCC ACGACCTCGA CGAGGCGCTC AAGCTCGGCG ACCGCATCGC CGTGCTCGGC
CCTGGCGGCA AGCTGATGCA GATCGGCACG CCCGAGGACA TCCTCACCGC GCCGGCCAAC
GAGTACGTGC GCACCTTCGT ACAGAACGTC GACCGCACGC GCGCGCTCAC GGCCTCGTCG
ATCATGCACA AGGCGTTGAC CATCGCCGCG CACAAAGACG GCCCTGGCAC CGCCGCGCGC
CGCATGGAGA GCGCCGGCGT GTCCTCGGCC TACGTGCTCG ACAGCGAGCG CCGGCTGCTC
GGCGTGCTCA GCATCGACCG CGCGGTCGAA CTCCAGGCCG GCAAGATCCG CGATGTGAGC
TCGGCGGTCG ACGACGGCGT GTACACCACC ACGCCGCACA CCTCGGTCCG CGAGCTGCTG
GCCACGGCCC TGGTCACCAA GGTGCCCATC GCGGTCCTCG ACGACGATCG ACGTTTGCTC
GGCATCGTCG ATCGCGCCTC GATCCTGGCC GAAATCGCCA GCGAAGACCC CGAGGCCATT
CCCCTGCGGA CGCTGCTCGA CGACGAGTCG TCACCGTCCG CCAAAGACTC CGCCTCCGAC
ATCCCGCAGC GGGCCACGTC CTGA
 
Protein sequence
MSETVIQVEN LFKIFGANPH RVYPMLEKGD SKEEILEKTG CVVAIDDVSF EVQRGEFFVI 
MGLSGSGKST IIRCINRLIE PTRGKILIGG QDVVQMDDKA LIETRRTKMS MVFQHFGLLP
HRTVLANVEY GLEISGMEVA ERQQRARATI AQVGLEGYED SMPSELSGGM QQRVGLARAL
TNDPDILLMD EAFSALDPLI RTQMQDELID LQTRMRKTIL FITHDLDEAL KLGDRIAVLG
PGGKLMQIGT PEDILTAPAN EYVRTFVQNV DRTRALTASS IMHKALTIAA HKDGPGTAAR
RMESAGVSSA YVLDSERRLL GVLSIDRAVE LQAGKIRDVS SAVDDGVYTT TPHTSVRELL
ATALVTKVPI AVLDDDRRLL GIVDRASILA EIASEDPEAI PLRTLLDDES SPSAKDSASD
IPQRATS