Gene Hoch_3713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3713 
Symbol 
ID8546103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5111670 
End bp5114663 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content71% 
IMG OID646388380 
ProductPeriplasmic component of the Tol biopolymer transport system-like protein 
Protein accessionYP_003268106 
Protein GI262196897 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.419088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATGACT TCAACCCGCG GGCCGCGCTG AGTATCGGCG CTGTGGCGCT GTGCTTTGCT 
GCCGGATGCA GCGAGGAGAG CGCTCCCGGC CGCACCTACT ACGATCGCAA CGTCGAGCCC
ATCCTGCTGC AGACCTGCGC ATCCAACGTC TCCGGCTGCC ACGCGGCCAA CGACGACGAC
CCCTTCGCCT TCGCGGCCGG TAACTTCGAC GTCACCAGCT TCGAGAACGT GCAGAAACGC
CGCGACCTGC TCGAGCCCTT CGGCGTCTAT CCCGTGCCGC TGCTGCTGAT CAAGTCCACC
GGCGCCTCGG ACGAGCTGGA GTTCGCCTAC GGCGGCGAGT TCCAGTCGCT GCGGGTGCAG
CACGCGGGCG GCACAGTGCT CGAGGTCGGC TCCGAGGCCT ACCTCACGCT GCTGCGCTGG
ATGGAGAACG GCGCCACCGA GAGCGGCCTG CCGCCGGTCA CGCCGCCCGA GAGCGGCAGC
GGCGGCTGCT CGAACATCAT CGCCCAGGAC TTCGACCCGG CGCCGTACGT CGCCGACGCC
TCCTTCAACC AGTTCGTGGC CGAGGTCCAG CCCGTGCTGG TGAACTCGTG CGCCACCGGC
AACTGCCACG GCGCGCCGCA GTCGGACTTC TACGTCACCT GCGGCGACAG CGAGCAGGCC
CGCGCCTACA ACTTCGCCCA GGTGCAGGCC TTTGTCGACG AGCCGGCCGA AAACTCGCCG
CTCTTGCTGT ACCCGCTGGC CGTGAGCGCG GGCGGCTACT TCCACACCGG CGGCGAGTTC
TTCGGCTCGC GCAACAACGG CGACTACAAA GCGCTGGCGA GCTGGGCCGA GGCCGCTGGC
GCCGTGGACT TCGGCGCCGA CGACGCCGGC AAGGCGTTCT TTGCCGACTA CGTGCAGCCC
ATGCTGCTGC GCCGCGGCTG CCAGTTCGAG GCCTGCCACA GCCCGGCCGC GACCAATGAT
TTCAAGCTGC GCTCCGGCAG CCAGGGCTTC TTCTCGGCCG TGGCCCTGGA GAAGAACTAC
GAGCTGGCGC GCAAAGACTT CATGTCGATG GAGGTGCCGG ACGCGCGCCG CAGCCGCATC
GCGTCCAAGA CCATGCTGCG CTCCTCGGGC GGCATCGCCC ACCGCGGCGG GCCGCTGCTC
GAGGACGCGC GGCTCGACAG CAAGGTCGCC GACATCTCGA GCGCGTGCGC GGCCTTTGCC
CCCGAGGACG CGCCGCCGCT GTGCATCTTG CAGCAGTGGG TCGAGCTCGA GCGCCAGGAC
GCCATCGACG CCGGCGCGAT CCTGCCGCTC GCGGCTGGCG ACACCGTACC GCTGGTGTAC
GTCGAGCGCG AGACCGAGCA CGTGGCCACG CCGCTGGAGT TCGACACCTA TCAGCCGGGC
TCCGACCTGC TCGTGGCCGA TGCCACGCTC GACGAGCGCG GCGCCATCAC CGCGCTGAGC
GAACCGCGCT CGCTGCTCGC CGGCTGTCCC GGCGCTGGCG ATACCGCGAG CGTCGACGTG
CGCGCGCCCG ACCTGCGCCA CGACGGCACC ACCATCGCCT TCGCCATGCG CACCGCGCAG
AGCGACCCGC TGGGCGTGTA CAAGGTCAAC ATCGATGGCG GCGGCTGCCA GCGCCTAACC
CCGGCCGAGG CGCCGGTCGG CGGCATCGCC ATCCACAACT TCGACCCGGC GTGGTCGCCC
GACGGCGCCT CGATCGTGTT CGCCTCGACC CGCGGCGGCG CCAACGCGCC ATCGCTCAGC
CGCCAGCTCT TCCTGCCGCA GTCCGATATC TGGCGCATGC GCGCCGACGG CAGCGCGCCC
GAGCAGGTCA CCTACCTGAC CAACAGCGAG CTGTCGCCGC AGATGATCCG CGAGGGCCGC
ATCATCCTGT CGACCGAGAA GGTGTCGTCG GGCTTCTATC AGGTCGCCGG TCGGCGCATC
AACTGGGACC GCACCGACTA CCATCCGCTG CTGGCGCAGC GCGCGGAGTC GCCCTTCGTC
GATCTCGACG ACCTGGACGA ATTCGCCCCC TCGGTCGGTT ACGCGCAGGC GACCGACATC
CGCGAGGCGC TCAACGGCAA CTTCCTGTTC ATCCTCTCGG ACGCGGGCGC GCGCGGCGGC
GCTGGCACCC TGGCGGTGTT CAATCGCAGC GTCGGCACCT TCGAGGCCGG CCGTGAGCAG
GCCGGCTACC TCGAGTCGAT GAGCATCCCC GACACCGCCG CCACCGGCCG CGCCGGCAGC
GCCACCCAGG GCGCCTACCG CACGCCGTAT CCGCTGCTCG ACGGCCGCGT GCTGGTGTCC
TACGCCAGCT TCAGCGGCGA CCTGGCGACC GCGAACGCGC TCGACTGGGA CCTCGTGGCC
GTCGACCCGC GCACCGGCGC CCGCGAGGTG CTGCTCGACA GCGATAAGGC CCTGGTCGAC
GCGGTGCTCG CCGTGCCCTA CGAGCCGCGC GAGCTGTACT TCAACCGCCG CCAGCTCGTC
TTCGGCGGCG GCGTGGACAC CCAGGCCACC GGCGGCGAGG GCTTCAGCAT CATTCACTTC
CCGGACGCGC CGGTGGTGTT CACCCTGCTC AACGCCAACC TGCGCCGCGG CCGCCCCGTG
GACACCTTCC GCGAAGCCTC GCACCTGGCC GTGTACCGCG AGGCCCCAGC GCCCGCCGGC
ACCACCAGCG GCAGCGGCGA GGGCGGCATC TTCGAGCAGC GCGAGCTGCT CGGCCGCGCG
GCCCTGGCCG CCGACGGCTC GGTGCGCATC CGCGTCCCGG CCGGGGTCGG CGTCATCCTC
GAGCTGCAGA CCGAGGACGG CGGCGCGGTC GAGACCATGC GCGAGGAGCA CCAGGTCGGC
CCCGGCGAGG TGGTCAGCAT CGGCGTCCCC GGCGACCTCT TCGATGGCGT GTGCGGCGGC
TGCCATGGCT CCATCTCGGG CCAGGAGCTC GACGCCACGC TCTCGCCCGA CGTCCTCACC
GGCGCATCCG AATCGATCGC GGCCGACAAC GCCCCGGTCG ATTTGACCCG CTGA
 
Protein sequence
MHDFNPRAAL SIGAVALCFA AGCSEESAPG RTYYDRNVEP ILLQTCASNV SGCHAANDDD 
PFAFAAGNFD VTSFENVQKR RDLLEPFGVY PVPLLLIKST GASDELEFAY GGEFQSLRVQ
HAGGTVLEVG SEAYLTLLRW MENGATESGL PPVTPPESGS GGCSNIIAQD FDPAPYVADA
SFNQFVAEVQ PVLVNSCATG NCHGAPQSDF YVTCGDSEQA RAYNFAQVQA FVDEPAENSP
LLLYPLAVSA GGYFHTGGEF FGSRNNGDYK ALASWAEAAG AVDFGADDAG KAFFADYVQP
MLLRRGCQFE ACHSPAATND FKLRSGSQGF FSAVALEKNY ELARKDFMSM EVPDARRSRI
ASKTMLRSSG GIAHRGGPLL EDARLDSKVA DISSACAAFA PEDAPPLCIL QQWVELERQD
AIDAGAILPL AAGDTVPLVY VERETEHVAT PLEFDTYQPG SDLLVADATL DERGAITALS
EPRSLLAGCP GAGDTASVDV RAPDLRHDGT TIAFAMRTAQ SDPLGVYKVN IDGGGCQRLT
PAEAPVGGIA IHNFDPAWSP DGASIVFAST RGGANAPSLS RQLFLPQSDI WRMRADGSAP
EQVTYLTNSE LSPQMIREGR IILSTEKVSS GFYQVAGRRI NWDRTDYHPL LAQRAESPFV
DLDDLDEFAP SVGYAQATDI REALNGNFLF ILSDAGARGG AGTLAVFNRS VGTFEAGREQ
AGYLESMSIP DTAATGRAGS ATQGAYRTPY PLLDGRVLVS YASFSGDLAT ANALDWDLVA
VDPRTGAREV LLDSDKALVD AVLAVPYEPR ELYFNRRQLV FGGGVDTQAT GGEGFSIIHF
PDAPVVFTLL NANLRRGRPV DTFREASHLA VYREAPAPAG TTSGSGEGGI FEQRELLGRA
ALAADGSVRI RVPAGVGVIL ELQTEDGGAV ETMREEHQVG PGEVVSIGVP GDLFDGVCGG
CHGSISGQEL DATLSPDVLT GASESIAADN APVDLTR