Gene Hoch_3723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3723 
Symbol 
ID8546113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5124950 
End bp5127550 
Gene Length2601 bp 
Protein Length866 aa 
Translation table11 
GC content73% 
IMG OID646388390 
ProductGeneral secretory system II protein E domain protein 
Protein accessionYP_003268116 
Protein GI262196907 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.162413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGAAG ACGCCGGAGC CCTCCTCGTA CGATCCGGAC TTATCGCCAG TGAACATCTT 
CGCTACGCCC GTACGGCTCA GGCCGAACGG GGCGGAACCG TCGGAGAGCA CCTGGTGTTG
GCCGGATACG TCGAAGACGG CGCACTGACC GACTTCTATC GCGAGCGCCT CATGGTGCCG
CAGGTGAGCC CCGAGGAGCT GGCGCAGATT CCCGAGTGGT TGGTCGCCAA GGTCACGGCG
GAGATGGCGG CGCAGTTCCG CTGCATTCCG GTGGCGTGCG CGCCCGACCG GCATCTCACC
GTGGCGCTGG CGGATCCCAG CGCGACCCAC GCGGTCGACG AGATCTCGTT CCACACCGGC
CACTACGTGG TGCGCGCCGT AGCCACGCAG AACCAGATCG CCTGGTGCCT GTTGCACTAC
TACGGGATGA TGACGCCGCT GGGCGAGCAT CTGCTGAGCA GCGGCCAGAC GCCGGTGCCG
GTGCCGCAGA CCTCGGCCGC GCTGCTCGCG GTCGGCCGGG GCCTGGTGGC CGGCGGGCGC
TCAGCGGCGC GCGGCGCCAT CCCGCGGGCC ACCACCGAGC GCGGCCGCAG CGGCGCGAGC
GCATCGCAGA GCGGGCGCGA TCGCTTCGCG TTACCGCCTC CCGGCGGCGT CGGCGTCGGC
GTCGGCGTCG GTGACGATGA GCCCGACAGC TCGGACTCGC CGGTGCCCGT GATCATCGAG
AACGATCGCC CCTCGCCGCG ACCGCCCTAC ATCCCGCCCG AGACCCGCGA GCCGATGATC
TCGTTTCTCC ACGAGGAGAC CGCGCCGACC GGCCCGATGC GCACGCTCGA GCGCGCCTCG
CGCGCCGACT CGCCGCCCGA GCTGCGCGAC CGCGCCGGTG AGTTCGAGGT GCCGAGCGGG
CCGGTGCCGC GCGTCGGCAA TCGCCTCGAC GAGTCCCTGC CGGCGGTGGT CATCTCGCCT
CTGCTGCACG ACGGCGACGA GATCGGCACC GATCCTGAGC CGCTTGCCGA GTTCGCCAGC
AGCCCGACGA CCCGGGCCGA CATCGAGGGC GCCGAGCTCA CGCCGCTCAG CCACCGCCTC
GACGCCCTGT TTCCGCACCC GCGCGTGTCC ACCACCGCGC CGACCGCGCC CCGCGGCGTG
CCCACCTCGT TCGACAGCGG CCCCACGGAG CCGCGGCGAC TCGACGAGCA AGCGGTCCCC
GAGGCCGCGC CCGAGGTGGT GACCACGCGC GCCAGCAGCG GTGAGCTCGG CAGCGGGGAT
GGTCAGGGCG ACGACGACGA GGCCAGCACC GACGAGCTGG TGCTGCTCGA GCGGCCCAAG
GCCAATCGCC GCCATCGGCG CACGCGCATC GGTCTGGGCA TCGCGCCCTC CACCCTCAGC
GTGCTCACCG GTGGGCGCCG ATTTGGCCAG GGCAGCGAGG CCGGAGCCGA GGCGACCGAA
GGCGACGCTA TCGCCGGGCT CGAAACCAAC GCCGAGACCG AGGCTGCGAG CGAATCGGCG
GGCGCGTCCG CTGCGGCCGA GACCGTGGCC GAGGAGCCGC GCAGCGGTCC TGCGTCTGCG
TCTACGGGCT CGGCGGTGCC GGTTGCCGAG GAGTGGGCCG GGCTCACGCC CAGCGGCGGC
ACCATCCAGG ACGCGGCTCG GGCGGCGGCC GAGGCGTTCG GCAGCGGTGC CACGCGGCAG
CGTCAGGACG AGTCGTCGGC TGCGCAAATC GCGCCCGGCC ACCGCGGCTC CGAGCGCAAC
GCCGTGCCCG AGCTGCCGCA GGACAACCCG CGCTCCAGCG GCGCCGTTGC TGCCGATGCC
CGCGACGACG TGCGCTGGGG ACGGCCGGGG AGCACGATTC CGCCGCAGTA TCTGGGACCG
CAGCCCGATC ACGACACCGA CGACTCGGGC CCCTCGCCCA TCCCGCTGCT GTCCGAGCAT
CTCGAGCCCA CCTTCGACGA TGACGACGTC GACGAGGTGC TCAACGCCGG CTTCGAGGAG
CCGAGCGGTC CGGTGGAGCA GCCCGAGCAG CTCTCGGCGT CGCTGGCGCA GCGTGTGAGC
CCGGCGGGCG GCGGTGGGGC CCAGGCGCCG AGCGCGCGCG ACGGCGAGCC GATGACCGCC
GAGACGCTGC GCGCGCTCGA GGATTCCTCG CTGCGCCTGG TCGAGATCCT GCGCGAGCTC
GATCAGGCCC ACGAGCGCAA CACGGTCATC GACACGCTGG TCAATCATCT GGCGGAGACC
CACGAGCGCG TGGCGTTCTT TGTCGTACGC GCGGGCGAAC TGGTGACCTG GAAGCAGCGT
CTGTCCGCGG GCGGCGTCGA GCGCCGCGAT GGCATCAAGC TGAGTCTGGA CGAGCCCTCG
ACCTTTCAGG ACATCGTCGG CACCCGGCTG CCATTCCGCG GGCCCTTGAC CGACTCGGTG
TCGCGCGCGT TCATCGCCGC GGCCATGGGC TACTCCTCCG GGCAGATGCT CGCCCTGCCC
GTGGCCGTGC GCGGACGCGT GGTCGGAATC CTCTACGGCG ACACCGAGAC CGGGCACGTG
TTCGAGCAGC ACCTGGCGGT GGTGACGCGC GCGGCCGGCG TCGCCCTCGA GCGCATCCTG
CGCCTGCAAA AAGGCACCTG A
 
Protein sequence
MPEDAGALLV RSGLIASEHL RYARTAQAER GGTVGEHLVL AGYVEDGALT DFYRERLMVP 
QVSPEELAQI PEWLVAKVTA EMAAQFRCIP VACAPDRHLT VALADPSATH AVDEISFHTG
HYVVRAVATQ NQIAWCLLHY YGMMTPLGEH LLSSGQTPVP VPQTSAALLA VGRGLVAGGR
SAARGAIPRA TTERGRSGAS ASQSGRDRFA LPPPGGVGVG VGVGDDEPDS SDSPVPVIIE
NDRPSPRPPY IPPETREPMI SFLHEETAPT GPMRTLERAS RADSPPELRD RAGEFEVPSG
PVPRVGNRLD ESLPAVVISP LLHDGDEIGT DPEPLAEFAS SPTTRADIEG AELTPLSHRL
DALFPHPRVS TTAPTAPRGV PTSFDSGPTE PRRLDEQAVP EAAPEVVTTR ASSGELGSGD
GQGDDDEAST DELVLLERPK ANRRHRRTRI GLGIAPSTLS VLTGGRRFGQ GSEAGAEATE
GDAIAGLETN AETEAASESA GASAAAETVA EEPRSGPASA STGSAVPVAE EWAGLTPSGG
TIQDAARAAA EAFGSGATRQ RQDESSAAQI APGHRGSERN AVPELPQDNP RSSGAVAADA
RDDVRWGRPG STIPPQYLGP QPDHDTDDSG PSPIPLLSEH LEPTFDDDDV DEVLNAGFEE
PSGPVEQPEQ LSASLAQRVS PAGGGGAQAP SARDGEPMTA ETLRALEDSS LRLVEILREL
DQAHERNTVI DTLVNHLAET HERVAFFVVR AGELVTWKQR LSAGGVERRD GIKLSLDEPS
TFQDIVGTRL PFRGPLTDSV SRAFIAAAMG YSSGQMLALP VAVRGRVVGI LYGDTETGHV
FEQHLAVVTR AAGVALERIL RLQKGT