Gene Hoch_3974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3974 
Symbol 
ID8546370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5476122 
End bp5478980 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content70% 
IMG OID646388646 
Productprotein of unknown function DUF214 
Protein accessionYP_003268366 
Protein GI262197157 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.146468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCG CGGGATTCAA ATGGTTCGTG GCCTGGCGCT ATCTGATGGC GCGGCCGCGG 
CGCCTGAGCC CGGCGCTGTT TCTCGTGGCC GGCCTGTCGC TGGTCACGAG CACCGGCGCG
AGCCTGCTGG CCGAGGTGTT TCGCCCGCCG GGCACGCGCT CATTCATCTC GCTCGGCGCG
GCCGAGCTGT TCGCTGGTTG CGGGCTGATT CTCGGTCTGT GGGGCGCGAT CGCGCTGGTC
GCGGGTCTGT TTCGCTTCTG CCGCTCGCAG CTCGAGGGCA GCTCGGTGCC GGCGCTGGCG
CTGGCCTCGT TCATGACCCT GGGCGTCGGC GTGTTGGCGC GCTCGGCCGA GGACGCGGCC
ACCAGCCGGG TCGGGCTGGG GCTGATGGCG CTGTCGGCGC CGCTGCTGCT GGCCTCGTTG
GTGCTCTACC TGCGCGCTGC CGGGCGCCGC GCCTGGGGCG TCTTGGCCTC GTCGGCGCTG
CTGCTCGCCG CGGGCGGGCT CACCAGCCTG TGCGCGCACG CGCTGCTCGC GATCGACGCG
GTGCCGCCGG CCGAAGCCGC GCTGCTCATC GCGCTGGCGC TGTGGGCGGT CGCCGCAGCC
GTGGCCGGCG TCGCGGCGGT GCGCCAGCGC GCCGTGCAGC GGCCCAGCCG CGTGGGCCTG
GTCGCCGCGC TGCTGCTGCT GATCGGCGCC GGCTACACCT GGTCGGCGAG CGGCGCGAGC
GAGCCCCTCG AGCCGGCGCT GGCGCCGCCC GCGGACACGC TCTTCGAGCT GGTCGTCGAC
ATCGGCGCGA TCGCGCCGAT CGTGGCGGCC GCGGGTGGGG CGCTGGCGCT GGTGTGCTTG
CTGGTGTGGT TCCGGCGTCG CTCCGAGCAC GCCACCGCCC CCGCGCCGCG TCTGCTCGCG
CCGCTGTCGG TGGGGCTGGG GCTGCTCGGC GGCTGCACCT GGCTGGCCAG CCTGGTGTTC
ACGAGCGGAT TCGAGCCCTT CCTGGTGCTG TCCGCGCGGC AGTTCTCCGA GCAGCAGGTG
CTGCTGGCGG CCACCGTGCT CCTGGCGCTC GGCCAGCTCA TCCTGCTGCT GGCGGCCATG
CGCTACTTCT TCACCTTCTT CACCACGGTG AGCGTCGCCG GCGTGACCAT CGGCTCGATG
GCCCTGGTCA TCGTGCTCAG CGTGATGAGC GGCTTCGAGA TCGATCTGCG CAACAAGATC
CTCGGCTCCA ACGCGCACAT TCTCATCACC AAGGAGGGCG ACGAGCCCTT CACCGAGTAT
CGTGAGCTGG TCGAGCGCGT GCTCGCGGTG CCCGGCGTGG TCGCCCAGAT GCCGTACCTC
ACCAGCGAGG TGGTCATCGC GGCCAACAGC AACTACGCCA ACGTCATCAT CAAGGGCGTC
GATCCCGAGA CCGTCGGCAC GGTCACCGAG CTGGGCAAGA ACACGCGCCA GCCCGACGCC
ATCGCGCGGC TGTATCCGCT CGCCGAGGAT GGCTCGGTTA TCGGCCGCCC GGCCGAGAAC
AGCGACGGTG GCGGCGAGAC GCCCGACGCG GGCGCCGGCG CCGAGACCGC GGGGCAGGGC
AGTGAGTTCG ATCCGCCGCC CGACGACATG GAGCTCGACT GGGACGTGCC CACGGATTTT
TCCGGTGGCG GCGACGGCGA CGGCGGTAGT GACGGCGCGG ACAGCGACGA GCTCCCCAGC
GGCGCTGACG AAGCATCCGC GATGCTCGAC CGCCCGCCGG CCGACATGGA GCTCGACTGG
GACGAGCCGA TGGACTTTTC CGGCTCGCCG TCCGAAGACG AGCCCGGCGG CGTCGCAGGC
GAAGCGGCGG ACGACCCGCT GGCGTTCGAC GATGAGGTGA CCGCGGAGCC AGGGATGGCG
CCGGGGGAAA TCGACACCGC GGACGATCTC CCCTTTGGCC GCGAGCGCGC GCTCGAGCTG
GGCGATTCCT TCGCCGAGGA GCTGGGGCGC GAGATCGTCG CCCGGGCCAC GAACGAGCAG
GAGCGCGAGG CGCTGGACGA CGAGCTCGAC GTCGACGAGG CCATCGCGCC GGCCAAGAAG
CGCGTGCGCA TCTCGCCGCG GGTGGCGCGG CTGCCCGGCG TCATCGTCGG CAAGGAGCTG
GTCAAGAACC TGCATCTCTA CGCCGGTCAG GAGGTGCGCA TCATCTCGCC GCTGGCCGAG
GATACGCCCG CGGGGCCGGT TCCGCGGACT CGCTATCTGC GGGTCGCCGG CACCTTCTTC
ACGGGCATGT ACGAGTACGA CTTCAAGTAC GTGTACGTGC CGCTCGACAC GCTGCAGCTC
TTCCTCGACA TGGCCGAGCA GGTCGAGGGC ATCGAGATCC GGGTCGAGGA GCCGGCCGAG
ACCGATCTCG TGGTCCGCGA GCTGCGCGCG GCGCTGCCCG AGACCTTCCG CGTCCAGGAC
TGGAAGGAGA TCAACCGCAA CCTGTTCTCG GCGCTCAAGC TGGAGAAGAT CGCCATGTTC
CTGGTGCTGG CGATCATCAT CCTGGTGGCC TCGTTCTCGA TCATCAGCAA CCTGATCATG
GTCGTGGTCG AGAAGGCCAA GGAGATCGCG CTGCTCAAGA CCCTGGGCGC GGCCGACCTC
AGCGTGGTCG GGATCTTCAT CGCGCAGGGC TTCTTCATCG GCTTCATCGG CACCATCGCG
GGCGTGGGCC ACGGCCTGCT GGCCTGCTAT CTCGGCAACG TCTACGGGCT GCCGCTCGAT
CCCGAGGTCT ATTACATCGA TCGCCTGCCC ATCCACGTGG AGTTCATCGC GGTGACCGCG
GTCACCATCG CCGGCATCGT CATCAGCGTG CTGGCCACGC TGTACCCCGC GATGATGGCC
GCGCGCTTGC GACCCATGGA GGGGCTGCGT TACGACTGA
 
Protein sequence
MKRAGFKWFV AWRYLMARPR RLSPALFLVA GLSLVTSTGA SLLAEVFRPP GTRSFISLGA 
AELFAGCGLI LGLWGAIALV AGLFRFCRSQ LEGSSVPALA LASFMTLGVG VLARSAEDAA
TSRVGLGLMA LSAPLLLASL VLYLRAAGRR AWGVLASSAL LLAAGGLTSL CAHALLAIDA
VPPAEAALLI ALALWAVAAA VAGVAAVRQR AVQRPSRVGL VAALLLLIGA GYTWSASGAS
EPLEPALAPP ADTLFELVVD IGAIAPIVAA AGGALALVCL LVWFRRRSEH ATAPAPRLLA
PLSVGLGLLG GCTWLASLVF TSGFEPFLVL SARQFSEQQV LLAATVLLAL GQLILLLAAM
RYFFTFFTTV SVAGVTIGSM ALVIVLSVMS GFEIDLRNKI LGSNAHILIT KEGDEPFTEY
RELVERVLAV PGVVAQMPYL TSEVVIAANS NYANVIIKGV DPETVGTVTE LGKNTRQPDA
IARLYPLAED GSVIGRPAEN SDGGGETPDA GAGAETAGQG SEFDPPPDDM ELDWDVPTDF
SGGGDGDGGS DGADSDELPS GADEASAMLD RPPADMELDW DEPMDFSGSP SEDEPGGVAG
EAADDPLAFD DEVTAEPGMA PGEIDTADDL PFGRERALEL GDSFAEELGR EIVARATNEQ
EREALDDELD VDEAIAPAKK RVRISPRVAR LPGVIVGKEL VKNLHLYAGQ EVRIISPLAE
DTPAGPVPRT RYLRVAGTFF TGMYEYDFKY VYVPLDTLQL FLDMAEQVEG IEIRVEEPAE
TDLVVRELRA ALPETFRVQD WKEINRNLFS ALKLEKIAMF LVLAIIILVA SFSIISNLIM
VVVEKAKEIA LLKTLGAADL SVVGIFIAQG FFIGFIGTIA GVGHGLLACY LGNVYGLPLD
PEVYYIDRLP IHVEFIAVTA VTIAGIVISV LATLYPAMMA ARLRPMEGLR YD