Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3974 |
Symbol | |
ID | 8546370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 5476122 |
End bp | 5478980 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646388646 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003268366 |
Protein GI | 262197157 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.146468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCG CGGGATTCAA ATGGTTCGTG GCCTGGCGCT ATCTGATGGC GCGGCCGCGG CGCCTGAGCC CGGCGCTGTT TCTCGTGGCC GGCCTGTCGC TGGTCACGAG CACCGGCGCG AGCCTGCTGG CCGAGGTGTT TCGCCCGCCG GGCACGCGCT CATTCATCTC GCTCGGCGCG GCCGAGCTGT TCGCTGGTTG CGGGCTGATT CTCGGTCTGT GGGGCGCGAT CGCGCTGGTC GCGGGTCTGT TTCGCTTCTG CCGCTCGCAG CTCGAGGGCA GCTCGGTGCC GGCGCTGGCG CTGGCCTCGT TCATGACCCT GGGCGTCGGC GTGTTGGCGC GCTCGGCCGA GGACGCGGCC ACCAGCCGGG TCGGGCTGGG GCTGATGGCG CTGTCGGCGC CGCTGCTGCT GGCCTCGTTG GTGCTCTACC TGCGCGCTGC CGGGCGCCGC GCCTGGGGCG TCTTGGCCTC GTCGGCGCTG CTGCTCGCCG CGGGCGGGCT CACCAGCCTG TGCGCGCACG CGCTGCTCGC GATCGACGCG GTGCCGCCGG CCGAAGCCGC GCTGCTCATC GCGCTGGCGC TGTGGGCGGT CGCCGCAGCC GTGGCCGGCG TCGCGGCGGT GCGCCAGCGC GCCGTGCAGC GGCCCAGCCG CGTGGGCCTG GTCGCCGCGC TGCTGCTGCT GATCGGCGCC GGCTACACCT GGTCGGCGAG CGGCGCGAGC GAGCCCCTCG AGCCGGCGCT GGCGCCGCCC GCGGACACGC TCTTCGAGCT GGTCGTCGAC ATCGGCGCGA TCGCGCCGAT CGTGGCGGCC GCGGGTGGGG CGCTGGCGCT GGTGTGCTTG CTGGTGTGGT TCCGGCGTCG CTCCGAGCAC GCCACCGCCC CCGCGCCGCG TCTGCTCGCG CCGCTGTCGG TGGGGCTGGG GCTGCTCGGC GGCTGCACCT GGCTGGCCAG CCTGGTGTTC ACGAGCGGAT TCGAGCCCTT CCTGGTGCTG TCCGCGCGGC AGTTCTCCGA GCAGCAGGTG CTGCTGGCGG CCACCGTGCT CCTGGCGCTC GGCCAGCTCA TCCTGCTGCT GGCGGCCATG CGCTACTTCT TCACCTTCTT CACCACGGTG AGCGTCGCCG GCGTGACCAT CGGCTCGATG GCCCTGGTCA TCGTGCTCAG CGTGATGAGC GGCTTCGAGA TCGATCTGCG CAACAAGATC CTCGGCTCCA ACGCGCACAT TCTCATCACC AAGGAGGGCG ACGAGCCCTT CACCGAGTAT CGTGAGCTGG TCGAGCGCGT GCTCGCGGTG CCCGGCGTGG TCGCCCAGAT GCCGTACCTC ACCAGCGAGG TGGTCATCGC GGCCAACAGC AACTACGCCA ACGTCATCAT CAAGGGCGTC GATCCCGAGA CCGTCGGCAC GGTCACCGAG CTGGGCAAGA ACACGCGCCA GCCCGACGCC ATCGCGCGGC TGTATCCGCT CGCCGAGGAT GGCTCGGTTA TCGGCCGCCC GGCCGAGAAC AGCGACGGTG GCGGCGAGAC GCCCGACGCG GGCGCCGGCG CCGAGACCGC GGGGCAGGGC AGTGAGTTCG ATCCGCCGCC CGACGACATG GAGCTCGACT GGGACGTGCC CACGGATTTT TCCGGTGGCG GCGACGGCGA CGGCGGTAGT GACGGCGCGG ACAGCGACGA GCTCCCCAGC GGCGCTGACG AAGCATCCGC GATGCTCGAC CGCCCGCCGG CCGACATGGA GCTCGACTGG GACGAGCCGA TGGACTTTTC CGGCTCGCCG TCCGAAGACG AGCCCGGCGG CGTCGCAGGC GAAGCGGCGG ACGACCCGCT GGCGTTCGAC GATGAGGTGA CCGCGGAGCC AGGGATGGCG CCGGGGGAAA TCGACACCGC GGACGATCTC CCCTTTGGCC GCGAGCGCGC GCTCGAGCTG GGCGATTCCT TCGCCGAGGA GCTGGGGCGC GAGATCGTCG CCCGGGCCAC GAACGAGCAG GAGCGCGAGG CGCTGGACGA CGAGCTCGAC GTCGACGAGG CCATCGCGCC GGCCAAGAAG CGCGTGCGCA TCTCGCCGCG GGTGGCGCGG CTGCCCGGCG TCATCGTCGG CAAGGAGCTG GTCAAGAACC TGCATCTCTA CGCCGGTCAG GAGGTGCGCA TCATCTCGCC GCTGGCCGAG GATACGCCCG CGGGGCCGGT TCCGCGGACT CGCTATCTGC GGGTCGCCGG CACCTTCTTC ACGGGCATGT ACGAGTACGA CTTCAAGTAC GTGTACGTGC CGCTCGACAC GCTGCAGCTC TTCCTCGACA TGGCCGAGCA GGTCGAGGGC ATCGAGATCC GGGTCGAGGA GCCGGCCGAG ACCGATCTCG TGGTCCGCGA GCTGCGCGCG GCGCTGCCCG AGACCTTCCG CGTCCAGGAC TGGAAGGAGA TCAACCGCAA CCTGTTCTCG GCGCTCAAGC TGGAGAAGAT CGCCATGTTC CTGGTGCTGG CGATCATCAT CCTGGTGGCC TCGTTCTCGA TCATCAGCAA CCTGATCATG GTCGTGGTCG AGAAGGCCAA GGAGATCGCG CTGCTCAAGA CCCTGGGCGC GGCCGACCTC AGCGTGGTCG GGATCTTCAT CGCGCAGGGC TTCTTCATCG GCTTCATCGG CACCATCGCG GGCGTGGGCC ACGGCCTGCT GGCCTGCTAT CTCGGCAACG TCTACGGGCT GCCGCTCGAT CCCGAGGTCT ATTACATCGA TCGCCTGCCC ATCCACGTGG AGTTCATCGC GGTGACCGCG GTCACCATCG CCGGCATCGT CATCAGCGTG CTGGCCACGC TGTACCCCGC GATGATGGCC GCGCGCTTGC GACCCATGGA GGGGCTGCGT TACGACTGA
|
Protein sequence | MKRAGFKWFV AWRYLMARPR RLSPALFLVA GLSLVTSTGA SLLAEVFRPP GTRSFISLGA AELFAGCGLI LGLWGAIALV AGLFRFCRSQ LEGSSVPALA LASFMTLGVG VLARSAEDAA TSRVGLGLMA LSAPLLLASL VLYLRAAGRR AWGVLASSAL LLAAGGLTSL CAHALLAIDA VPPAEAALLI ALALWAVAAA VAGVAAVRQR AVQRPSRVGL VAALLLLIGA GYTWSASGAS EPLEPALAPP ADTLFELVVD IGAIAPIVAA AGGALALVCL LVWFRRRSEH ATAPAPRLLA PLSVGLGLLG GCTWLASLVF TSGFEPFLVL SARQFSEQQV LLAATVLLAL GQLILLLAAM RYFFTFFTTV SVAGVTIGSM ALVIVLSVMS GFEIDLRNKI LGSNAHILIT KEGDEPFTEY RELVERVLAV PGVVAQMPYL TSEVVIAANS NYANVIIKGV DPETVGTVTE LGKNTRQPDA IARLYPLAED GSVIGRPAEN SDGGGETPDA GAGAETAGQG SEFDPPPDDM ELDWDVPTDF SGGGDGDGGS DGADSDELPS GADEASAMLD RPPADMELDW DEPMDFSGSP SEDEPGGVAG EAADDPLAFD DEVTAEPGMA PGEIDTADDL PFGRERALEL GDSFAEELGR EIVARATNEQ EREALDDELD VDEAIAPAKK RVRISPRVAR LPGVIVGKEL VKNLHLYAGQ EVRIISPLAE DTPAGPVPRT RYLRVAGTFF TGMYEYDFKY VYVPLDTLQL FLDMAEQVEG IEIRVEEPAE TDLVVRELRA ALPETFRVQD WKEINRNLFS ALKLEKIAMF LVLAIIILVA SFSIISNLIM VVVEKAKEIA LLKTLGAADL SVVGIFIAQG FFIGFIGTIA GVGHGLLACY LGNVYGLPLD PEVYYIDRLP IHVEFIAVTA VTIAGIVISV LATLYPAMMA ARLRPMEGLR YD
|
| |