Gene Information |
Locus tag | Hoch_1870 |
Symbol | |
ID | 8544252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 2575081 |
End bp | 2577930 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646386576 |
Product | hypothetical protein |
Protein accession | YP_003266311 |
Protein GI | 262195102 |
COG category | |
COG ID | |
TIGRFAM ID | |
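
The coordinate and length fields above are mutually consistent, and the relationship can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Hoch_1870
length_bp = gene_length(2575081, 2577930)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values, the start and end coordinates span 2850 bp, which corresponds to 950 codons, or 949 residues once the stop codon is excluded.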
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00747265 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.613683 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAACCTG TGAACTTGCG ATGGATGACG TGGGGGTTGC TGCTATGGGC GGCGCTCGGC GTCGGGTGTG GCGGCGAGCC GCCCGCGGTC CCCGAGGCGC CGCCGGATGC GGCGCCGCCC GTAGGTGCGG TATCGACCTG CATGGGGTGC CACAACGGCG AAGCCGACTA CGCCGGGGAG GGGTTGAAAA ACCCGCACCC CTTCGGCGGC CCCGGCAGCA ACAACATCGA GTGTACGACA TGCCACGGCG GCGATCCCGC CGGCGCTGGA AAAGTCGGCA GCCACGTGCC GCCGCCGCCG CTCTTCGGCA CGCCGGAGAA CCCGCGGCAG AAACTTTTTA CCGACCCCGA GGCCGAGTTC AACCGCCGCA CCCTGGCCGG CATCGACAAG ATCGAGGACT ACACGATCGG GGAAAAAACC TATACCGCGC TGCAATATCT GCAATTCATC AATCCCGGTG ATCTGCGCGT GGTGTCCAAG AACCAGGGCT GCGGCGCCTG CCACGGTGCC GGCCAGCAGG GCGAGTGGTT CCATCACTCG CCCATCGGCC AGAGCTCGGG CTTCTTCGGC GGCACGCGCT TCGCGATCGG CGTGGACAAC GCGGTCGCCG AAAACCGACT CCTGTACCAG GACACGGCCG CGGACTACGG CTTCCGCGCG GTGATCGATG AGGGCTTTGA ATACGTGGCC GAAGTCATCG GTCCGGTCGG TAAGCTCATC GAGTTTCCCG AAAAAGCCGG TTATGACAAG ACGGGCGGTA TCTTCGAAAA CCCAGCGTAC AATGCCAATG CGCTGGCGGA TTATCAAGTG ACCGTACCTG GGCAGGGTGG CGCTCACGTC AACCAAGTGA TGACCGGTTC GCCGCTCGAA GACCTGATCA TGGAGCAGGT CGCGATCACC TGCGGCGACT GTCACGCGGG CTCGGCTGGC GCCAACAATC GCTTTGCCGA TTTCCGCTCC TCGGGCTGCA CCTCGTGCCA CATGGAGTAC AGCCTCGACG GCCGCAGCCG CAGCGGTGAC CCCAACGTCA ACCGTAATGA ACCCGCCAAC CCCGACGCCA TCGCGGCCGG CGAGCGCGCG CACATCAAGG ACCACCAGAT CCGCAATATC GCTCGCGACG TTCCCGGTCA CGGACCGGTG TCCGGTATCG CCGACACCGC GTGCGTGGGC TGCCACCAGG GCTCGAACCG CACCGTCTTG CAGTTCTGGG GGATTCGCCT CGATCAGAAC AAGGACCTGG TCAATAACTT CCAGTATCCG GCCAATCCCA ACAGTTTCAT CACCGCAGCG AACGACCCGC GTCTGTTCTC GGCGGCTGCG CAGAACAACA CCTTCAACGG CCGCGACGCC GATCAACTCA TCGCCTTCGA AGATTATGAC GGCGACGGCC TCGACGACAC GCCCGCCGAC GTCCATCACG AGGCCGGCAT GGTGTGCATC GACTGCCACG GTGGTCGCGA TCTGCACGGC GGAACTGCGA ATCCGGCCGA AGGCGACACC CGCAGCGGCA AGATCATCAG CCGCATGGAT CAGGGCGTGG GCATCCAGTG CGAGAGCTGT CACGGCACCA TCGATGAGTC GGCCCTGTAC AAACCGTGCA AGACTGACCT CGGCGAGTCA GCCGAGTGCG CGCAGGATCT GTGGGGCAAC CCGCTCGGCA ACGTCACCCG CAACGCGCAG GGCAAATTCT GGCTGCGCAG TCGCGGAACG GGAGATATGC ACTTGGTTCC GCAGATCAGG GATGTTGTCC ACGACAATAA CGTTACCGAC GAAGGTGGCG CGCCGATCTA CAATCCCAAG 
GCCGCGTTCG CCATGGGACG CGTCGGCGAT GGCGTCGATC AGGGGCCGAT TCAGAATGAT CCGCTGAAGA AACCCACCAA CGGCTTCACG CACACCGACA ACCTCGACTG CGCGTCCTGC CACTCGTCGT GGACCAACAA CTGCGTCGGC TGCCACCTGG CGACCGAGTA CGACGCCAAC CCGCAGAACT TCTTCTTCAG CAACATCACG GGCGAGCGCA TCGTGCTCAA GGAGGCCGCG GCCGACTTCA CGTACATCAC GCCGGTGCCG TTCCAGCTCG CGGTCAACAC CCGCGGCAAG ATCACCCAGA CCCAGCCGAA CACCAAGATG TTCTATCGCT ACACGGATCT CAACGGCGAC GAGTCCGAGG TGTTCACCTT CAACGATCGT CACGGTCTGG GCAACGATCC CGACGCGACC GGAGCGCCGC CGGCCTTGAG TCACAATGCC ATCCTGGCGC ACTCGATCCG CGGCAAGATC TCGGGCAATA ATGAGGGCCC GCGCTACTGC GTGGCCTGCC ATCTAACCGA AGATGGCCTG GCCAACCACA ATGCGAACTT CACGCAGTTC CGCAACGCGA TCGCCAATAA CAACTACGGC AACCTGAACT TCAATGTGCT CGCGCAGCAC ATCGGCCAGA ACACGGGCAA CCAGCTCAAC TCGCCGCTGT GGGTGCACAT GGTGGCCGGT CTGGGCTCGG GCTTGTTCCT GTTCGACGAG AACGGCTGTC CGGTGAATCG CCTCGACGCC AACGCCGACC GCCAGTTCTG CAACAACCAG GCGCCGAAGA ACCGGTTCAA CGCCAACAAC GTTGTCTACG ACCTCGACCG CATGATAGCC AACGCGCAGA ATGGCGCCGA GAACACCTCG AACAAGCACC CGATGCTCGA CGGCGAGGCA TCGGATCTCC GAACTGGCTC GGAGAACCAG AGGATGGCCG GCCCGCTCGG TCGAAACCTG GTCCGGCGAC TCGCGGATCC CGACTACGCC CAGGCTATCA TCCTCGACCG CTACTACGAC GCCGACGGCC AAGAGCAGAA CGTGGACTGA
Protein sequence | MKPVNLRWMT WGLLLWAALG VGCGGEPPAV PEAPPDAAPP VGAVSTCMGC HNGEADYAGE GLKNPHPFGG PGSNNIECTT CHGGDPAGAG KVGSHVPPPP LFGTPENPRQ KLFTDPEAEF NRRTLAGIDK IEDYTIGEKT YTALQYLQFI NPGDLRVVSK NQGCGACHGA GQQGEWFHHS PIGQSSGFFG GTRFAIGVDN AVAENRLLYQ DTAADYGFRA VIDEGFEYVA EVIGPVGKLI EFPEKAGYDK TGGIFENPAY NANALADYQV TVPGQGGAHV NQVMTGSPLE DLIMEQVAIT CGDCHAGSAG ANNRFADFRS SGCTSCHMEY SLDGRSRSGD PNVNRNEPAN PDAIAAGERA HIKDHQIRNI ARDVPGHGPV SGIADTACVG CHQGSNRTVL QFWGIRLDQN KDLVNNFQYP ANPNSFITAA NDPRLFSAAA QNNTFNGRDA DQLIAFEDYD GDGLDDTPAD VHHEAGMVCI DCHGGRDLHG GTANPAEGDT RSGKIISRMD QGVGIQCESC HGTIDESALY KPCKTDLGES AECAQDLWGN PLGNVTRNAQ GKFWLRSRGT GDMHLVPQIR DVVHDNNVTD EGGAPIYNPK AAFAMGRVGD GVDQGPIQND PLKKPTNGFT HTDNLDCASC HSSWTNNCVG CHLATEYDAN PQNFFFSNIT GERIVLKEAA ADFTYITPVP FQLAVNTRGK ITQTQPNTKM FYRYTDLNGD ESEVFTFNDR HGLGNDPDAT GAPPALSHNA ILAHSIRGKI SGNNEGPRYC VACHLTEDGL ANHNANFTQF RNAIANNNYG NLNFNVLAQH IGQNTGNQLN SPLWVHMVAG LGSGLFLFDE NGCPVNRLDA NADRQFCNNQ APKNRFNANN VVYDLDRMIA NAQNGAENTS NKHPMLDGEA SDLRTGSENQ RMAGPLGRNL VRRLADPDYA QAIILDRYYD ADGQEQNVD