Gene Information |
Locus tag | Hoch_3167 |
Symbol | |
ID | 8545555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4359197 |
End bp | 4363432 |
Gene Length | 4236 bp |
Protein Length | 1411 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646387834 |
Product | A-macroglobulin complement component |
Protein accession | YP_003267562 |
Protein GI | 262196353 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.541025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACGCA CGAGCAAACG CTCGCTTCCC CTTCGTTCCC TGGCCCTGTG TCTGTCGCTG GGCGCGCTGT CCTGCGGCGC CCGCTCTGCC GCCGTCGAAT CCGGCGGCGA GGGCGGCGCC GGGGATCCCG TACAGATCGT CGAGCCCGGC ACCGTGCCGG CGCCCTCGGC GCTGCAGAAT CAGCTCGCGG CCCAGGACCC GGCGCGGCCG CAGATGCCGT CCGACGACCA GAAGCGTCTG GGCGCGGCCA TCGACACGTA CATGCAGCGG CACGCCTCGC AGCGCCTGTA CATGCACGTG GACAAGCCGC TGTACCGGCC CGGCGAGACC ATCTGGTTTC GCGTCTGGGA GCTGGCCGCG CCGACGCTCA CCAAGCAGGA GCAGAACCAC GGCGTCTCGG TCGACCTGGT GAGCCCGCGC GGCTCGCAGG TGCTGTCCAA GCGCGTGCTG GCGCAGGCCG GCGTGGCCGC GTTCGACTTC GAGCTGCCGG CCTCGGTCGA GGGCGGCGTG TATATCCTGC GCGCGCGCAG CGATCTCGGC GCCACGCTCG AGCGCGAGGT GGTGGTCTCG CAGTACCAGC CGCCGCGCAT CAAGAAGAAG GTCGAGTTTC TGCGCAAGGC CTACGGCGCC GGCGACGAGG TCGCGGCCGC GGTGTCGCTG GCGCGGGCCA CGGGCGAGCC GCTGGTGACC GACAAGGCCA CGGCCATCGT CACGGTGGAC GAGATCGAGG TGGCGCGCTT CCCGGTGCAG ACCAGCGAGG ACGGCGAGGC CGTGGCCCGC TTTACGCTGC CGCCGCAGAT CGCGCGCGGC GACGGTCTGC TCACCCTGCT GGTGGACGAC GGCGGCGCCG TGGAGTCGAT GCAGAAGCGC ATCCCCATCC TGGTCAAGTC GCTGGACTTG CAGCTCTTCC CCGAGGGCGG TCAGCTCGTG AGCGGGCTGC CGGGCCGGGT CTACTTCCAG GCCAAGAACC CGCTGGGCAA GCCGGCCGAT ATCGCCGGCC GCGTGCTCGA TGACCGCGGC CAGGTGGTGG CCCAGTTCTC GTCGCTGCAC AACGGCATGG GCCGCTTCGA GCTCACGCCG GCCAAGGGCC GCATCTATCA CGTGGAGGTG ACCAGCCCGC GCGGCATCGA GGCGCCCTTC ACGGTGCCCC CGGCGCGGCC CGATGGCTGC TCGCTGATGG CGGTGGACGA CCCCGAGGGC CAGCGCGACG AGGTGCGCGT GGCGGCCTGG TGCTCGTCGC CGCGCACGGC CGTGGTCACC GGCATCCTGC GCGAGAAGCG CCTGGCCGAT GTCGCGGTCG AGGTCGGCGC CGAGGCGCCC ACGGTGGTCG CGATCCCGGT GCCGCCGGGC GCCCAGGGCG CGATGCGGGT GACGCTCTTT GACGAGCACC TCAAGCCGGT GGCCGAGCGG CTGGTGTACA CCGGGCGCGG CCGCGATATG CAGGTGTCGA TCTCGACCGA CCGTCCCAGT TACGCGCCCC GCGACCGGGT CGCGCTCACG GTCACCACCC GCGATCTGCG CGGCAAGCCG GTGGCCGCGG ATGTGTCGCT GGCCGTGGTC GACGACACGG TGCTGAGCTT CGCCGACGAC AAGAGCGCGG CGATCCTGGC GCGCGTGTAT CTCGAGGCCG AGATGCCGGG CCAGGAGATC GAGGAGCCGC GCTTCTACTT CTCCGAGGAT CCCAAGGCCG GGCCCGCGCT CGACCTGGTG CTCGGCACCC AGGGCTGGCG CCGCTTCGTG TGGCAGGAGC TGTTCGCCGA GGCCCGCGGC GGCGGTTCTC GGAGCAGTGT GGTCGGCGGC 
GCGCTGGCCG TGCCCGAGGC GGATATGGCC ATGGACGAGG AGATGGAGGC GCTCGACGAC GCGATGCCGA TGCCGCCGCC GCCGCCCGCG CCGGTGGCCG CCCAGGCCGC GCCCGGCGCC AACGAGGCCA TGGCCGCCAA GCCCGAGGCG CCGGCCGAGC CCGCGCCCGA GCTCGAGCGC GCCGAGGCGC CGAGGCGGCT GGCGGGCGGC TTCGGCCGCG GGCTGCGCGC CCGCCGGGCG CGGCCCATGG AGAAGAAGCG GATGATGGCG GCCGACGAGG ACGAGTGGGG CGGCGAGGTC CGCGGCTGGG CCGTGGTGCG CGAGTTCCCG GCGCCCAACT ACGAGCCCGG GTATTCGGGA CCGCGGGTCG ATTTCCGCGA GACCATCTAC TGGCAGCCCT CGGTGCAGAC CGGGGCCGAC GGTACGGCCG AGGTGTCGTT CTCGCTCTCG GACGCGGTGA CCTCGTTCCG GGCCACGGCC GAGGGCGTCT CGGGCGGCGG TCTGCCCGGC CGCGGCGAGG CCCTGGTGCA GTCCAAGCTG CCGGTGTCGC TGGCCGTGAC CATGCCGCTC GAGGTCTCGG CCGGCGACTC GCTCGAGCTG CCGGTGGTGC TCACCAACGA GACCGAGCGG CCGCAGAGCG CGCGCATCAC CAGCGAGTTC GGCGCCGCCT TCCGCGTGCG CGGCGGGGTG CCCAAGCAGG TGCGGCTCGA GCCCGGCGAG CGCCAGTCCT TCTTCGCCCA GCTCGAGGTG GTCGGCAACG GCAAGGATCC CGAGGCCGGC AGCGCGCGCA TCGCCATGGA TACCGCCAAC CTGTCCGATG AAGTGGCGCG CACCATCCGC GTGGTGCCCC TGGGCTTCCC GCAGGAGCTG GCGGCCAGCG GCACGCTCGA CCAGCGGGCC ACGCACAGCT TCGAGCTGGC CGGCGCCATG CCCGGCAGCA TCGAGGCGAC GATCACGATG TATCCCTCGC CGCTCGCGAC CATGGTCCAG GGCACCGAGG CGCTGATCCG CGAGCCCGGC GGCTGCTTCG AGCAGGCCTC GAGCAGCAAC TATCCCAACG TGATGGTGCT GTCGTATCTG GAGAAGAACG ACGCCGCCGA CGTCGCCCTG GTGGAGCGCA CGATGGGCGC GCTCGACCGC GGCTACGCGC TGCTCACCGG CTATGAGAGC AAGTCCAAGG GCTACGAGTG GTTCGGCGGC GACCCCGGCC ACGAGGCGCT CACGGCCTAC GGTCTGCTGG AGTTTGTGGA TATGGCCCAG GTGTACGGCG ACGTCGATCC GCAGATGGTG CAGCGCACGC GGCGCTGGCT GATGAGCCGG CGCGACGGCG AGGGCGGCTT TCTGCGCAAC GATCGCGCGC TGGATTCCTT CGGCCGGGCG AGCGTCGAGG TGACCAATGG CTACATCACC TACGCGCTCA CGGCCGCGGG CGAGAAGGCG CTCGACAAGG AGATCGCGTA CCAGCAGCGC ATGGCCAAGG AGACCAAGGA TCCGTACCTG ATGGCGCTGG CCGCCGGCAC CCTGGTGCAC GTCAAGGCCG CCGAGGCTCA GAGCGCGGTC ACCCGGCTGC GCAGCATGCA GGCCGAGGAC GGCTCGTTCG CGGGCGCCGA TCACTCGATC ACGCGCTCGG GCGGCGAGGC CCTGATCATC GAGACCACGG CGCTGGCGGC CAAGGCCATG ATCGACCTGG GCCTGGCCGG CGACGCCGAC ACGCGCGCGG CCATCGAGTG GCTCAACGCG CACCGCGGCG GCTACGGGCA GTTTTCGTCC ACGCAGGCGA CCATCCTGGC GCTGCGCGCG CTCTCGGCCT 
ACGCCGAGGC CAGCCGGGCG ACGCAGAGCA GCGGCGTGGC CACGCTGCTG GTCAACGGCA AGCAGGCGGG CACGCTGCGC TTCGAGGCCG GCCACAAGGA CGCGCTGGTG TGGGAGGACG TGGCGCGGCT GCTGCACTCG GGCAAGAACA CGCTCGAGCT GCGCCTCGAC TCGGAGCAGT CGCTGCCGTA CAGCATCGGC ATCTCGTATC GCTCGAAGAT GCCGGCCTCG AACCCCGAGA CCGTGGTCCG CGTGGCGACC TCGCTGAGCA AGGACGAGGT GCCGGTGGGC GAGGGCGTGC GCATGAAGGT CACGGTCGAT AACACCACCG ACGAGGGCCA GCCCATGACC CTTGCGCGCG TCGGTATCCC GGGCGGCCTG GCGTTCCAGA CCTGGCAGCT CGAGGAGCTC AAGGACAAGG GCGTCATCGG CTTCTTCGAG ACCCGCGAGC GCGAGGTGGT GCTGTACTTC CGCGACCTGG CGCCCAAGGC GCACAAGGAG ATCGATATTG ATCTTCTCGC AAGGGTTCCG GGCAGCTACG TGGCGCCGGC GTCGCGCGCG TACCTGTACT ACACCGACGA GTTCAAGCAC TGGGTGCCGC CCACCGAGGT GCGCGTCACC CGCTGA
|
Protein sequence | MQRTSKRSLP LRSLALCLSL GALSCGARSA AVESGGEGGA GDPVQIVEPG TVPAPSALQN QLAAQDPARP QMPSDDQKRL GAAIDTYMQR HASQRLYMHV DKPLYRPGET IWFRVWELAA PTLTKQEQNH GVSVDLVSPR GSQVLSKRVL AQAGVAAFDF ELPASVEGGV YILRARSDLG ATLEREVVVS QYQPPRIKKK VEFLRKAYGA GDEVAAAVSL ARATGEPLVT DKATAIVTVD EIEVARFPVQ TSEDGEAVAR FTLPPQIARG DGLLTLLVDD GGAVESMQKR IPILVKSLDL QLFPEGGQLV SGLPGRVYFQ AKNPLGKPAD IAGRVLDDRG QVVAQFSSLH NGMGRFELTP AKGRIYHVEV TSPRGIEAPF TVPPARPDGC SLMAVDDPEG QRDEVRVAAW CSSPRTAVVT GILREKRLAD VAVEVGAEAP TVVAIPVPPG AQGAMRVTLF DEHLKPVAER LVYTGRGRDM QVSISTDRPS YAPRDRVALT VTTRDLRGKP VAADVSLAVV DDTVLSFADD KSAAILARVY LEAEMPGQEI EEPRFYFSED PKAGPALDLV LGTQGWRRFV WQELFAEARG GGSRSSVVGG ALAVPEADMA MDEEMEALDD AMPMPPPPPA PVAAQAAPGA NEAMAAKPEA PAEPAPELER AEAPRRLAGG FGRGLRARRA RPMEKKRMMA ADEDEWGGEV RGWAVVREFP APNYEPGYSG PRVDFRETIY WQPSVQTGAD GTAEVSFSLS DAVTSFRATA EGVSGGGLPG RGEALVQSKL PVSLAVTMPL EVSAGDSLEL PVVLTNETER PQSARITSEF GAAFRVRGGV PKQVRLEPGE RQSFFAQLEV VGNGKDPEAG SARIAMDTAN LSDEVARTIR VVPLGFPQEL AASGTLDQRA THSFELAGAM PGSIEATITM YPSPLATMVQ GTEALIREPG GCFEQASSSN YPNVMVLSYL EKNDAADVAL VERTMGALDR GYALLTGYES KSKGYEWFGG DPGHEALTAY GLLEFVDMAQ VYGDVDPQMV QRTRRWLMSR RDGEGGFLRN DRALDSFGRA SVEVTNGYIT YALTAAGEKA LDKEIAYQQR MAKETKDPYL MALAAGTLVH VKAAEAQSAV TRLRSMQAED GSFAGADHSI TRSGGEALII ETTALAAKAM IDLGLAGDAD TRAAIEWLNA HRGGYGQFSS TQATILALRA LSAYAEASRA TQSSGVATLL VNGKQAGTLR FEAGHKDALV WEDVARLLHS GKNTLELRLD SEQSLPYSIG ISYRSKMPAS NPETVVRVAT SLSKDEVPVG EGVRMKVTVD NTTDEGQPMT LARVGIPGGL AFQTWQLEEL KDKGVIGFFE TREREVVLYF RDLAPKAHKE IDIDLLARVP GSYVAPASRA YLYYTDEFKH WVPPTEVRVT R
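As a quick consistency check on the Gene Information table, the reported gene length should equal End bp minus Start bp plus one, and the protein length should equal the number of codons minus the stop codon (the gene sequence above ends in TGA). The sketch below assumes 1-based inclusive coordinates and a single untranslated stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table in this record.
start_bp = 4359197
end_bp = 4363432
reported_gene_length = 4236      # bp
reported_protein_length = 1411   # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS ending in one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # True
print(remainder == 0)                             # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the coordinates, gene length, and protein length in the record agree with one another.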