Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3191 |
Symbol | |
ID | 8545579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 4397191 |
End bp | 4399902 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646387858 |
Product | pentapeptide repeat protein |
Protein accession | YP_003267586 |
Protein GI | 262196377 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0671183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCTTCA AGCAGAAGGA CCTCGCGAAG TTCAAGAAGG AACTCAACAG CGGGTCTGGT CTTGCCGGCC TGTGGGCAAA GACGAACGAT CCGGACGATG TCGCCGTGCT CCACCTAGCG ATGGTCACGT GCGCGTTTGG CTCAGCCCTA CACCGCTACT GGCGAACGCC GGCGATGGTG CCAGAGAAGA GCCTCGAGGC CCTCGCGAAA CACGCAGGCA TTCGACTCGC CGATGACGGC TCAGCAGACA AAGATGCCGT GGCGCGCTTG GCAAGGCTCT GCGGCAATCC GCTTAACACA CCGTATTACT GCGCCCTGTG GGAGGCGCTC AACGACCCGG CCACCGATCT CGACGCACCG ACCGAACCGA GGGAGTTCGA GAGCCACTTC CGCCGCGCGT ACTCGTTGGC GACCGCGACC GACCAGGGAC AAATCGTCCA ACGCTGGCTG CTGAGCCTGG CCGATGAGAG CGTCGAGGTG ATGCGGCGGA TTCTGGCCGA AGACCTCGCC GCCTGGGGCG AAAGGCACGT ATTCGGCAAC GTCCGCAAAC AATCCAAGGG CGACCCCATG CCCTTCCTGG GATTGGATGC ATCGTACGTC GAGCCGTACG GAGAGTACGA AGACGAGTTT AAACCCATCC TCAAACTCAT CGGCGAGCTG CTCGAGAAGC AGAAAATCGT GGTGGTCTCG GCGGATTTCG GACACGGCAA GTCGCTCACC GCCCGAAGAT TGGCCCGCGA CACAGCGCGC GCATGGCTGG AGAGCGACAC TCCCAGCCCA CAGAACCGCT ACCCCGTGTT CATCAAGTGC GCCCGCGATA TCCGCGACGC CTCTTACAAG CACGACGAGG TAGCCCGGCG CGCACTGTGG GAAGCGGCCA CGGAGGCTCT CGGTGAGGAG TCTTCGAGTG AGGAGCCGCA GTTTCAACCA CCAGACAATC AGCATGCCGC ATTGTTCATC CTCGACGGCC TGGACGAAGT CGCATTCTCG CCCAACCAAC TCGAAGACCT ATTCCGCAGC CTGCGCGAAA AACTAGGCAA GCAGCAACGG GCTATCATCT TTACGCGCCC GAGCACTTTC GACGACCGCC ACGGCCGCCC CGCAGAGAAC ATCCCTCTCA TATCTCTACT CCCATTCGAC GAGTTGCAAA TCGAAGAATG GCTGACGCGT TGGAACAACA ACCCACAGTC CGAATCCGTC AACATCGAGG AGTTGCGCGA GCATCAACCA ATTGCCGAGC TAGCCCACAC GCCCATCCTC TTGCTAATGA TTGCGATGAC CTGGCAGGAA AGCCTAGCCA AAGGAGAAAT GCGCCGCGGA GTTCTATACG AGCAGTTCTT CCGCCAAATC GCACGCGGTA AATACGAGAG CGACAGCGAC AAACACCCGG TCATCCGTAA AGCGTCCGAA CTCATCGCCG ACCGCCTCAC CGAACTCAAA TATCTCGATG CAAAGCAGTT CAGCGGCGAA GAAAGAAGTA TAGAGGCCAT GTTGTGGTTG ATGTCGCGCA TAGCCTGGGA GGCGCACGCC TATGCGTACG AACCAGAGTT TGCCGAAGAT GACCTCTCTG ACGGCGATAT CGAGCAATTG CTCAAAGAAG AGTTAAATAT TCGACGTGGC AAGTCGCCTC TTCTACAAAT GGGGTTACTG CTTGCGTTAC AGTTCGATCC ATCAGGATCA CAAACCACTG TGCTATTTGA GCATCAGTCT TTTCGAGAAT TCCTTGTTGC TCGATACTGG CAGTCGCAAC TTTTATTCCT CACCGATCCA GAGAGAGACA GCGTAGAGAT CGAGCGAGCA GAAAAGATAC TCGCCCAAAC GAGTCTTCTA CAAGACGACG ATCGCGCGTT CGATGCGCTC ATCGAAGGGT TGCAACATCT TGAAGAGCCC AAACGCACGC AGATCAAAGA CTGGGCGGCT CAATGTCTAA AAACAAGGTC CCTGACCGTC TCGCATATTC CTGACAAGGA GGACCGAGCA CCGATCCTTC GAGAAAGTGC TTTGGCGATT GGCAGCACAA TCTTCGAAGA TAGCGGATTA TCCATAGGCA TCCACGAGAT CCGATATTCA GCATTCTGGC ACGACTTTCA CAATCGCATC CTACGCATCA TTGCTCCTAA CGTAAAAAGT CCCAAAAGCA ACCTCGCCAG AGTAGACCTC GCCAGAGCCT ATCTCGCAGG CGCCGATCTC GCAGGCGCCG ATCTCGCAGG CGCCGATCTA TCCCTTGCCC ACCTTGAGCG GGCCAGCCTT GAGCGGGCCA ATTTCCGCTC TGCCAAACTC CTATATTCCA ATCTCCGATA TGCCGACCTC CGGCATGCCG GCTTCGAACA AGCCAACCTC GTACAAGCCA ACCTCATACA AGCCAACTTC GGATACGCCC GGTTCCTAGG CGCAGATCTC CGCGGCGCCC AGCTCCTAGG CGCCAACCTA CAAGACGCAA AACTTCAAAA TGCCAACTTA CAAGGCGCCA ACCTACAAGG CGCCAACCTA CAAGGCGCAA AACTTCAAAA TGCCAACCTA CAAGGCGCCG ACCTACAAGG CGCCGACCTC CGAGCCGCTA ACTTGTCCGC AGCGAACTTC CTGGGAGCGC AGTATTCGAC GGAGACCAAA TGGCCAGACG GTGTCGATCC GGAAGCGCTT GGGTGTATTT TCGTCGATTC GTCCGAGGCT GAATCGGATG CTCTCGATGA GTACGGCGAC GAGGAAGCGT GA
|
Protein sequence | MAFKQKDLAK FKKELNSGSG LAGLWAKTND PDDVAVLHLA MVTCAFGSAL HRYWRTPAMV PEKSLEALAK HAGIRLADDG SADKDAVARL ARLCGNPLNT PYYCALWEAL NDPATDLDAP TEPREFESHF RRAYSLATAT DQGQIVQRWL LSLADESVEV MRRILAEDLA AWGERHVFGN VRKQSKGDPM PFLGLDASYV EPYGEYEDEF KPILKLIGEL LEKQKIVVVS ADFGHGKSLT ARRLARDTAR AWLESDTPSP QNRYPVFIKC ARDIRDASYK HDEVARRALW EAATEALGEE SSSEEPQFQP PDNQHAALFI LDGLDEVAFS PNQLEDLFRS LREKLGKQQR AIIFTRPSTF DDRHGRPAEN IPLISLLPFD ELQIEEWLTR WNNNPQSESV NIEELREHQP IAELAHTPIL LLMIAMTWQE SLAKGEMRRG VLYEQFFRQI ARGKYESDSD KHPVIRKASE LIADRLTELK YLDAKQFSGE ERSIEAMLWL MSRIAWEAHA YAYEPEFAED DLSDGDIEQL LKEELNIRRG KSPLLQMGLL LALQFDPSGS QTTVLFEHQS FREFLVARYW QSQLLFLTDP ERDSVEIERA EKILAQTSLL QDDDRAFDAL IEGLQHLEEP KRTQIKDWAA QCLKTRSLTV SHIPDKEDRA PILRESALAI GSTIFEDSGL SIGIHEIRYS AFWHDFHNRI LRIIAPNVKS PKSNLARVDL ARAYLAGADL AGADLAGADL SLAHLERASL ERANFRSAKL LYSNLRYADL RHAGFEQANL VQANLIQANF GYARFLGADL RGAQLLGANL QDAKLQNANL QGANLQGANL QGAKLQNANL QGADLQGADL RAANLSAANF LGAQYSTETK WPDGVDPEAL GCIFVDSSEA ESDALDEYGD EEA
|
| |