Gene Information |
Locus tag | Haur_2976 |
Symbol | |
ID | 5734848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3754215 |
End bp | 3757499 |
Gene Length | 3285 bp |
Protein Length | 1094 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280120 |
Product | abortive infection protein |
Protein accession | YP_001545742 |
Protein GI | 159899495 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
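The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 3754215
end_bp = 3757499
gene_length_bp = 3285
protein_length_aa = 1094


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions, 3757499 - 3754215 + 1 = 3285 bp, and 3285 / 3 - 1 = 1094 aa, which matches the table.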
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.292884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCATAAGC CCGATACCAA ACGACTGATT CCCCCAATTT TTGTTCTTTT AGCAATATTT GGCATTTTCT TAGCGATTGT GTTTGCCGCA GAATCAACGC CCATGGCCGA GATTCGCTTT AATCTGGATC GTGAGGCGGC TTTACAACGC TCAGTCGAAG CATTACAGGC AGCGGGTGGC GATCCCAGCC AGTTTACCCA AACAATCACT TTTGGCTCGA ACAACGATGC TCAATCGTAT CTGATTCGCG AACGCAGCCG CGCACTGCTC AACCAACGGG TTGACGAAGA TCTTAATCTG GCCAGTTGGA ATGTGCGGTT TTGGCGTGAG CTTGATCCAG AGCAATGGTC GATTAGTCTT TCACCAGCCA CAGGCCGCAT CTTGGCAATT AATCATATGC GCCCTGATGA AGCGGCTGGA GCAACGTTGA GCCAAACCGA GGCCTTGCTT CTGGCCCAAG CTCAATTGCC AATTCCGCTT GATCAATTAA CCTTGCTCGA TCAATTTACC AATCAACAGC CCAATCGTAC CGACCATACC TTTATTTGGC AGCGCCGTGA TATCAGCGAT GCTGAGGCTC AATATCGCTA CAGTGTTACA ATTGCTGGCG ATCAATTGGG CCAGCGTGGC GAGTATTATT GGCTGCCCCA ATCGTGGTAT TTGGATAAAG ATTGGCAACT GCGCCGTGGT GGTATTCTCA ACCACACTGG CTGGACGCTG ACCTATGCCC TGACTGCCTT GATTGGCGTG GCTTGGTTTA TTCAGGCACG GCGGGGGCGA TTACGCTGGC GTTGGGCCTT ACGCCTGTTT GCAGCGGTCG CGGCGGTCGG CGTGTTGGTC ATGCTCAACA GCATTCCGCT CGATTTGGCT CATTATGATA TTAACCAAAG CCTGCCGGTT TACTGGGGCA ATGTACTTGG TGGTTATGTT GGTCAACTGG TCACAATCGC CACCACAATT ATTTTGGCGG GCATGGCTGG CGAGGCGCTA GTTTGGGAAG AAACCGAGGG CACTGTTTCG TTGAGCGAAA CCTTGACCCA CCGTGGGTTA GTGAGTCGCC CAGTCGTGCA GGCTTTATGG GTTGGCGGCT TGGTTGGAAT TTTCCAGCTA GGCTTTGTTT CGGCCTTTTA TGCCTTGGGT AGCCGCTATT TTGGGGTTTG GTCGCCCGTT ACGCCCTTGT ATGATGATAC GATTGCTACG CCGTTTCCCG CCTTGTATGG CATGGCGCTG GGCTTGTTGC CCGCGATCGG CGAGGAATTG ATCTTTCGCT TGGGCGGGAT TACGGTGTTG ACCCGTTATT TTGGTCGGCC CAAGCTGGCA ATCATTGTGA CAGCCGTGGT TTGGGCCGCA CTGCATGCCA CCTATCCCCA ACATCCGTTT TATATTCGCG TGGTTGAATT AAGCATTATC GGGATTTTGT TTGGCTTTTT GAGCGTGCGC TACGGCGTGT TGGCCTCAAT CGCGGCCCAC TATACCTACA ATGCCTCGCT GTATGTGCCA CTTTTTTGGA AAACCAATAA CTTCTATTTG CTTAGCGGGA TTGCCGCCGC CGCCCTGGTG TTGTGGTTGC TGGTTCCGGC CATCATTCGC CAATTGCGGG GAATTCCGCT TGAGAGCGAT AATACAATTC GCGCCGCGCT GCCACCCCAA GTGCCCGAGC CAGTTGTGCC ACAATTAAGC TGGCAATGGC GAAGCGATTG GAAACTGTTT GCAGGTCTAT CGGGCTTGGC GGTCATTATG TTGCTGGTGA TTGGGCTGAA TCGTGCGCCT GCCTTGACCC GCAACGGCGT GCGCCAAGAT 
ATGGTTACCC GCACCGAGGC CCTCGCCCAA GAGCGCAAAA TCGATCTAAC AGGCCTGAAC CCCAGTGTTA CGGTCGTGGC CGATTGGGTT GACCTTGATT TAGCCTATAT CTACGATCAA CTAGACCCTG AGCAAACCGC TGCTGCGATT GAAAAAGGCA CAGTTCGCGC CTGGAGCGTG CGCTGGTCGA ATTGGGATCG ACCTGAGTAT AGCTGGTTGG TATATCTCGA TCCAGCTGGG CGTTTGCTAT CGTATCGCTT AAGTTTGCCC GAAAATGCCA AGGCGATTTC AACCACGCTC GAACAAGCGC AAACGATCGC AACCACCCAT GCCAGCCAAT TTATCGCGCT TGATCAGTAT GAGTTGAGTA ATACTAGTAC CAGCCAAAAG CCCAATCGCA ATGATTATAG CTTTGTTTGG CAGACTAAAA TGCCATGGAT TGGCGAGGCC TATCGACGCT TCGAGGTAAC CGTAGCTGGC GATCAAGTAA TTGTCAGCTC GCCCTCGATG TATACCCCGC CTGAATATCG GCGTGAGCGC GACCAAACTA CCCTGAGCGA AAGCATTCTG AGCAATTTGC GCAGCGCCTT GCGTGGCATT CCGGCCACAA TCTTGCCAAT TTTGGGCTTG ATTGGCATTT TTCGGCGGCG TACTGAGCTT TGGCCGTGGG TTTGGCTAGG AATTATTGCC GGGATTGGCT ACTTAATTCA AGGTGCTGGG CGTTGGACAT TTGCCCAAGT TAGCGAACTA CCACGCTTTA TTTTAGCAAT CAGCCAAACC TTGGCGAATG CCGTCTTGAA TGGTGGCGAG CTAGCCTTAT TGGGAGCAGG CGCGGCGACC GCTTGGAACC TGACCAAAAA CGAGCAACAA TTGCCGCTTG AGGCTTTTAT ACGGGCAATT CCGCATCGAA TTCAAGATTT TGTGGCTGAT CGAGCGCAAG TTTTGCGGCG CGAAAGCATT GTGTTAGGGA TTTTGATCGT GCCATGGATC TTGCTGATTC GCAGCAGCAT TGGGTTTACA ACAGCCAAAG CTGGGTTTTG GGCCAGTGTT CAACCATTGA ACGCACAATC AGCAATGCTT GATATACTAT TACAAGCAAC ATTTGATGCA ATTACCACAA GTTTGCTCTT GATCGGTAGC CTGAGCATGC TAACATGGGT AGTACGTGGG AAGCAGCAGA TCGCCTTGGC AATTACCTGT CTTGGAATGG CATTGGTCCT CTTGCCGTTA CGCGAACCAG CTCAATGGCT CGTTTTAGGG CTAGCTGTGC TGTTGAGCTT TTGGCTTGGC CGCATGCTGC GCTGGAATGG CTTGGCTTTA ACTGTTGCTT TATGGTTGGC AAACATTGTA CCAGCCGCCC TAACCTTACT TGCAACAACC CCACTTGCAT TGCAACTGAA CGGCGCAGCC TTGATTGTGT TGCTCTGTGG CTTTTGCGGG TGGTATCTCG GTGGATGGTG GCAAACACAA AATGCAGAAA ATTAA
|
Protein sequence | MHKPDTKRLI PPIFVLLAIF GIFLAIVFAA ESTPMAEIRF NLDREAALQR SVEALQAAGG DPSQFTQTIT FGSNNDAQSY LIRERSRALL NQRVDEDLNL ASWNVRFWRE LDPEQWSISL SPATGRILAI NHMRPDEAAG ATLSQTEALL LAQAQLPIPL DQLTLLDQFT NQQPNRTDHT FIWQRRDISD AEAQYRYSVT IAGDQLGQRG EYYWLPQSWY LDKDWQLRRG GILNHTGWTL TYALTALIGV AWFIQARRGR LRWRWALRLF AAVAAVGVLV MLNSIPLDLA HYDINQSLPV YWGNVLGGYV GQLVTIATTI ILAGMAGEAL VWEETEGTVS LSETLTHRGL VSRPVVQALW VGGLVGIFQL GFVSAFYALG SRYFGVWSPV TPLYDDTIAT PFPALYGMAL GLLPAIGEEL IFRLGGITVL TRYFGRPKLA IIVTAVVWAA LHATYPQHPF YIRVVELSII GILFGFLSVR YGVLASIAAH YTYNASLYVP LFWKTNNFYL LSGIAAAALV LWLLVPAIIR QLRGIPLESD NTIRAALPPQ VPEPVVPQLS WQWRSDWKLF AGLSGLAVIM LLVIGLNRAP ALTRNGVRQD MVTRTEALAQ ERKIDLTGLN PSVTVVADWV DLDLAYIYDQ LDPEQTAAAI EKGTVRAWSV RWSNWDRPEY SWLVYLDPAG RLLSYRLSLP ENAKAISTTL EQAQTIATTH ASQFIALDQY ELSNTSTSQK PNRNDYSFVW QTKMPWIGEA YRRFEVTVAG DQVIVSSPSM YTPPEYRRER DQTTLSESIL SNLRSALRGI PATILPILGL IGIFRRRTEL WPWVWLGIIA GIGYLIQGAG RWTFAQVSEL PRFILAISQT LANAVLNGGE LALLGAGAAT AWNLTKNEQQ LPLEAFIRAI PHRIQDFVAD RAQVLRRESI VLGILIVPWI LLIRSSIGFT TAKAGFWASV QPLNAQSAML DILLQATFDA ITTSLLLIGS LSMLTWVVRG KQQIALAITC LGMALVLLPL REPAQWLVLG LAVLLSFWLG RMLRWNGLAL TVALWLANIV PAALTLLATT PLALQLNGAA LIVLLCGFCG WYLGGWWQTQ NAEN
|
| |
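The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (listed in the Gene Information table), GTG is an accepted alternative initiation codon and is read as methionine at the start position, although it codes for valine elsewhere. The sketch below illustrates this for the first six codons only; the codon dictionary is deliberately partial, and the names `CODONS`, `INITIATION_CODONS`, and `translate_prefix` are hypothetical helpers for this illustration.

```python
# Partial codon table: only the codons needed for the first six codons of this gene.
CODONS = {
    "GTG": "V",  # valine when internal to the reading frame
    "CAT": "H",
    "AAG": "K",
    "CCC": "P",
    "GAT": "D",
    "ACC": "T",
}

# Subset of the table 11 initiation codons; at position 0 they are read as methionine.
INITIATION_CODONS = {"ATG", "GTG"}


def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate the first n_codons of a CDS, treating the first codon as a start codon."""
    seq = dna.replace(" ", "").upper()
    residues = []
    for i in range(n_codons):
        codon = seq[3 * i : 3 * i + 3]
        if i == 0 and codon in INITIATION_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First two 10-base blocks of the gene sequence, as printed in the record.
gene_prefix = "GTGCATAAGC CCGATACCAA"
print(translate_prefix(gene_prefix, 6))  # MHKPDT
```

The result agrees with the first six residues of the protein sequence (MHKPDT). A full translation would need the complete table 11 codon set, for example from an established bioinformatics library.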