Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3457 |
Symbol | |
ID | 5735318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4346603 |
End bp | 4348459 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280604 |
Product | hypothetical protein |
Protein accession | YP_001546221 |
Protein GI | 159899974 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACAAT TATTTGGATG TCTTTGCATC CTGATCGTGG TGCTGAGTTT CAGTTTTTCC CCGCGCCAGC CAGCCCAAGC GGCTCCGATG ATCGAGTATC GTGATCCTGC TCCCTTTTCT AAAGCGGTAA AACCCACCAA CACGATCGCG ATTCGCTTAG GGCCAAGCCT GACCAAAGCC GCCGTTGCAA CCGTAGAATT TCAGGTTGTT GGCTCACGCA GTGGTTTGCA TGCAGGCGAC GTTGTGCTAG CCGAGGATCA ACGCACGGTT ATTTTTAAGC CAGCCAGCCC ATTTGTGTTT GGCGAAACCG TGCGCGTATC GATTAAAAGC AGCGATATTG CCGAGCTTGA TCAGCAAACA TGGTCGTTTG AAGTAGTTGA GCGCTTGGTT AAACAAACTG ATCAGGTCAA GCAACTGCAA ACTGAGCTTG CCGCCGAATT GGCTACTCAA GCTAAAGCAG CACAACCAAG TGGTAGCAGC CCGGTCTTGC GCACTGTGCC CTTTAATCTG CCGCCGTTGA CGGTTACGCT AGCGATTAGC AATACCCCTG GCTATATTTT TGTTAGTCCA TTTAGCTGGA TCAGCAATGT CACGCCCAAT CGCTATTTAA TGATGGTTGA TAACACTGGT GCGCCAATCT ACTACAAGGG GCTAGGTAGT GGCCGCTTTT CACTCGATTT TCGCAAAATT GCCGAGGATA AACTGGTCTA TTTCGATACC AGTACCCTCA GCTACCACGT GATGAATCAG CAATACCAAG AGCTTGGCCA ATATCGGGCT GGCAATGGCT ATCAGATCGA TTTTCATGAA TTTTTGATGC TGCCAAATGG CCATGTCATT TTTATGATCT ACGATGATAT TCCCTATGAT TTAAGCCCTT ACGGCGGCGA AGAGAATGCC ATCCTGACCG AATTGGTGTT GCAAGAGCTG GATACTGCTG GTAACGTCGT CTTCCAATGG CGCTCGACTG AGCATATTCC AGTTTATGAT AGTAGCCATA GTTTGGCTGG AACGGCTCCG GTCGATTATA TTCATGGCAA TGCGATTGAT GTTGATACCG ATGGTCATTG GCTGGTTTCA AGCCGCCATA CCGACGAAAT TACCAAGATT AATCGCCAAA CGGGCGCGGT TATTTGGCGC TTAGGTGGCG AGGGCAATCA ATTTCTCTAT TTGGAAGATA GCCCGCGATT CTACCATCAG CATGATATTC GGCGCTTGGC CAATGGCAAT ATTATGTTGT ATAACAATTG GAACACCTTG CCCCGCTCGC CGGATTCGTT CTCGGCGGCG CTGGAATATG AGATCGATGA AGTTGCGAAA ACTGTGCGTT TGGTTAAGCG CTATCGGGCA ACTCCCGACT ATTTTGCCAC AGCGATGGGC AATGCGCAAC GCCTGCCCAA CGGCAATACT GGCATCGGCT GGGGCAGCAT TCAGCCCTTG TATACCGAAT TCAACTCTCA AGGCCAAGCA GTTTTTGAAT TAACTGCGGC GGCACCGATG GTGAGTTATC GCTCGATGCG CTTTGAATGG CAAGGTGACC CACCATGGCC GCCAACTTTA GTGACCCAAA GCCTTGCTAA CACCACCAAC TTATACTATA GCTGGAATGG TGCGACCGAA GTTGCCGATT ATCAGGTGTT TACTGGGGTT ACTAGCACAA CCTTGAGTTT GCAAAATACC ACACCCAAAA CTAGCTTTGA AACCAATACG ACGGTGGTTA ATAGTGACCA TTGTTTTGCC CAAGTCCGTG CCCGTAATAG CCAAGGCACA GTTTTAGGTT CCTCGGAAAT TGCCTTCTTG GCCAGCGATA CCTGTACTCC TAACCGCATG TATTTGCCAG CCCTAACCAC CCAATAA
|
Protein sequence | MRQLFGCLCI LIVVLSFSFS PRQPAQAAPM IEYRDPAPFS KAVKPTNTIA IRLGPSLTKA AVATVEFQVV GSRSGLHAGD VVLAEDQRTV IFKPASPFVF GETVRVSIKS SDIAELDQQT WSFEVVERLV KQTDQVKQLQ TELAAELATQ AKAAQPSGSS PVLRTVPFNL PPLTVTLAIS NTPGYIFVSP FSWISNVTPN RYLMMVDNTG APIYYKGLGS GRFSLDFRKI AEDKLVYFDT STLSYHVMNQ QYQELGQYRA GNGYQIDFHE FLMLPNGHVI FMIYDDIPYD LSPYGGEENA ILTELVLQEL DTAGNVVFQW RSTEHIPVYD SSHSLAGTAP VDYIHGNAID VDTDGHWLVS SRHTDEITKI NRQTGAVIWR LGGEGNQFLY LEDSPRFYHQ HDIRRLANGN IMLYNNWNTL PRSPDSFSAA LEYEIDEVAK TVRLVKRYRA TPDYFATAMG NAQRLPNGNT GIGWGSIQPL YTEFNSQGQA VFELTAAAPM VSYRSMRFEW QGDPPWPPTL VTQSLANTTN LYYSWNGATE VADYQVFTGV TSTTLSLQNT TPKTSFETNT TVVNSDHCFA QVRARNSQGT VLGSSEIAFL ASDTCTPNRM YLPALTTQ
|
| |