Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4025 |
Symbol | |
ID | 5735886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5137458 |
End bp | 5139317 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641281175 |
Product | hypothetical protein |
Protein accession | YP_001546785 |
Protein GI | 159900538 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000214735 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA AAACATTAAC GAGTTTGTTT GCCGAGTCAA TCTACCAAAT TCCAGATTAT CAGCGTGGTT ATGCTTGGGA AGAAAAACAG TGGAAAGATT TTATACAAGA TATTGATGCC CTGGTCGATG AGCAGGTGAC CAGTCATTAT ACTGGAACCG TTGTAGTTTA CGAAGGCCGC GACGCTGAAA AGCGACCATA TGGCCGAAAG AAGCTAAAAG TGCTTGATGT GGTTGATGGA CAGCAGCGAT TAACCACCAC TTGCCTCTAT CTTTCAGTAA TTATTCGCGC ACTGATTCAG CATGGAGAGT CGGACTATGA GCGCGATATT GATGATTTTC TGTATGCAGG CGCAACCTGC AAGCTGAACC TCAATAATGA AACTGGCGAC ATCTTTTATG ATCTTTTAAA GACAGGCTAT GTCAATACCC CGCTTCAGTC GCCACATCAG CATCGGCTCG TCGAGGCCCA CCGTCGCTTT CAACACCACA TTAGCGAGCA GTTGCAGCAA CGAGGTGGCG CTGGGGTCGC TTATCTCAAA GAATTGCATT ATGCGATTAC CCAAAAACTC AATTTTACCT TCTATGTGAT CGAATCAGAA GCCGAAATTG GCATGACCTT CGAGTTGATG AACTCGCGGG GCAAGGATCT TTCGGTGCTT GAACTGCTCA AAAATTATTT AATGCACTGG GTTTCGCGCA ACGAAAACAA CCTTGCAGAT CGTGAAACCC TCACCAAACT GATCAATCGC AGTTGGAAAG ACACCTACAC CAACCTTGGC GCAAGTTCGG GCAACAATGA AGATCAATGT CTACGCATCG CTTGGACGCT CTATTGCAGC CATTCCCCAG CCAATTGGCA TGGGTATGAA GGCTTCAAAG CTGATGAATA CATCCCGCTG AGAACATTTA GTAAACGCAC GAAGGCCGAG ACAAAAATAT TTATCGAGCA CTTTGTGATG GGCCTTGCTG AAGTTTCACA TCATTATGCC AGCATTATCA ATCCAACCAC CACCACAGCG CTATTCGAAG CTGAGCGGAT TTGGCTCAGC AAGATTCGGC ATACCGGCAA CATTGCCAAT TTCTTGCCCT TGATGGTGGC AGCCCGCAAG CAATACCAAG CAGGGCAGAT TAGCGAAGGC GCATACATCG ACATGCTCAA GGCACTCGAA TGCTATGCCT ATCGTGTATT TCTTTGGGCA GCCCGCCGCA GCAATGCTGG TAAATCAAGC TTCTATCGTT GGGGATACGA GATCTTTACT CAGCCGCAAC TGATCAGTGA TATTACGCGC GGGATTCATC AACTGACCCG CTACTATGCA CCTGAAGATG ATTTTATCAA CGGCAATGCC AACCCCAGCG ATTGGTATAG AACTCGGAAT CGCTTGAGGT ACACCCTGTT TGAGTATGAG TTACATCTGC TTGCGACCGA GGGGAAAAAT AGCGAACCAC GACTTGGCTG GGATCAGCTC AGCGATTCGA CGATTGAGCA TATTCTGCCG CAGAATCCAG CAAAACATTC GCATTGGAAT GGCGTATGGA ACAAAACCGC GTTCAATGCA AGTGTCCACG ATATCGCCAA TCTTGTGCTT ACCCACAATA ATGCCAGCTA TAGCAACTTT GAGTTTGCCC GCAAAAAGGG CCAACCAGGC CTAAGTCCTA GTTATAGCGA TTCTGATATT CGCCAAGAAC GTAAACTCGC GGCCTTTGCC GATTGGACTC CCAAAGAGTT TGCTGAACGC CGAAACGAGT TGATCATATG GATCAATCAG CGCTGGAAAA CCGTCGGCGA ACCCGACAAT GCAACGTTGG AAGTCAACGA CGAGGCTGAT GACGATGGCA TCGAGCATCA AGAAGGATAA
|
Protein sequence | MNKKTLTSLF AESIYQIPDY QRGYAWEEKQ WKDFIQDIDA LVDEQVTSHY TGTVVVYEGR DAEKRPYGRK KLKVLDVVDG QQRLTTTCLY LSVIIRALIQ HGESDYERDI DDFLYAGATC KLNLNNETGD IFYDLLKTGY VNTPLQSPHQ HRLVEAHRRF QHHISEQLQQ RGGAGVAYLK ELHYAITQKL NFTFYVIESE AEIGMTFELM NSRGKDLSVL ELLKNYLMHW VSRNENNLAD RETLTKLINR SWKDTYTNLG ASSGNNEDQC LRIAWTLYCS HSPANWHGYE GFKADEYIPL RTFSKRTKAE TKIFIEHFVM GLAEVSHHYA SIINPTTTTA LFEAERIWLS KIRHTGNIAN FLPLMVAARK QYQAGQISEG AYIDMLKALE CYAYRVFLWA ARRSNAGKSS FYRWGYEIFT QPQLISDITR GIHQLTRYYA PEDDFINGNA NPSDWYRTRN RLRYTLFEYE LHLLATEGKN SEPRLGWDQL SDSTIEHILP QNPAKHSHWN GVWNKTAFNA SVHDIANLVL THNNASYSNF EFARKKGQPG LSPSYSDSDI RQERKLAAFA DWTPKEFAER RNELIIWINQ RWKTVGEPDN ATLEVNDEAD DDGIEHQEG
|
| |