Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4277 |
Symbol | |
ID | 5736136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5460973 |
End bp | 5462634 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281437 |
Product | hypothetical protein |
Protein accession | YP_001547037 |
Protein GI | 159900790 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0190094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTTT CAGTTAAAGC CATTCGAGGG TTGATTATTG CGGTCAGCGT ATGTGGCCTA CTTTTGCGGC TGATTTTTGC GCTCTTGCCC TTGCAAACCC ATTTGCTGGT GCTCGAAGAT GATGCTTGGA TGGTGACGGC CATTGCCCGC AATTGGGCCA TGGGCCAAGG GATCACCGCC GATGGTATCA CGCCAACCAA TGGATTTCAT CCGGTGTATC CACTCACTTT AGGGGCGATT CCCTATGTTT TTGCCCCCGA TAACTTAGCC TTAGGCTTTA CTGTCAACTT GATTATCTGT GCATTGTTGG CCAGTTTAGT CATGTGGCCG TTGTGGCATT TGCTGAGACA TTTGATGAAC TGGCAAGCAA GTTTATTTGG CATAATCTTA TATGCGCTAA ACCCCGTTTT AGTGCGGTTT ACCGTCAATG GCATGGAAAC GTCGATGGCT TTGTTGCTCT GGGTCACGAC GCTGCTAGCT GCGGTCAAAA TCGATCCCAA AAAATTAAGC CATAATCTCG GCCTTGCCGC ACTAACGGCT GCCATGATTC TGACCCGCCT AGATGGAGCA TTGCTCTTTG CCTCGATTGC CGCGGCTCGC TTGATTTGGG CTTGGCGAGC CAAACGCTTA GGCCGTGAAT TGCCCATGCT CACGAGCTAT GTCGTGGTGA CGTTTACGCT GTTAGTGCCC TATTTCTGGC GTAATTTGAC GGTATTTGGT TCGTTTTCGC CCAGTAGTGG CAAAGCCTTG ACCTACTTGC ACAGCTACGT CAACTCCTAC GCCATCTCAA ATGGGCTTGA TGGTTGGTAC GTTAATAGCG CAATTTCGAT GGAAGTATTG GGACGCTCAG TAGTTGGCGC AGCGCTTTGT TTGGCCATTT TTGCGGCATT TGTGGGCTTT TGGGTTGGTC GTCAACTCTG GCTTGGCTTA CCGCTCTTGC TCTATCTGCC GATTCCCTTG GTTTATTATG GCTATATGAT GCAGCAGGAT AATCCACGTC ACTTTGTGCC TTGGTCGCTG GCAGTCATTA TTTTGCTGGC ATGGGCGTTG GCGGCAATGC TCCAGCGTTT GCCATCGATC AGCTATCTGG CCGTGCCAGC GCTGATTGCG GGCGTGCTGA TTGTGCAAAC CCTCGATAGC TCACGCTTTT GGCAAGAAAA AGCAACAGCG CCTAGTCAAT CGCAACCAAC GATGTATCAA GCAGCCTTGT GGATGCGCGA TAATTTGCCC AGTGATGCCT TGATCGGGGC TAAAAATTCG GGCATTTATC AATATTATTC TGGTCATCAT GTGCTGAATA TCGATGGCAA ATTGAACAAC GACATTCTTG AGGTCTATGA TCAACGGCGC ATGCTCGATT ATTTGCGCGA AAAAGGCGTG ACCCACTTGA TCGATCAAGA GGGAACCATG GCCGATCATA TTCAGTTTTA TAGCTATCAA TTTGGTGAAC GGCCCGAGCA TCGTGTGCCC ACAACCTTCA CCCAATTCAA GATCTATGGT CAATTATTGC TGAGCAGTTT GGGGCTAGCC GATAAGCCAG CGCTTGATCG GCGCGATGGT TTTGAGCCAA ATCAGCCATT TAGCAGCATC ACCACGGTGA TTCAGCGCTT CCCACGGCCA AACGATAGCA ATAACCCAAT TGCGATTTTT GAACTTAACT AA
|
Protein sequence | MKVSVKAIRG LIIAVSVCGL LLRLIFALLP LQTHLLVLED DAWMVTAIAR NWAMGQGITA DGITPTNGFH PVYPLTLGAI PYVFAPDNLA LGFTVNLIIC ALLASLVMWP LWHLLRHLMN WQASLFGIIL YALNPVLVRF TVNGMETSMA LLLWVTTLLA AVKIDPKKLS HNLGLAALTA AMILTRLDGA LLFASIAAAR LIWAWRAKRL GRELPMLTSY VVVTFTLLVP YFWRNLTVFG SFSPSSGKAL TYLHSYVNSY AISNGLDGWY VNSAISMEVL GRSVVGAALC LAIFAAFVGF WVGRQLWLGL PLLLYLPIPL VYYGYMMQQD NPRHFVPWSL AVIILLAWAL AAMLQRLPSI SYLAVPALIA GVLIVQTLDS SRFWQEKATA PSQSQPTMYQ AALWMRDNLP SDALIGAKNS GIYQYYSGHH VLNIDGKLNN DILEVYDQRR MLDYLREKGV THLIDQEGTM ADHIQFYSYQ FGERPEHRVP TTFTQFKIYG QLLLSSLGLA DKPALDRRDG FEPNQPFSSI TTVIQRFPRP NDSNNPIAIF ELN
|
| |