Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1210 |
Symbol | |
ID | 5733103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1394206 |
End bp | 1395696 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278350 |
Product | hypothetical protein |
Protein accession | YP_001543986 |
Protein GI | 159897739 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0023593 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGGTT ATGCCGACTT TGAATTAACG ATCAACCCAA AAGATGCTGA AAACAGTTTT GTGGTGCATG GGCGTACTGC CAAAGGCATG CAAGATAGCG ATAGCCTGAT TTTGCCCGTT GACGATCCAC GTTATCAAGC CTTTCAAACC GCCTTGGACT ATAACACCCC GCTGACTGAA GATCAGGTGA TTGATTTTGG GATCGTGCTC TACGAAACAT TGTTGAAAGG GAAAATTTGG GCATTATTTA CGGCAGCCCG TGAGACTGCT CGCTCCCAAG GTCAATCATT ACGGATTAAA CTTAATGTTG ATGCCAACAA CCCTGCTTTA GCGACAGTTG CCACGATTCC CTGGGAGTTT GCCTGCGATA GCGCAGGAAT TCCACTGACA ACCGATCATT CAATCTGTCG ATTTTTGACT TTTCCTGAAT CAGTACCAGT CTTGAGTTTG GGTCAAGAAA AATTACGGAT CGCCTTAGTG GGAGCCTTGC CAGCTGAAAT GGCTACTACC CATCCAGTCG ATATCCAAGG TGAATTAGCG GCGATTCATC GTTCGTTAGA GCCGTTGGTT ACCCAAAATC AAGTTGAAAT TTATGAAGAA ACTCAGTTAA CTGCACCCAA ACTTCAGCGG CTTGTGCGCG AATGGCGACC ACATATCGTG CATTATGTCG GTCATGGCGA TTTCCAAGGC ACAACTGGGG CATTGATTCT CGATGATGGC AATGGCAAAA AACATCTTTC AACCGCTCGC ACATTAGCAA CCCTCTTGCG CAATACCTCA GTGCGCTTGG TGGTGCTGAA TGCTTGTAAA ACTAGCACGG TTTCCTCAAC CGCCTTGCTG CGTGGAATTG CCCCAGCCCT GATGGCGGCC AATATTCCAG CGGTTGTCGC CATGCAATCA TCAATTTTAG ATACAGCAGG CAAGGCCTTT GCCGAAGAGT TTTATCGGGT ACTCGCAACT GGTACGCCAA TTGATGCTTG TGTTGCTGAA GGGCGTAAAT CGATTATTGC CTATGGCTTT GGCCAGCTTG ATTGGGGCTT GGCAACGCTC TATATGCGGG CTGATGATGG CGTGCTGTTC AATATTCCCA CTCCATCAGT GCCAAGCAAT CAGGTGGTAA CTCCAACTAA CGAAACTACT CCGCTCGCAA ATCTAACTGG TGGCAATAGT GTTAGCAATT TGCTGGGTAA CAATAACACC ATTACTGGCG GTAATATCTC AATTGGCAAT GTGGTTGCTG GCAATCATAA TCAAACTACT ATCAACCATG GTGTTGCTCA GCCAAATACT CCAAGTGCAG CCAATAATCA GCAACAAGCC CTCGCAGCGG AACGCGAATT GCTGGCACTC AAACAGAAGA ATTTAAATAT TACCAAACTG CAAATTGAAC AATACGGGAT TGGCGTGCCA GTGTATTTAC AAAACCAACA CGATGAGTTG GTCAAGGATA TTGCGGCAAT TCAACAACGG ATTGCTGAAT TGAGCAAATG A
|
Protein sequence | MSGYADFELT INPKDAENSF VVHGRTAKGM QDSDSLILPV DDPRYQAFQT ALDYNTPLTE DQVIDFGIVL YETLLKGKIW ALFTAARETA RSQGQSLRIK LNVDANNPAL ATVATIPWEF ACDSAGIPLT TDHSICRFLT FPESVPVLSL GQEKLRIALV GALPAEMATT HPVDIQGELA AIHRSLEPLV TQNQVEIYEE TQLTAPKLQR LVREWRPHIV HYVGHGDFQG TTGALILDDG NGKKHLSTAR TLATLLRNTS VRLVVLNACK TSTVSSTALL RGIAPALMAA NIPAVVAMQS SILDTAGKAF AEEFYRVLAT GTPIDACVAE GRKSIIAYGF GQLDWGLATL YMRADDGVLF NIPTPSVPSN QVVTPTNETT PLANLTGGNS VSNLLGNNNT ITGGNISIGN VVAGNHNQTT INHGVAQPNT PSAANNQQQA LAAERELLAL KQKNLNITKL QIEQYGIGVP VYLQNQHDEL VKDIAAIQQR IAELSK
|
| |