Gene Information |
Locus tag | Haur_3914 |
Symbol | |
ID | 5735775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4908012 |
End bp | 4909160 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641281065 |
Product | hypothetical protein |
Protein accession | YP_001546676 |
Protein GI | 159900429 |
COG category | [S] Function unknown |
COG ID | [COG4842] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
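The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4908012
end_bp = 4909160

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (Is gene spliced: No) and ends in a
# single stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.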
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTGC CAATCGTTCA AATTCAGTAC CAACAGCTCG AACAACTCGC CCAACGCTTC GCCAAGCAAC AAACCCAGGT TCAAGCAATT CTACAAAGCC TCAGGCAAAC TATCCAAGCG CTTGAACATG GCGGCTGGAT GGGCGATGCA GCCACTGCCT GTTTCAAAGA ATTCCATGCC GAGATCGTCC CTGCCTATTG CAAACTTCAG CATGTGTTTA GTGAAAGCCA AACCAGCTTA AACCAAATAG CCCAGCTTTT TCACGAGGCT GAAGCCGAGG CTGCCGGATT ATTTCGGGGT GAAATGGCGC AAGCAGTCGG CAACGCTAGC CCTTGGCAGA TCCTCAATCA TCTTGAAACA AACCTATCCG AGCAACTCAG TCGCTTCCAA CCTGAAACTT TAGAACACAT GCTTGGTGCG ATTGATCCTC AGTCATTTGA TATCACTACG GTGCTGGCGC TGATTGAGAA CGCCTCGCTG GTCGAACGTC AAGCTATTTT GCAAAATCCA GAAGTATTAA ACTTAATCAG GGCCAGCGCA GGTGATCAAG CATATATCGT GATGGCAGTC CTTTCAATTG GATCATTCCG GTGGTTAGCT TCGCAACAAG GAGCAGATGG ATGTATTGCG ATCTTTCTCA CAGACCCGCT AACCCATGAA CCTCGAACTA ACAATTTTGC CCAATGGATC TGCACCGTAC CTAATTCTAA TGAGCCTTCA AGCCAAATAT CACCCCCAAA TCCGCTCACT GGCTCAATGA ATTGTTGGGA ATTTGTTTTA TTTTCAGCAT TCTTGGGTGG AAATATCACC TATGACGAGC TTGATGCAGT CTATTCCGAG GTTGGGACAC CAATCACGCT AGAAGAACAC AAACAAGCAA TTTATACTGC CCTTGGTGGT GAACACAGCA CTGAGTGGAA TCCCTGTAAT GATATTCCTG CTGGCAGTAT TATCTTTTTC GACGATGGCA GCAACCCAAT CGCCCACGTT GCCATTGCGA CAGGCCGTGA GATCAATGGG TCACCCGAGT TAATTAGTTT GTGGGATCGG CCAAACACTA TCGATAGTGT GCAATTTACA ACGATTGAGG CATTAAATGG CTCCGATTAC ACCATCAACG TTGCACCAAA CCCTTGGCTG AACGATTAA
|
Protein sequence | MSVPIVQIQY QQLEQLAQRF AKQQTQVQAI LQSLRQTIQA LEHGGWMGDA ATACFKEFHA EIVPAYCKLQ HVFSESQTSL NQIAQLFHEA EAEAAGLFRG EMAQAVGNAS PWQILNHLET NLSEQLSRFQ PETLEHMLGA IDPQSFDITT VLALIENASL VERQAILQNP EVLNLIRASA GDQAYIVMAV LSIGSFRWLA SQQGADGCIA IFLTDPLTHE PRTNNFAQWI CTVPNSNEPS SQISPPNPLT GSMNCWEFVL FSAFLGGNIT YDELDAVYSE VGTPITLEEH KQAIYTALGG EHSTEWNPCN DIPAGSIIFF DDGSNPIAHV AIATGREING SPELISLWDR PNTIDSVQFT TIEALNGSDY TINVAPNPWL ND
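The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is pasted in the grouped form shown above (blocks of ten bases separated by spaces) and uses the standard codon assignments, which translation table 11 shares for amino acids; table 11 differs mainly in which codons may act as start codons, and that does not matter here because this gene begins with ATG. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and written only for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence, as displayed above.
gene_start = "ATGAGTGTGC CAATCGTTCA AATTCAGTAC"
gene_end = "AACGATTAA"

print(translate(gene_start))  # compare with the first block of the protein sequence
print(translate(gene_end))    # compare with the last residues of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` should allow the Protein sequence and GC content fields to be checked in the same way.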