Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0017 |
Symbol | |
ID | 5736851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 20847 |
End bp | 21971 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277138 |
Product | hypothetical protein |
Protein accession | YP_001542797 |
Protein GI | 159896550 |
COG category | [S] Function unknown |
COG ID | [COG2311] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTG TATCCCGTTC CAAGACCACA AATGTGGTCA AAGATTCGCA ACGGATTGTT GGTTATGACG TAGCACGAGC CTTGGCAATT TTAGCGATGG TGATTGTGCA CTTCGCATTA ACCTTCGTCA ACATCGAACA ACCCACAACC AATGGCCTGC TTAGCTTGAT TGTGCAGTTT TGCGAGGGTC GCGCGGCGGC CTTGTTTGTC GTTTTGGCAG GGGTTGGCTT GAGTTTGATC GCCCGTACCC AAGCTAATCC TAGCTCTGAG GCAATTCAAG CCAAACGCTG GACGTTGACC AAACGTGGCT TATTTCTGCT TGGCTTGGGC TTGCTCAACC TAACAATCTG GCCAGGCGAT ATTTTGCGGG TGTATGGGGT TTCTTTTTTG CTGATTGCTT GGCTGTTTCA AAGCTCAAAT CGGCGAATTT TGGGCTTGGC CTTAGGCTTT ATGCTGGCCT TTGTTGGCCT GATGTTGCGC TTCAATTTCA ACCAAAACTG GGATTGGACA ACCCTCGAAT ACACTAATTT ATGGACGATC AAAGGCGGAT TGCGCAATCT ATTTTTTGAT GGCTTCCGCA GTGTGTTCCC TTGGACTAGC CTGATTTTCT TTGGTATCTG GCTTGGCCGC CAAAACGTGC AAGCTGCGCA CGTGCGTTGG CGCTTATTTT GGATTGGGCT AAGTGTGGCG CTGGGCGTAC AAATTGGATC GATTGGATTA ACCTACGTTT TTAGCAACGT TTGGCCAATA TTGGGTCAGG CAGATGCCGA ATTGTTGTTT AGCACTGGCT CAATTCCACC AATGCCCTTA TTTTTGCTCT CGGCTGGTGG TGTGGCCTTG GCAATCATCA TGAGTTGTGT ACAGCTAAGT CAATTATTTG GTGCTAGCCG CATCATTCAT GGTTTGGCGG CAACTGGCCA ATTGGCGCTG ACCTGGTATA TCGGCCATGT GGTAATTGGC TTGGGTGTGT TGACCAGCCT TGGTTTTTAT CAGAATCAGA GCCTTGCAAC GTCGCTCTGG TTGGCCTTAG GCTTCTTTGG ATTAGCGGTT GGCTGCTCGG TTTGGTGGAA AAAACGCTTC AATAATGGCC CATTGGAAAC AGTGCTGCGT TGGGCCACCA GCTAA
|
Protein sequence | MAIVSRSKTT NVVKDSQRIV GYDVARALAI LAMVIVHFAL TFVNIEQPTT NGLLSLIVQF CEGRAAALFV VLAGVGLSLI ARTQANPSSE AIQAKRWTLT KRGLFLLGLG LLNLTIWPGD ILRVYGVSFL LIAWLFQSSN RRILGLALGF MLAFVGLMLR FNFNQNWDWT TLEYTNLWTI KGGLRNLFFD GFRSVFPWTS LIFFGIWLGR QNVQAAHVRW RLFWIGLSVA LGVQIGSIGL TYVFSNVWPI LGQADAELLF STGSIPPMPL FLLSAGGVAL AIIMSCVQLS QLFGASRIIH GLAATGQLAL TWYIGHVVIG LGVLTSLGFY QNQSLATSLW LALGFFGLAV GCSVWWKKRF NNGPLETVLR WATS
|
| |