Gene Information |
Locus tag | Haur_1556 |
Symbol | |
ID | 5733443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1806742 |
End bp | 1807965 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278695 |
Product | hypothetical protein |
Protein accession | YP_001544327 |
Protein GI | 159898080 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTA ATCAAGCGCC CCAACTCGAT TTAACCTATT GCACCAATAT TCATCCGGCC AACGGCTGGC CAGCGGTTTT AGCTGGTTTG CAACAGCATG TGCTCGATCT CAAACAGCGC CTCGCTCCCA ACCAAGCCTT TGGGATTGGC CTGCGGCTTT CGGGCCAAGA AAGCCACCAA TTGTTGGAGC CAACTGCGCT TGCTGATTTT CAAGCATGGC TAACTGAACA TAATCTTTAT GTCTTTACCT TGAATGGCTT TCCCTACCAT CCCTTCCACC AACAACCAGT CAAGGATCAG GTGCATGCGC CCGATTGGCG TGAGCCTGAA CGAGTGGCCT ATACCTTACG GCTGATTGAG ATCTTGGCGG CACTGTTGCC CAAGGGCATG GTTGGCTCAA TTTCGACCAG CCCTTTGAGC TATAAACCAT GGTTTGCCGA TTTGTCCGCC GTGCCGTGGG CATTGTTGAA TCGCCATGTG TTGCAGGTGG TCGCCGCGTT GGTCCAGCTT GAGCGCCAAC GTGGGATTGT GATTCAATTA GCTTTCGAGC CAGAGCCAGA TGGTTTGCTC GAAACCAGCA GTGAATTAAT CGGCTATGTT GAGCAATTGT TGGATGTTGG CGCTGTTGAA TTAGCAGCTC AACTTGATTG CTCGTTGCGC GAAGCCCAAA ATGCGATTCG TCGCCATGTC GGAGCCTGTT TGGATACCTG TCATTGCGCC GTAGCCTACG AAGCGCCGCG CCACGTGATC GCTGCTTATC AAACGGCAGG CATCAGCATT GCCAAAGTGC AACTTAGCTC AGCCTTGCAA GTGATGCTTG ATGACGATCG CCAGGCCGTA GCAGCAGCTT TAGCACCATT CAGCGAAGCA ATTTATTTGC ACCAAGTGAT TCAGCGCAAC CATGATGGTT CGTTGCAGCA ATATCGCGAT TTGCCTCAAG CCTTGGAAAA GATCGATGAT CCTGCTGCTT GCGAATGGCG GATTCATTTT CATGTACCGA TTTTTACCGC CAGTTTTGGC CTGCTCAACG CCACCCAACC AGCCTTGCTC GAAAGTTTGC AAGCCTTGCT CGAAAGTTTG CAAGCCTTGA ACGAGCAGCC CTACAGCCAG CATTTGGAGA TTGAAACCTA TACGTGGGAT GTGCTGCCAA GCCAATTAAA GCTCGATCTG ACTGAATCGA TCGCGCGGGA GTATGCGTGG GTGTTGCATG AACTCAAACG CTAA
Protein sequence | MQLNQAPQLD LTYCTNIHPA NGWPAVLAGL QQHVLDLKQR LAPNQAFGIG LRLSGQESHQ LLEPTALADF QAWLTEHNLY VFTLNGFPYH PFHQQPVKDQ VHAPDWREPE RVAYTLRLIE ILAALLPKGM VGSISTSPLS YKPWFADLSA VPWALLNRHV LQVVAALVQL ERQRGIVIQL AFEPEPDGLL ETSSELIGYV EQLLDVGAVE LAAQLDCSLR EAQNAIRRHV GACLDTCHCA VAYEAPRHVI AAYQTAGISI AKVQLSSALQ VMLDDDRQAV AAALAPFSEA IYLHQVIQRN HDGSLQQYRD LPQALEKIDD PAACEWRIHF HVPIFTASFG LLNATQPALL ESLQALLESL QALNEQPYSQ HLEIETYTWD VLPSQLKLDL TESIAREYAW VLHELKR
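The length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is illustrative only and is not part of the database record. It assumes 1-based inclusive coordinates and a single terminal stop codon. The helper names (span_length, expected_protein_length, translate) are hypothetical. The codon table is the standard codon-to-amino-acid assignment, which translation table 11 shares for internal codons; alternative start codons are not handled.

```python
# Values copied from the record above.
START_BP = 1806742
END_BP = 1807965
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407

# First three 10-nt blocks of the gene sequence, as printed in the record.
FIRST_30_NT = "ATGCAGCTTA ATCAAGCGCC CCAACTCGAT"


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1224
    print(expected_protein_length(GENE_LENGTH_BP))  # 407
    print(translate(FIRST_30_NT))                   # MQLNQAPQLD
```

Under these assumptions the coordinates give the listed gene length of 1224 bp, the codon count gives the listed 407 aa, and the first 30 nucleotides translate to the first block of the protein sequence.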