Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3144 |
Symbol | |
ID | 5735016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3971342 |
End bp | 3972625 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641280287 |
Product | hypothetical protein |
Protein accession | YP_001545909 |
Protein GI | 159899662 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.613992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAACCA ACGTGCCTCA ACTCAGTTTT AATATCAATC TCCAAGCTGA TGAACGGGTC AAGTTGATCG CCTATCGCCA TTGGCTAATC TTGGCACGCA ATCTGTCAAT TTGGTTTACC TTTTGGTTTA TTTGTACCGT GATTTTCTTG TGGCGGGTCT CGGTGGTTGG CGTTGATACC CTGAATACTG TGCTGTTTGG CGCTGGTACG ATCTTTTTCG GGGCCATGAT TTATAGCTAT TTCGATTGGC GCAATGATGC CTTGGTCGTT ACCAATCAGC GGGTTATTTC GTATAACTCG CGTTTTTTGA TCAGCGTGCA GCGCAATGAG CTGTATGTGC GCGAAATTGA AGATGTAAAA ACCGTCACCG AGTCGGTGGT TTCACGCTAT TTCGATTATG GCGAAATTGA AGTCCAAACT GCCAGCCGTT TGCGCAACAT TGCCTTCGTG GGAATCGTCA ATCCTACGTT GGTACGTGAC ACAATTCTCG AATTTGTTGC GCCGCTCAAA GAAGTTGAGC ATGTTGAGCA TATTCAGCAA ATCGTCAGAG CCAAAGTGCT CAAACAAGGC ACAATGCCAA GCTTGCCACC ATTGAGCGAT TTTATTCCAC CAGAACAAAC TGGCCGTACA CTTTTGGGCA TTATTCCGCC AAGCCCCGAG GTGCGCGGCA ATTCGATTAT TTGGCGCAAA CATTGGCTTT TTCTCTTTGT CGAGGCAGCT AACCCAATTT TGCTGTTTCT AATTATCAAT TTGTCGTGGT CACTGTTGCT CGGCTACGAT TTTATTCGGT CTGGTGGCTC GTTAGTATTT TTGGCAATCC TCGATATCTT TTGTCTCGGC TGGCTGATTT ACGAGGTGAT TGATTGGCGC AACGATGAAT ATATTGTCAC CCCAATCAAC ATTATTGATA TTGAGCGCAA GCCCTTGGGC CGTGAAACCA AACGGGAAAC AACCTGGGAC AAAATTCAGA ATGTTTCGCT GAATCAAGAA AATTTATGGG CACGCATTTT GAAATACGGC GATGTCGAGC TATTTACCGC AGGTCAAAAC GAAAATTTCA CCTTTCGCGG GGTAGCCGCA CCCGATAGCG TGTTGGCAGT CATTTCGGAT TATCGCGACC AATTTGAGCA GCGGGCGCGT GACCGTGAAT TTGATAGCAC CTTAATGCTG TTGCAACATT ATCACCAACT ACAACGCGAT GAACTCCAAG TGCTGTTTGA TGATCATCGC AGCCATATCG AAGCCAAATT GCCGCCAACC GAGCGGTTGG AAACTGGAGT GTAA
|
Protein sequence | MPTNVPQLSF NINLQADERV KLIAYRHWLI LARNLSIWFT FWFICTVIFL WRVSVVGVDT LNTVLFGAGT IFFGAMIYSY FDWRNDALVV TNQRVISYNS RFLISVQRNE LYVREIEDVK TVTESVVSRY FDYGEIEVQT ASRLRNIAFV GIVNPTLVRD TILEFVAPLK EVEHVEHIQQ IVRAKVLKQG TMPSLPPLSD FIPPEQTGRT LLGIIPPSPE VRGNSIIWRK HWLFLFVEAA NPILLFLIIN LSWSLLLGYD FIRSGGSLVF LAILDIFCLG WLIYEVIDWR NDEYIVTPIN IIDIERKPLG RETKRETTWD KIQNVSLNQE NLWARILKYG DVELFTAGQN ENFTFRGVAA PDSVLAVISD YRDQFEQRAR DREFDSTLML LQHYHQLQRD ELQVLFDDHR SHIEAKLPPT ERLETGV
|
| |