Gene Information |
Locus tag | Haur_1454 |
Symbol | |
ID | 5736865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1692575 |
End bp | 1693954 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278592 |
Product | hypothetical protein |
Protein accession | YP_001544226 |
Protein GI | 159897979 |
COG category | |
COG ID | |
TIGRFAM ID | |
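The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1692575
END_BP = 1693954
GENE_LENGTH_BP = 1380
PROTEIN_LENGTH_AA = 459


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1380 459
```

Both computed values agree with the Gene Length and Protein Length rows.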
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000963301 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTCCCAC GTTCGACCGT GCGGCGCAGC ATGGCATTAA TCATTCTAGT AATGCTTTTA ACGCTGGTTG TCGCCCGACC CCAAGCTTCT AATGCTGCCG ACCCACCAAC TTATTATGCT GAAACCGGTC ACTACCTTGG TGGTGGTTTC CGCGATTACT GGAATGCCAA CGGCGGCTTA CAAATTTTTG GCTACCCCAT CACTGAAGAA TATCGCAACG CACAGGGCAA AACCATTCAA TGGTTTGAAC GTGCTCGCTT TGAGTTAGCC AGCAATGGCT CAGTCGAGTT AGGGCTTTTG GGGCGTGAAG CAACCGTTAA TCGGGTGTTT CCGCAAATCC CGCCCCGCGA AAACGATGCC AACCACCGCT ACTTCCCCGA AACCAGCCAT ATGATTATGT GGGGGTTCAA AACCATTTGG GAAACCAAAG GTGGTTTAGG CGTATTTGGC TATCCAATTA GCGAAGAAAT GGATGAAATT CTCGCTTCGG ATAACAAATG GCATATCGTT CAATATTTCG AGCGTGCTCG CTTTGAATTT TGGCCCGATT ACCCTGCGGG GCAACGGGTT GTCGTCAGCG ATCTAGGTCG GCGGTTGGCT CCCCGCGAGT TAACCACGCC ATTGCCACCA GGCAGCCCTC CAGGCAGTAC CCCTCCTGGC GCACCTGGGT TGCCACCAAG CAAAGATGCG ATCGTTACGC CAAGCTCTGG GCCTGAAGGG ACAACCTTTG GCTTCAATGG TTTTGGCTTT GTCGCCGGTG AAGAAGTGGT TTTGTGGTTG ACCTCACCTG ATGGTACAGT CTACCCGGCC AATTCAACAA CCTATGCTGA TATCGATGGG TCGTTAACCT CATCAGGTAT TTATGTCACA ATCAGCCAAG GAGTTGGGGT TTGGGCGATT ACCGCCCAAG GTCGGCTCAG CGGCCATGCC AGCATTGGCT ATTTTGAAGT TACCCGAGCA CCTGAGCAGC CACTGCCCGC CGACTATAAT GCACGGGTTG ACCCTCGTGA AGGTCGGCAG GATACGATTT ACAACTTCTA TGCAGGCGGG TTTGTTCCAG GCGAAGTAGT TGCGGTTGGT GTACTCAACG AATATGACGA ACTCGTAACC GAAGTTATTG GCGTTTATGC TGATGGTAAT GGCTCAATCG ATTATGCCAA TATTCGCTTT GTGCCAAACA ATTCCTTCGA TCCAGGTATT TATGAAATCT ATTCCACCAG TGAAAGTGGC CGTGAAGCCT ATGCCTTCTT GCGTATGCGC AGCAATAGTG TGACCAGTGT CTCGACCTTA AGCATGCGTC AAGCGCGAAC CACCAGCGGT TCATTAGGCC GTGGCGATGG GCTAGCCAGC GAAGGCAATA TCGATTTCTT CCAGAAATAG
Protein sequence | MLPRSTVRRS MALIILVMLL TLVVARPQAS NAADPPTYYA ETGHYLGGGF RDYWNANGGL QIFGYPITEE YRNAQGKTIQ WFERARFELA SNGSVELGLL GREATVNRVF PQIPPRENDA NHRYFPETSH MIMWGFKTIW ETKGGLGVFG YPISEEMDEI LASDNKWHIV QYFERARFEF WPDYPAGQRV VVSDLGRRLA PRELTTPLPP GSPPGSTPPG APGLPPSKDA IVTPSSGPEG TTFGFNGFGF VAGEEVVLWL TSPDGTVYPA NSTTYADIDG SLTSSGIYVT ISQGVGVWAI TAQGRLSGHA SIGYFEVTRA PEQPLPADYN ARVDPREGRQ DTIYNFYAGG FVPGEVVAVG VLNEYDELVT EVIGVYADGN GSIDYANIRF VPNNSFDPGI YEIYSTSESG REAYAFLRMR SNSVTSVSTL SMRQARTTSG SLGRGDGLAS EGNIDFFQK
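The sequences in this record are displayed in space-separated blocks of 10 characters. The sketch below shows one way to strip that grouping and check that a coding sequence ends in a stop codon under translation table 11. It is a minimal illustration using only the first two blocks and the last block of the Gene sequence row. It assumes the row is shown in coding orientation, which the leading ATG suggests even though the Strand field is "-". The helper names are hypothetical.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove the whitespace between 10-character display blocks."""
    return "".join(seq.split()).upper()


def ends_with_stop(cds: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return ungroup(cds)[-3:] in STOP_CODONS


# Excerpts copied from the Gene sequence row, not the full 1380 bp sequence.
first_blocks = "ATGCTCCCAC GTTCGACCGT"
last_block = "CCAGAAATAG"

print(ungroup(first_blocks))       # ATGCTCCCACGTTCGACCGT
print(ungroup(first_blocks)[:3])   # ATG
print(ends_with_stop(last_block))  # True
```

Applying `ungroup` to the full Gene sequence and Protein sequence rows and taking `len` of each result would give values to compare with the Gene Length and Protein Length fields.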