Gene Information |
Locus tag | Haur_0207 |
Symbol | |
ID | 5732102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 241511 |
End bp | 242764 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277331 |
Product | hypothetical protein |
Protein accession | YP_001542987 |
Protein GI | 159896740 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
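Note: the coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon that does not appear in the protein. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length_bp(241511, 242764)
print(length, protein_length_aa(length))  # prints "1254 417"
```

The strand field does not affect this calculation: a gene on the minus strand spans the same number of bases.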
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.835179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCACA ACTTCTATGC AACCTTACCC ATTATCACCG ATTTTGTCCA AATTACCGAT GCCAATTGCT ACCATCGCGT CCCTGACGAT TGGGTTATTG TGGTGAGTGA TATTGAGCAA TCGACCAAGG CGATTGGCGA GGGGCGCTAC AAAGATGTGA ATTTTATTGG CGCTAGCACG ATTGTAGCCT TACTCAATTT GCAGCCCAAT CTCGATATCC CGTTTGTGTT CGGCGGCGAT GGCGCAACCG TGTTATTGCC ACCATGGTTA GTCGAGCAAG CCAAACCTGC CTTGCAAGCA GTCCAACATT TGAGCGAATC GATCTACAAT TTGCATTTGC GGGTGGGTAT TATGCCAGTC AGTGAAGTTT ATGCCCATCG CTATCAGCTG GAAATTGCCA AATTCGCCGC CTCGGACAAT TATGCCCAAG CGATGATCAA TGGTGATGGT TTGACCTTCG TCGAACAAAC GATCAAAGAT CCCGTGGCCG GTGCGAAATA TTTGCTAGCC GCCCAGTCGA GCGATCAACC AGGCTTGCTC GATGGCCTCG AATGTCGCTG GCAAGAAATT CCCAGCCGTT ACGGCGAAAC GGTTTCGCTC TTAGTTCGGG CCGAGGCCAA CACCACTACT CAACGTAATG CAATCAATCG CCAAGTTATT GAGCAGATTG AGGCTATCTA CGGCGCTGAC GATTCGCATC ACCCCGTCGA TGTACAACAA CTAAGCCTAA CCTTACGGAT TCAAGATCTG TGGGGTGAAG CCCGCTTGCG TGGTGGTACC AGCAAACTGC AACAATTGCG CTATCTCAAT AATATTTGGT GGCTGAATGT GCTGGGCAAA TTGCTGTTGG CAACTGGAGC TAAAACCGAA TTAACCGATT GGGCCGAATA CCCACAGATT TTGCAAGCCA GCACCGATTA TCGCAAATAC GATGCAATGC TGCGCATGGT GATTGCCGGA ACTCCTGAGC AACGCCAGCA GCTTGAACAA TTTCTGAATG CTGAACGGGC GGCTGGGCGG CTCAACTATG GCCTGCATGT TTCCGATAGT GCCTTGATGA CCTGTATCGT GTTTGAACGG ATGGGCCGCC AAGTGCATTT TATCGATGGC AACAACGGCG GCTATGCCAA AGCTGCTGAT CAACTCAAAC AGCAAAGCCA TTATCTCGAA CCACCTGTTA CAACCAAACC TGTGTCACAG CCAAAAACAG GACTTTTGGG GGATACTTCA TCACCATCGT GGGGGACATG CTGA
|
Protein sequence | MAHNFYATLP IITDFVQITD ANCYHRVPDD WVIVVSDIEQ STKAIGEGRY KDVNFIGAST IVALLNLQPN LDIPFVFGGD GATVLLPPWL VEQAKPALQA VQHLSESIYN LHLRVGIMPV SEVYAHRYQL EIAKFAASDN YAQAMINGDG LTFVEQTIKD PVAGAKYLLA AQSSDQPGLL DGLECRWQEI PSRYGETVSL LVRAEANTTT QRNAINRQVI EQIEAIYGAD DSHHPVDVQQ LSLTLRIQDL WGEARLRGGT SKLQQLRYLN NIWWLNVLGK LLLATGAKTE LTDWAEYPQI LQASTDYRKY DAMLRMVIAG TPEQRQQLEQ FLNAERAAGR LNYGLHVSDS ALMTCIVFER MGRQVHFIDG NNGGYAKAAD QLKQQSHYLE PPVTTKPVSQ PKTGLLGDTS SPSWGTC
|
| |
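Note: the gene sequence above is printed in space-separated groups of 10 bases, so the spaces must be stripped before it can be compared with the "Gene Length" and "GC content" fields. The following is a minimal Python sketch of such a check. The function names are hypothetical, and the start-codon test is a simplification: translation table 11 also permits start codons other than ATG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a grouped sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, standard stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input for illustration only, not the sequence from this record.
toy = clean_sequence("ATGGCG TGA")
print(len(toy), looks_like_complete_cds(toy))
```

To check this record, paste the "Gene sequence" value into `clean_sequence` and compare `len(...)` and `round(gc_percent(...))` with the reported fields.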