Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0007 |
Symbol | |
ID | 5736841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 8321 |
End bp | 9760 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277128 |
Product | hypothetical protein |
Protein accession | YP_001542787 |
Protein GI | 159896540 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000446047 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGAA TCCGCGCTTC ATTCCGTCTC GCCATTGTTC TATTGGCAAT GTTGTTCCTC GTTGGCCCCG TTTTTGTAGC TCGGGCACAC GTTAGCCCAG CCCAAACCCA AGCTCCTGCC ACTCCCAACG ATACTCAGTA TCTATGGATT GCCGGCAGTT CGTTTCAAAC CCGCGACTCA ACCACGGCGT TTGAAACCAC TCGTAACGTT AATAATCAAC CAACTGGTTG TATTTATGCC ACCAGCAGTG GCGAGTTTAC CGCCCCAGTT GCCGTGCCCA ATGGCGCAAC GATCTTGGGC TTGGATTACT ATCTCTATGA TACCAGCACG ACGCAAACCA AAGCTGAGTT GACCCTCAAC GACAGCGATA TGGTTACTCC CCGCGAATTG ATCACGGTCG AAATTTCAAG CTCGGTAGGC CTGACCAACA CCTACGCCCC AATGGGTGCA CTCTATGCCC CGTATGTGGT CAATATGCAA ACCCGTGGCT TGTTCTTAGA GTGGTTTCCC CGCGTAACCA ATGCGGCCAT GCAATTATGT GGTGTGCGGA TCGCCTACAC CGCGCCAGCC GAACCACGTC CATCGACTGA TTATCTGTTT ATCGTTGGCA GCACCTTGGT CAACACCAAC TCTAGCACCG AACACGGCTA TGCTGGGGCT GGCTGTACCT TTGTCAAAAT CAACGGGCGC ACGCTCAATG CCGATGTGGA TTTGCCTCAA GGCAGCCAAC TAACGGCGGT ACGCAGCTAT TTCCGCGATG TCAATAACGC TGATCTCACG GTCAAATTAA TTGCCTCAAA TGGCCAAGGC GTGACCAATA CTCTGGCCAC CCTCACCAGC CCGGTGTCGA ATACAGCGGT GGTGAATGCT GATCAATCGT TGAATTACAC GGTTAACGAA AGCAGCGAAT CGTTGAGCGT TTTGGCCGAT TTTGGTGGGG TGTTGAGCAA CCAAATTCGC TTGTGTGGGG TGCGTTTCCA ATATACCAAC CCCAGCGCTA AGCCAACCCA AGATAGCCGC TTCATCACTG GGAGCACCTT CGTGCCCCGC CGCTCGAATG TCAGCTACAC GAGCGATGCC AATGGTTGTG TCAATGTAAG CAACGAAGTT GAAGATTTGA CCACCAATGT AACTGCGCCC GAAGGAGCCA AGGCTGCCCG CGTCACCTTC TACTACAAGA ATGCTGCGGC AGGCCCAACC CTCAACCTCT ACAGCTTTGT TGGCAGCGGC GATTTTACGT CAATCACCAG TGTACCAGTG ATGGGAACGG GTACTCAGAA CGCCGTCAAC ATCAATTATC CAATTGAAAA CGCCGAAAAA GGCTTTGCCT TGGTGTGGGA TGCGTCGTCG GCAAGCAGTG AATATGCCTT GTGTGGTGCG AAGATCGACT TCCTCTACAC CCAACAGGTC TTCTTGCCAG CCGCTATGAA CAACTACTAA
|
Protein sequence | MSRIRASFRL AIVLLAMLFL VGPVFVARAH VSPAQTQAPA TPNDTQYLWI AGSSFQTRDS TTAFETTRNV NNQPTGCIYA TSSGEFTAPV AVPNGATILG LDYYLYDTST TQTKAELTLN DSDMVTPREL ITVEISSSVG LTNTYAPMGA LYAPYVVNMQ TRGLFLEWFP RVTNAAMQLC GVRIAYTAPA EPRPSTDYLF IVGSTLVNTN SSTEHGYAGA GCTFVKINGR TLNADVDLPQ GSQLTAVRSY FRDVNNADLT VKLIASNGQG VTNTLATLTS PVSNTAVVNA DQSLNYTVNE SSESLSVLAD FGGVLSNQIR LCGVRFQYTN PSAKPTQDSR FITGSTFVPR RSNVSYTSDA NGCVNVSNEV EDLTTNVTAP EGAKAARVTF YYKNAAAGPT LNLYSFVGSG DFTSITSVPV MGTGTQNAVN INYPIENAEK GFALVWDASS ASSEYALCGA KIDFLYTQQV FLPAAMNNY
|
| |