Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0031 |
Symbol | |
ID | 5731903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 38809 |
End bp | 39813 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277152 |
Product | hypothetical protein |
Protein accession | YP_001542811 |
Protein GI | 159896564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00551613 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACGA CTGCTCAATT CCCAGAGGCT CGGCGTACTA CGTCGGGAGC CGGAGCTATT TTGCGCCAAT TTTGGGCCAC CAGCAAAGGC ATGACGATCT TTTTCCTGCT TAGTTGCTTT TTCTTGGTGC TCGCAATCGC CGGGGTGACT GTTGACCCGC GTGAGGTGCT TGGCCAACCA GTTTGGATGA AAAGCCTAAA ATTTGCGGTT TCGTTTGTGG TCTATGCGCC AACCGTTTTG TGGATGTTTA GCTATGTTAA GTTGCGGCCT CGTTTGATGC GTTTTGTGAT GGATGCCTGT GCTGCTGCGC TTTCAATCGA AATTGTGTTA TTTATTACCC AAGCTGTGCG CGGCCAACCA ATGCACTTCA ACGTCGCAAC TCCCATTGAT GAAACCTTAT GGAGCATCAT GGGCACGACG ATCACAGTCT TCTATCTGAT TAATATTGTT GGATTTGTGG TTTTCCTGCG CCAAAAGCCA ATTGCTGATC GCGTTTTTAT GCTCAGTTTG AAGTTGGGAA TGGGCTTGAT GCTGCTTGGC TTTGGGCTTG GCTTTTTGAT GACCAACCCC AGCCCTGCCC AAATGGAAGT GTTGCAAGCT GGCGGTTCGG TTCCCGCAAT TGGTGCGCAT ACCGTCGGCG CTGCTGATGG CGGTCGTGGC ATCGCGATTT TGGGCTGGAG CAGCGAGCAT GGCGATTTAC GGATCGCCCA CTTTGTTGGC ATTCACGGGG CACAAGTGCT GGCTTTGATT GGCTGGTTGC TCTATAACGC CAAACAACGT TTCAACGATA AGCAACGTTT GGCCTTGACC TGGGGCGCAG CGGTCGCCTA TCTCGGCCTC GTCGCCAGCG TGACGGTGCA AGCATTGCGT GGTCAAGCCC TGTTGCAACC TGATGCAACC ACTTGGATCT CGTGGATTGG TTTGGTAGTT GGCAGTGTGT TGTTTACGAG TGTCGTTGTA AATCAGGGGT CAGTGGCGAG GGTTCAGGGT TCAGGTAGTT TTTAA
|
Protein sequence | MTTTAQFPEA RRTTSGAGAI LRQFWATSKG MTIFFLLSCF FLVLAIAGVT VDPREVLGQP VWMKSLKFAV SFVVYAPTVL WMFSYVKLRP RLMRFVMDAC AAALSIEIVL FITQAVRGQP MHFNVATPID ETLWSIMGTT ITVFYLINIV GFVVFLRQKP IADRVFMLSL KLGMGLMLLG FGLGFLMTNP SPAQMEVLQA GGSVPAIGAH TVGAADGGRG IAILGWSSEH GDLRIAHFVG IHGAQVLALI GWLLYNAKQR FNDKQRLALT WGAAVAYLGL VASVTVQALR GQALLQPDAT TWISWIGLVV GSVLFTSVVV NQGSVARVQG SGSF
|
| |