Gene Information |
Locus tag | Haur_3409 |
Symbol | |
ID | 5735270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4295411 |
End bp | 4296748 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280556 |
Product | hypothetical protein |
Protein accession | YP_001546173 |
Protein GI | 159899926 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
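The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4295411
END_BP = 4296748
GENE_LENGTH_BP = 1338
PROTEIN_LENGTH_AA = 445


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect either check, since the span is the same on both strands.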
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTATCAG CGGAATTTAG CGCGAGTAAC GACCTCATCC AAGCGATCGA TCAACTGTTG CTGGCAGGCT TTACCAATCT CAATCATAGC CATCAAATGA GTTTGCAGCG GGTGGCTCAA GTTTATCGCG ATACGCCGTT TGAGCAGGCG ATGCTCAGCG CTGTGACTGA TTTGAGCAAT GGTGTGTTTC AACCAGCCGC GTTTATGTTG CTGGCGGTTG CCCGCGCATC GCTGCAAGCT GCCCAGCATG ATCAACTTTT GCAGCAAATT CGGGTGCAAC TTGGTCGCCC AATCAATGAT CAATCAACAT CAAAAGCCGT GGCGCTTGCC GCAACTCCGC CCTTGTTGGG TAGCGTGCAG CATTGGCTGA CCGATTTAGC AGTAATGGGC TTTTCGCGAC TGGAGCCAGC GATGATCAAC GCCTTTACGC CAACCCTCGC CCAACTGCAA ACTAACCCCG ATTATCTGCG CACTTCGGCG ATTCTCTCTG GTTGGCTGCA TGAATTGCAA CTTCAGCCTG AACAATTGCC TTTGTTTCGT TGGGGCGATT TGTGGACGCG GGCCATGCTT TCGACCCTCA GCCTGAATTC AACCCCACCA ACCCAACCTG TCAGTGGCAC GCTGTACCCC TTGGGCATCG AATGGCGACA ACATTCCACG CTAGTTAGTT TGGTGGTCTA TGCTGTGCTT GAGGCCGATT CCAAGGCTAG CTTGGCTACA ATCAGCCAAT CGGCCTATAA AGTTGCGGCA ATTCAGCAGG ATCAATTGTG GTTGCTGTTT CCTGAGTTGA GCCTGTTGTT CGATAGCTTG AGCACAGCCA AAGCCTTAGT GTTGCGCGAT GCAGCAAGCT TGCCAACTGG CCAATTGCTG TGGGATGCGC AGACCGCCAG TTTGGGCGCT AAATATGATT TGCTTGATGT GGCCGAACGC TATTTTGGCC TGAATCCCAA ACAATCAATT GCCCAAGCTC AACTTGCCCC CCAACAACGC CACCCAGTCC ATGTGGCCGA GCCAGTTGTT TTGAGCAACT ATCAACTCAA TCAAACTACC GAAGCTTGCA CAATAACGAC TGGTGAACAC AGTTTCATGC TTGAGCTTGG GTTGATCAAC GGCACTGAAA TCGATTTGGC GGTGCTTGAA TCGGCTCAAC GCTTGTTTGG CTTGGTGCGC TACGATGCTG GCGAGTGGCT GCTGCGACCT TTGGCAACCA GCCTGAAAAA AGGCAAGCCG CTATTTATTG GGCTAGAAAA CGGCAAAGTT TTTAAGAAAG CGCCTAAAAA TAATGCTGTT GGCATTCTCA AAGAGCGTGC TAGCCGCTTG CTCAGGGAGA AATCATGA
|
Protein sequence | MLSAEFSASN DLIQAIDQLL LAGFTNLNHS HQMSLQRVAQ VYRDTPFEQA MLSAVTDLSN GVFQPAAFML LAVARASLQA AQHDQLLQQI RVQLGRPIND QSTSKAVALA ATPPLLGSVQ HWLTDLAVMG FSRLEPAMIN AFTPTLAQLQ TNPDYLRTSA ILSGWLHELQ LQPEQLPLFR WGDLWTRAML STLSLNSTPP TQPVSGTLYP LGIEWRQHST LVSLVVYAVL EADSKASLAT ISQSAYKVAA IQQDQLWLLF PELSLLFDSL STAKALVLRD AASLPTGQLL WDAQTASLGA KYDLLDVAER YFGLNPKQSI AQAQLAPQQR HPVHVAEPVV LSNYQLNQTT EACTITTGEH SFMLELGLIN GTEIDLAVLE SAQRLFGLVR YDAGEWLLRP LATSLKKGKP LFIGLENGKV FKKAPKNNAV GILKERASRL LREKS
|
| |