Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3361 |
Symbol | |
ID | 5736903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4238588 |
End bp | 4240006 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280508 |
Product | hypothetical protein |
Protein accession | YP_001546125 |
Protein GI | 159899878 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000455234 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTTG CGAAGCTGCG TGTACGGGCA ATGATGGTTG CCTTGTTGCT TTCAATCTTT GCAGTAGGCG GTCGGGGCGT ATCAGCTCAA ACCAATGCCT ATGCAACCGC ATTCACCACC TCGATCACCT ACCAAAACAT TGGTACTGGT ACTGCTAACA TCAACTTGAC CGTGTACAGC TCAAGTGGAA CTCCATCAGC CATCCCAGCC TCAACCTTGG CTGCTAATGG CGCTGGTGCC TACTTTGTTG GCTCAGTCAG TGGCTTGGGC ACCACCTTCA ATGGTTCAGC AGTGATCTCA GCAGATCAAC CAATTGCTGC AACCTTGGTG CAAATCCCAG CCGCAGCTTC ACAAGTCAAG AACCGCCCAT TGTCAAATGG TTTCTCAAGC GGCTCAGACA CCGTGTTGAT TCCAACGGTT TTGAAGGCAT CATCAAACTA CACCACCAAG TTTGTGATTC AAAACACCGA CTCAGTGGCA AATGACTTCA CCGTTCAATT CATCAATCCA GCAACTGGGG CAGTTGTTCA CACTGCTAAC CCAACTAACG TCTTGCCAAA CACCTCAGTC TACTACGATG CTGGCACGAT TTCAGCCTTG GGTGCAAGCT TCAGCGGCTC AGTCAAAGTA ACGGCTGTCA AGAATGGCAC CAGCAACCCT GGTAGCGCCG TTGGTACCGC CCTTGAATTG CAAACCAATG GTGTTGGTGC TTATGCTTCA CAAGCATTCC CATCAACTGC TGCTGCAACC AAAGTTTCGA TGGCAACTGC CCTCTGTAGC TATGTGATTC CAAGTGGTCA AACCACCTCG TTCTATGCAG TCCAAAACGC TGGTACTTCA TCAGCAAGCG TGACTGTAAC CTACGTTGGT ACCGCTGCTG GTTCACCAGT CAACGTTACA AGCACCGCAG TCAACATTGC TGCTGGCGCT AAAGCTAGCT TCAATCCTTG TGGTACCACT CCAACCAACT TCACTGGCTC AGCAACCATC AACTCAACCC AACCAATCTT GGCTGTTGGT AAAGTTAATG GTGGTGGCTT GTACACCGCA TTCGAAGGTG CAACCGCTGG TAGCGCCAAG ACTGCATTGC CATACGTTCG CTGGTTGACC CCAGCTCAAG GTGGCCAACA AACCTACATC GCTATCCAAA ACGTTGGCAC GAGCGCAGCA AGCAGCGTAA CCGTCAAGTA CTATAGCGGT GCTGGTGCAT TGCTCGGTAC TCACACCATC CCAAGCATCG CTGCTGGCGC TAAAGCTAGC TCAAACCCAA CCAACGCTGG CGTAACCAAT ATGGGTGTTG GTGGTGGTTC AGCCGTGGTT GAAGGCGCTG GCGCTCAATT GATTGTGGTT GCCCGCGTAA CTTCACCTGT TGGTACTGGT ACCACCGGCG AAGACTACAA CGGTATTCCT TTCAACTAG
|
Protein sequence | MTFAKLRVRA MMVALLLSIF AVGGRGVSAQ TNAYATAFTT SITYQNIGTG TANINLTVYS SSGTPSAIPA STLAANGAGA YFVGSVSGLG TTFNGSAVIS ADQPIAATLV QIPAAASQVK NRPLSNGFSS GSDTVLIPTV LKASSNYTTK FVIQNTDSVA NDFTVQFINP ATGAVVHTAN PTNVLPNTSV YYDAGTISAL GASFSGSVKV TAVKNGTSNP GSAVGTALEL QTNGVGAYAS QAFPSTAAAT KVSMATALCS YVIPSGQTTS FYAVQNAGTS SASVTVTYVG TAAGSPVNVT STAVNIAAGA KASFNPCGTT PTNFTGSATI NSTQPILAVG KVNGGGLYTA FEGATAGSAK TALPYVRWLT PAQGGQQTYI AIQNVGTSAA SSVTVKYYSG AGALLGTHTI PSIAAGAKAS SNPTNAGVTN MGVGGGSAVV EGAGAQLIVV ARVTSPVGTG TTGEDYNGIP FN
|
| |