Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3912 |
Symbol | |
ID | 5735773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4905573 |
End bp | 4906796 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641281063 |
Product | hypothetical protein |
Protein accession | YP_001546674 |
Protein GI | 159900427 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAACG AAATCGTCCA AATCGATTAT CAGGAGGCCA ATCAGCTAGC CCAACGTTTC TCGAAGCAAC AAACCCACGT GCAGGCAATT TTGCAAAGCC TCAGGCAGAC CATTCAAGCG CTTGAGAATG GCGGCTGGAT GGGCGATGCA GCAACCGCAT GTTTCAAAGA ATTTCATGCC GAGGTAGTCC CTGCCTATAC CCGACTCAGT AACGTCCTTG GTGAAGGCCA AAGTACCCTG CAAGGTATTG CAACGCTCTT TCGTCAGGCC GAAGAAGAAG CTGCCGCACT CTTCCGTGGC GAGTTTGCCG CCGCCGCTGG GGGTGAAAGT GGTATCGCCC AAGCCGTCGG TGGGAGTGGT GGCAACATTC AAGCCGACAA TGGTGGTATG TATCAAACAG TTCAGGCATT TGCCACTAAT TTAAGTGGTA GCGGTGGTGG ACAAGGCCAA GCAAGCGAGC TAGAAACCCT CATCCGTTCG CTCAAAGATC AATGTGTAGA CCCCAATGTT ATTTTAGATG CAATTGCCAA GGCAACCCCA GCAGAACGTC AAGCGATTTT AAATGACCCT CAATTAATGG AATATATTCG CATCAGCGAA GGTACAGCTG CCGATGTCCT TACGGCGGCA TTGCTCGAAG GAGCATTATT CTGGCCAGCT GGCTCAGGCC CAGCCAATAA TGGGGCGATT CAATCTGATG TGATTAACCC TTTAACCGGA GAAGAACGAA CTAATGATTT TGCACTTTGG ATGCGTGGCG AAGCAGGGCC ACCTGATCCG CTTAACGGCA CGATGAATTG CTGGGAAGCA ACCATGTATG CCGCCTATTT AAGTGGCGAA ATCAGCGAAA GTCAATTACG CGAAATTCAT CAAAACGCGG CTGATGCAGG CGCTGATTAT TATAATGTGA TTGAAGATGC CTTTGGTGCT GATAATCGCA GCACATGGCA ATCAGGCGAT CAACCAGCCG CAGGCAGTAT TGTCTTTTTT GAAAGTGACG GTAGCCCACT TGCTCACGTT GCAATTGCTA CAGGCCGAAC AACCCCTGAT GGCAAGACTG AGATCATGAG TTTATGGGTA TTGCCTCAAG ATTCAAATGG TAATTTTGTG CGATCAATGC AACGTACCAC CATTGAAGAT TTACAGGCCT CAATGGCCGA CGCTGGCATT CCATTAGATC AAATTAGCAC TTCACCCAAT CCATGGGATC AGCCGAATGA TTAG
|
Protein sequence | MGNEIVQIDY QEANQLAQRF SKQQTHVQAI LQSLRQTIQA LENGGWMGDA ATACFKEFHA EVVPAYTRLS NVLGEGQSTL QGIATLFRQA EEEAAALFRG EFAAAAGGES GIAQAVGGSG GNIQADNGGM YQTVQAFATN LSGSGGGQGQ ASELETLIRS LKDQCVDPNV ILDAIAKATP AERQAILNDP QLMEYIRISE GTAADVLTAA LLEGALFWPA GSGPANNGAI QSDVINPLTG EERTNDFALW MRGEAGPPDP LNGTMNCWEA TMYAAYLSGE ISESQLREIH QNAADAGADY YNVIEDAFGA DNRSTWQSGD QPAAGSIVFF ESDGSPLAHV AIATGRTTPD GKTEIMSLWV LPQDSNGNFV RSMQRTTIED LQASMADAGI PLDQISTSPN PWDQPND
|
| |