Gene Information |
Locus tag | Haur_4519 |
Symbol | |
ID | 5736370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5786045 |
End bp | 5787319 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281682 |
Product | hypothetical protein |
Protein accession | YP_001547279 |
Protein GI | 159901032 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
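The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5786045, 5787319)
print(length, protein_length_aa(length))  # prints: 1275 424
```

Both values match the Gene Length and Protein Length fields reported for Haur_4519.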
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000498589 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGCTG AAATCGTTCA AGCTCAGTAT GACCAACTCA CTCAAGTTGC AAGTCGGTTT GGCAAGCAAT CGGAGGTAGT TGATCAAATC AACAGCCAAG TTCGCCAGAG CTATGAAACA TTAGCCAATG GCGGATGGAT GGGTGATGCG GCAAAGGCAT TTTTCAATGA AATGCAGACG GAAATTTTCC CCACAATGCA ACGTCTAACA GGTGTATTGC GGGAAGCTCA AACGGTAACC CAACAAATTA GTACGATCTT CCAGCAAGCT GAACAAGAGG CAGCTAAGGG GATCACATTT GCTGATGGTG GAGCTAGCTC AAGTGCTGGC AGTGGTGTTT CGTTCAGCGC AGCAGCAGGT AATGCGGCAG GCTCGGCGGC AGCGAACCAG TTGCCACCAC CACGCATGTA CATTGTCAAC GGGATTAACG CCAGCGAGCC AGATGGTACT CCAGGTGAAG GGCCACAGCA ACTGGCCGGC TTGTTGGCTG CCCACGGCTA TGATCCTAGC CAAATCAAGG CGATGCCGGC AATTTACAAC ACCAACTACA CCACCAACTT GCAAGGCACC GATTTGCAAG GTACCAATCA TGGTGGTTGG TTATCGCCCG TCGATTGGCT GACCGGAGCC GGAGCCTCGA TCGTGAATGG AGTTACGGGT GCCGGTGCCA GTGTTGTCAA TGGGGCTTCG GCGTTGTTTA ATACTGGAGT CGGGGTCAGC GAAGTTGTGC AAGAGTATAC AATGCAAGAT CAAGGCAAGT ATACCCAGGA AAGCTACAAC TTTATTCAAC AAGACCTTGC CCGTAACCCA TTATTGCCCG GCCAAACCGT CATGCTAATT GGGCATAGCG GCGGTGGGGC AGTTGTCAGT AACCTCGCGC CAATGCTTGA AAATAACATG GGCGTTGATG TTTCTGGGGT GGTTACGCTC GGGTCGCCGG TAGCCAATGC TGATCGGGCG ATGCAATATG CCAAATTCCT CAGCGTTAGC GACAAAGGCG ATTATATTGG CCAACCATGG ATTCGCTCCG ATGAAGGGCG TAATTTCCTA ACTCCAGGCT TGATGACTGG CATTTTAGCG CCGAAATCCT TGCCATTGGT TGTGCCAGGG GTGCTTGGAG CCGATAACGC CGCCCGCGAT GCTGGGATCA ATTACTTTAC GACCAATGCC AATGCGGGCA ACCCAATTAG CAATCACAAC TCCTATTGGA CGAGCAACGA TGTGGTTAGC ATCATTAAAA ACAGCTATCC CCAAGTTGCT CCATACCTGA AGTAA
|
Protein sequence | MGAEIVQAQY DQLTQVASRF GKQSEVVDQI NSQVRQSYET LANGGWMGDA AKAFFNEMQT EIFPTMQRLT GVLREAQTVT QQISTIFQQA EQEAAKGITF ADGGASSSAG SGVSFSAAAG NAAGSAAANQ LPPPRMYIVN GINASEPDGT PGEGPQQLAG LLAAHGYDPS QIKAMPAIYN TNYTTNLQGT DLQGTNHGGW LSPVDWLTGA GASIVNGVTG AGASVVNGAS ALFNTGVGVS EVVQEYTMQD QGKYTQESYN FIQQDLARNP LLPGQTVMLI GHSGGGAVVS NLAPMLENNM GVDVSGVVTL GSPVANADRA MQYAKFLSVS DKGDYIGQPW IRSDEGRNFL TPGLMTGILA PKSLPLVVPG VLGADNAARD AGINYFTTNA NAGNPISNHN SYWTSNDVVS IIKNSYPQVA PYLK
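As a rough cross-check between the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it uses the amino-acid assignments of translation table 11, which are the same as those of the standard code. Table 11 also permits alternative start codons, but this gene begins with ATG, so no special handling of the first codon is attempted here.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence as displayed in the record.
print(translate("ATGGGAGCTG AAATCGTTCA AGCTCAGTAT"))  # prints: MGAEIVQAQY
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CCATACCTGAAGTAA"))  # prints: PYLK
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to the same function is the natural way to compare all 424 residues.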