Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3541 |
Symbol | |
ID | 5735400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4456122 |
End bp | 4457477 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280688 |
Product | hypothetical protein |
Protein accession | YP_001546305 |
Protein GI | 159900058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000377942 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTTTAC GTCGGATTTT AGGCATCGTC ACTGCGGCTA TCGCTGTCGT GAGTTTAGCT GTTCAACCTC GTTCAACTGC GGCTGCAGAA CCAATCAAAC TTGATACCCA ATGCTTTGAT GTACCAGGGA TTATCAACTG TTTAGATGAT AAATTTCTGA GCTATTGGCG CAGCAATGGT GGTTTGCCAG TGTTTGGCTA CCCAATTACT GCGGCTGCCA ATGAAGTTAA CCCCGATACC CAGCAAAGCT ACTTGACACA GTGGCTCGAA CGTAATCGGT TTGAATTACA CCCCGAAAAT GCAGGTACGC CTTACGAGGT GTTGTTGGGC TTGTTGGGCA AAGAACGCTT GACCCAACTT GGCCGTGAAA TTGAGCCTCG CGAAGCTGGC CCAGTCGATG GCTGCTTATG GTTCGAACAA ACTGGCCACA ATGTCTGTGA TCAAGCAGGT AGTTTGGGTT TCAAGAGCTA TTGGCAATCG CATGGCTTGA AAATTGATGG CCTAGACAAT TATGCTCGTT CATTGCAATT GTTTGGTTTG CCCTTGACCA GCGCTAAGAG TGAAACCAAT GCCAACGGCG ATACAGTTGT CACCCAATGG TTTGAACGTG CTCGCCTCGA ATGGCACCCA AGCAATCCCG ATGAATTCAA GGTGCTCTTG GGCTTGCTCG GTAAAGAAAT TATCGATGGC CGTAGCCAAC CAACTCCACC AACGCCAATT GATCCTTGTG CTTCAACCCC TGATCCAGTG TCAGCTCGCG TGCGCCCAGC CAAATGTGGT GAGCAAGGCA CCGAGTTTTC GTTTGATTTC TATGGTTTCA AGGCCAGCGA AGAGGTTGGC TTCTGGATTA CCAATCCCGA TGGGATCAAT GTTGGGACAC GGCAAACAGC GAAGGTTGGC CCAAACGGGA GCATTAGTGG CCTCCCATTC GATAGCCGCG ATGCCACACC TGGCACTTGG CAATTTACCA TGCAATCAGC CTATCAAAGC CATCAGGCTA TTGTCTATAT TACAGTAATT GCCAAGGCTC CCCAGCCAAC CCCAAACCCA AGCAACTGTA CTAGCACGCC TGAACCAGTT TCAGCGCGAA TTAGCCCAGC AAAATGTGGT CCAGCAGGCA TGGTCTTTAT CTTCGATGTA TTTGGGTTCC AACCCAACGA ACAAGTTGGC TTCTGGATCA CTAATCCCGA CGGAATTAAT GTTGGGATTG CCAATACTAT GAATATTGGC CCCGAAGGTG CAATCTCAGG GATCGAGTTC CCAACTGATG GTTTTACTCC TGGCACATGG CAATTTACCA TGCAAGGGAC GACCAGCAAT CACGCTTCAA TCATCTACTT TACGATTACC GAATAA
|
Protein sequence | MVLRRILGIV TAAIAVVSLA VQPRSTAAAE PIKLDTQCFD VPGIINCLDD KFLSYWRSNG GLPVFGYPIT AAANEVNPDT QQSYLTQWLE RNRFELHPEN AGTPYEVLLG LLGKERLTQL GREIEPREAG PVDGCLWFEQ TGHNVCDQAG SLGFKSYWQS HGLKIDGLDN YARSLQLFGL PLTSAKSETN ANGDTVVTQW FERARLEWHP SNPDEFKVLL GLLGKEIIDG RSQPTPPTPI DPCASTPDPV SARVRPAKCG EQGTEFSFDF YGFKASEEVG FWITNPDGIN VGTRQTAKVG PNGSISGLPF DSRDATPGTW QFTMQSAYQS HQAIVYITVI AKAPQPTPNP SNCTSTPEPV SARISPAKCG PAGMVFIFDV FGFQPNEQVG FWITNPDGIN VGIANTMNIG PEGAISGIEF PTDGFTPGTW QFTMQGTTSN HASIIYFTIT E
|
| |