Gene Information |
Locus tag | Haur_1818 |
Symbol | |
ID | 5733676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2115342 |
End bp | 2116775 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278961 |
Product | coagulation factor 5/8 type domain-containing protein |
Protein accession | YP_001544589 |
Protein GI | 159898342 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4305] Endoglucanase C-terminal domain/subunit and related proteins |
TIGRFAM ID | |
|
|
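The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2115342
END_BP = 2116775
REPORTED_GENE_LENGTH = 1434      # bp
REPORTED_PROTEIN_LENGTH = 477    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming both coordinates are 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print("gene length matches:", n == REPORTED_GENE_LENGTH)
    print("protein length matches:", protein_length(n) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 2116775 - 2115342 + 1 gives 1434 bp, which is 478 codons, or 477 residues plus one stop codon, in agreement with the reported values.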
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000535261 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTGGC GTAAAGGCAC TTTTTTCAGC TTTGGTTTAA TCTTGAGTTT ATTGTTAACC TCAGCGCTAT TTGTTGCCCA AGGCAGCGCC CAACGTATGC AAACGATTGC TGATCCATGG GCGTTGCGTA GCGGGCGGGC AACGTTTTAT GATCCAACGG TAGGTATGGG TAATTGTAGT TTGCCAGTGC CCAGCGATAT GCTTTTGGCA GCCATGAATA CGACCGATTA TGGCTTGGCC GATTATTGCG GAGCCTATGT GACGGTCAAT GGGCCACGGG GCAGCGTTAC GGTAAAAATT ATTGATCGTT GCCCTGGCTG TGTGGTGGGC GGGATCGATC TTAGCCCGCA GGCTTTTGAG CGAATTGCAG CGCTTGAGGC GGGGAATGTA CCAATTACTT GGCAATTGAT CAGCGCACCA AATGTCAGTG GCAACGTTAT CTACAACTAT AAAGAAGGCA GTAGCCAATG GTGGGCTGGC GTACAAGTCC GCAATCATCG TAATGCAATT GCCAAGTTTG AATATCGCAA TCCTCAGGGT ATTTTCCAGA ATGTTGATCG CGTCCAGTAT AATTATTTTC TGGTGCCAGG TGGAATGGGC ACTGGCCCCT TTACCTTTCG CCTGACTGAT GTTTATGGCA ATGTCTTTAC CGATAATACT ATTCCATTGC GCTCAGAGGG TGATGTCGCC GGAAATCAAC AGTTTCCTTT TGTGCCAACG CCTGGCAGTA CGCCGACCGC CAGCCCAACC CGCACCGCCA GCCCAACCGT GGTTAGCCCA ACTACTCGGC CTGACCAAAC CATTATGAAT GCTTCATCGA CTGAAGCTGG CGGTAGCACG CACTATGCGC TTGATGGCAA CCTCAACACC CGATGGAGTA GTGGCTTACC ACAGGCAGCT GGTCAATGGA TTTATATCGG TTTGCCGCGG GTTACGCCTA TCTCAGGCAT TAAACTTGAT GCAGGCAGTT CAGGTGGCGA TTATCCAGCA GGGTTTATTG TTCAAACTCG CGACGACACC AGCGATTGGG CAACTGTGGC GACAGGAAGC GGCTCAAGCC AAATCACAAC CATCAACTTT GCTGCTCGGA ATGCTCGCTA TGTTCGCATT GAGCTAACTG CTCGTTCGGC TAATTGGTGG TCGATTCATG AATTGACGGT GATATTCGCT GCTCAACCTG CGACGCTTAC CCCAACGATT TTGCCGCCGA CAAACACAAT TGTGCCTGCC ACCGTCACCC CAACCCTTGT GCCAACCATG GCTCCACCGC CATTGACTGC CACCCCAACC CTGAGTGCTT GGCAAGCCTA TACTAATTAT AGCGTTGGCA GCCTTGTACA GCACAATGGT ATTAACTATC GTTGTATTCA AGCCCATACT TCGTTACCAG GTTGGGAGCC ACAGATCGTT CCAGCACTTT GGCAACCACT TTAA
|
Protein sequence | MAWRKGTFFS FGLILSLLLT SALFVAQGSA QRMQTIADPW ALRSGRATFY DPTVGMGNCS LPVPSDMLLA AMNTTDYGLA DYCGAYVTVN GPRGSVTVKI IDRCPGCVVG GIDLSPQAFE RIAALEAGNV PITWQLISAP NVSGNVIYNY KEGSSQWWAG VQVRNHRNAI AKFEYRNPQG IFQNVDRVQY NYFLVPGGMG TGPFTFRLTD VYGNVFTDNT IPLRSEGDVA GNQQFPFVPT PGSTPTASPT RTASPTVVSP TTRPDQTIMN ASSTEAGGST HYALDGNLNT RWSSGLPQAA GQWIYIGLPR VTPISGIKLD AGSSGGDYPA GFIVQTRDDT SDWATVATGS GSSQITTINF AARNARYVRI ELTARSANWW SIHELTVIFA AQPATLTPTI LPPTNTIVPA TVTPTLVPTM APPPLTATPT LSAWQAYTNY SVGSLVQHNG INYRCIQAHT SLPGWEPQIV PALWQPL
|
| |
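The gene and protein sequences above are printed in space-separated blocks of ten characters, and the protein listing omits the terminal stop. As a rough sketch of how the two relate, the snippet below strips the spacing and translates short excerpts copied from the gene sequence. It assumes that internal codons in translation table 11 are read with the standard codon assignments (table 11 differs mainly in which codons may act as starts, which does not matter here because the gene begins with ATG). All helper names are hypothetical. The `gc_percent` helper could be applied to the full gene sequence to compare against the reported GC content.

```python
# Codon table built in TCAG order, assumed valid for internal codons of table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Excerpts copied from the Gene sequence field: first 30 and last 24 bases.
FIRST_30 = "ATGGCTTGGC GTAAAGGCAC TTTTTTCAGC"
LAST_24 = "CCAGCACTTT GGCAACCACT TTAA"

if __name__ == "__main__":
    print(translate(FIRST_30))  # MAWRKGTFFS
    print(translate(LAST_24))   # PALWQPL*
```

The two excerpts translate to the first ten and last seven residues of the protein sequence shown above, with the final TAA read as the stop codon.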