Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5059 |
Symbol | |
ID | 5737017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 73378 |
End bp | 74598 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282224 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001547815 |
Protein GI | 159901569 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.241627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTTACC AACCTATTGG CTTAGTGTGC ACCGGGGATA CACTGGTCGT GGGGGCACAG GCTGAAGATA GCGCGGCCAC CGGAATCAAC GGCAATCAGG CTGATAATAC TGCGCCCGAT ACCGGCGCTG CCTATGTCTT TGTCCGATCT GGCACAACCT GGACCCAACA AGCCTATCTG AAAGCATCCA ACGCCGAGGC TGGAGATGGC TTTGGCTTCA GTCTTGCGAG TGATACCAAT CGAATTCTGG TTGGAGCACC CTTTGAGGAT AGCAATGCCT CGGGCAGTAA TAATGGTATC GGCGAACAGA ATAATGATCT TCCGGGCGCA GGGGCAGCCT ATCTTTTTCA CCAGTTCAAT GGACTATGGA CGCAAGAGGC ATATCTTAAA ACCAATCATC GTACAAAAGA TGAAGCCTTT GGTCACGCCG TGGCCATGGA GGAAGGAACC ATTGTTATCG GATCACCCTA CGCAGATGGG TATGAGGCCG TAAAAACAGG ACTGATCACC GTCTTTGTTT ATCAAGGAGC AGGAGTAGGA TGGCATCACA GCCAAACCAT GGGAAGTCCA GGCCCAAATA CGGGCGATGG ATTTGGACAA TCGGTCGCGA TTACCAATCA GCGCATCGCG GTAGGAGCCT ATGGCGAAGA TAGTAATGCG ACCCTGATTA ATGGAGATAG CAGCAATAAT ACGGCAGCAA ATGCTGGGGC AGCCTATATC TATGATCGCC ATCCAACCTT TTATGAACAT GTGTGGCATC CAACGACCTA TATCAAGGCA TCGAATACGG ATGCCGTGGA TATTTTTGGC CGGAATCTTG CCTTCTGTGG CCCAACCTTG CTCGTCGGAG CACCCTATGA AGATAGTGCC GCACAAGGCA CCAATGGCAA CCAAACCAAT AATAGTCTCG CGAGTGCGGG GGCCGTCTAT CGCTATATCT GGGATGGCTC GCAGTGGCAG CATCGGCATT ATAACAAAGC CCTCAATCCT GATGCGCTTG ATTATTTTGG CATGAGGCTC GCCTGTCATG ATCAACTCCT TGCCGTTAGC GCCCCGGGTG AAGATAGCGC TGCCCAAGGA GTCAATGGCG ATCAAACAGA TAATTCGGCA CTCGATGCTG GTGCGGTGTA TGTCCTCAGC CTGCCGATGC AAGGCTACAC CCACCTTCCT GCGGTGACCG GTGAAGAAAT CACATCGCCC TATCCGCTGC CGCAACGCTA A
|
Protein sequence | MCYQPIGLVC TGDTLVVGAQ AEDSAATGIN GNQADNTAPD TGAAYVFVRS GTTWTQQAYL KASNAEAGDG FGFSLASDTN RILVGAPFED SNASGSNNGI GEQNNDLPGA GAAYLFHQFN GLWTQEAYLK TNHRTKDEAF GHAVAMEEGT IVIGSPYADG YEAVKTGLIT VFVYQGAGVG WHHSQTMGSP GPNTGDGFGQ SVAITNQRIA VGAYGEDSNA TLINGDSSNN TAANAGAAYI YDRHPTFYEH VWHPTTYIKA SNTDAVDIFG RNLAFCGPTL LVGAPYEDSA AQGTNGNQTN NSLASAGAVY RYIWDGSQWQ HRHYNKALNP DALDYFGMRL ACHDQLLAVS APGEDSAAQG VNGDQTDNSA LDAGAVYVLS LPMQGYTHLP AVTGEEITSP YPLPQR
|
| |