Gene Information |
Locus tag | Haur_5100 |
Symbol | |
ID | 5737058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 128914 |
End bp | 130560 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282265 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001547856 |
Protein GI | 159901610 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
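The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on the replicon.
START_BP = 128914
END_BP = 130560


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1647
print(protein_length(length_bp))  # 548
```

Both results match the Gene Length (1647 bp) and Protein Length (548 aa) rows of the record.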
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.908648 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATC ATACCCCTCA TCAGAAACCC TTTGTATCGG ATAGCCCGCC CTCCAATCGA TGGCGTGCAT TCCGCCATTG TGTTGGAATC AGCATGTTGT TTGTACTGAT TCAGGGATCG GTTCTTGTCT ATACGAGAGG ATCGCTACCG GTAGAACACG CAATCGCGCA ATCAGCCCAA CACCCGCATG GATCGCCGAA CCTCCTTCCT ACCTATATTA AGGCATCGAA TACCGATCCT CACGATGCGT TCGGCTTTCG CGTAGCGCTT GATGCTACCA CACTGGCAGT AAGCGCCCCA TACGAATCAA GTGCCGCAAC GGGTATTCAG GGTGACCAAT CAAATAATAT GGCGCTCCAA TCAGGAGCCG TGTATATTTT TGTCCGGGAT GGCGATACAT GGGTCCAACA AGCGTATCTC AAAGCGTCCA ATACCGACGC CGGCGATGGC TTTGGGGTCA GTCTTGCCCT CGATGGGGAT ACGCTTGTGG TTGGGGCGTA TGCTGAGGAC AGTGCTGCCA CCGGAATCAA CGGCAATCAG GCCGATAATT CCGCTGCGAA CGCGGGGGCG GCCTATGTCT TTGTCCGATC AGGGTCAACC TGGAGTCAGC AAGCCTATCT GAAAGCATCC AATACTGATG AAGGCGATGG GTTTGGCTAT AGGGTTGCGA TTGATGCAAC CACAGTCGTG ATTAGCGCCC GTGGCGAAGA TAGTGGAGCA ATGGGGGTAA ATAATGATCA GGCGAACAAT GATAAAGTGG ACGCGGGGGC GGCCTATGTT TTTGTCCGAT CAGGTTCAAC TTGGAGTCAG CAAGCCTATC TCAAAGCATC CAATACCGAT GCAGACGATG GGTTTGGCTA TAGTGTATCA ATTGAGAACC AGCTGATCGC CGTTGGCGCG AATGGGGAGG ATGGGAGTAC GACTGGCGTA AATGGAGGGC AGGACGATAA TACTGCTCCG GACGCAGGGG CGGCCTATGT CTTTGTCCGA TCGGGTTCAA CTTGGAGTCA GCAAGCCTAT CTCAAAGCAT CCAATACCGA TGCAGACGAT GGGTTTGGAC AGCGTGTTCA GCTTGCAGGA TCAACGGTAG TGGTGAGTGC CGTTCGGGAA GATAGCGCCG CCACCGGAGT CAATGGCAAT CAGCATGATA ATACTGCCAT GGATGCAGGA GCGGCTTATG TCTTTGTTCA GAATGGGAAT ACGTGGAGTC AACAAGCCTA TCTAAAGGCC TCAAATACTA ACGCAGGCGA TGGGTTTGGC TATAATCTCC ATGCGTTGGG TGATTGGATA CTGATTGGCG CACCATATGA GGCGAGTGCG GCCACAATCA TCAACGGGAA TCAGCATGAT AATAATGCCA ACCGTGCAGG AGCCGCCTAT CTTTTTGCGC GGCAACAGAC ACTATGGAGT CAGTCCGCCT ATCTGAAAGC CATGAATACC GATTCAGGCG ATCTCTTTGG GAATACTATG GGCATGAATG AGTCACTCAT CATCGTTGGA GCGTCAAATG AAGATAGCAA TACCCTCGGG ATCAATGGCG ATCATGCGAA TAATCTAGCC CTTAATTCAG GCGCAGTCTA TAGTTTCCCA TTTGCCATGA TTCCTTCCAT ACGGGCGTAT CTCCCATTGA CCACCCGGGG TGAATAG
|
Protein sequence | MDDHTPHQKP FVSDSPPSNR WRAFRHCVGI SMLFVLIQGS VLVYTRGSLP VEHAIAQSAQ HPHGSPNLLP TYIKASNTDP HDAFGFRVAL DATTLAVSAP YESSAATGIQ GDQSNNMALQ SGAVYIFVRD GDTWVQQAYL KASNTDAGDG FGVSLALDGD TLVVGAYAED SAATGINGNQ ADNSAANAGA AYVFVRSGST WSQQAYLKAS NTDEGDGFGY RVAIDATTVV ISARGEDSGA MGVNNDQANN DKVDAGAAYV FVRSGSTWSQ QAYLKASNTD ADDGFGYSVS IENQLIAVGA NGEDGSTTGV NGGQDDNTAP DAGAAYVFVR SGSTWSQQAY LKASNTDADD GFGQRVQLAG STVVVSAVRE DSAATGVNGN QHDNTAMDAG AAYVFVQNGN TWSQQAYLKA SNTNAGDGFG YNLHALGDWI LIGAPYEASA ATIINGNQHD NNANRAGAAY LFARQQTLWS QSAYLKAMNT DSGDLFGNTM GMNESLIIVG ASNEDSNTLG INGDHANNLA LNSGAVYSFP FAMIPSIRAY LPLTTRGE
|
| |
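The relationship between the gene and protein sequences can be illustrated with a short sketch that translates the first 30 bases of the gene sequence and computes their GC fraction. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only a prefix of the sequence is used to keep the example brief.

```python
# First three 10-base blocks of the gene sequence, copied from the record.
PREFIX = "ATGGACGATC ATACCCCTCA TCAGAAACCC"

# Codon table built in TCAG order; amino acid assignments are those shared
# by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(PREFIX))    # MDDHTPHQKP
print(gc_fraction(PREFIX))  # 0.5
```

The translated prefix matches the first ten residues of the protein sequence in the record. Applying the same functions to the full gene sequence would be the way to compare against the reported protein and GC content fields.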