Gene Information |
Locus tag | Haur_2195 |
Symbol | |
ID | 5734082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2785027 |
End bp | 2787057 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641279336 |
Product | alpha beta-propellor repeat-containing integrin |
Protein accession | YP_001544963 |
Protein GI | 159898716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
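The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon. Neither convention is stated in the record, although the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2785027
END_BP = 2787057
GENE_LENGTH_BP = 2031
PROTEIN_LENGTH_AA = 676


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect either check, because the length of an interval is the same on both strands.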
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.848428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCAT CATCGTGGTT AATGCATTGG TCGCGGCGCA GCATTTGGAT TTTACTGTTT GGCTTGTTGT CGATGGGCGT TGGTTTTGGA AGCAAGCCTG CTCCATCAAC GGCTGCTGCC TTGTTTACTG CTACTGACTG GCGGGCTATC CGCGCCCTAT TGCCATTGCG GCAACAAGCC TATTTCAAGC CTTCGCAGGT CTCTGCGGAC GATGCAATGG GTTGGAGTAT TGCGGTGTCC GGCGATACAA TTGTGGTCGG TGCGCCTCAC GAGGACAGTA GCACGGCAGG AGTTCAGAAT GGGACGACAC CGACGGTCGA TGAGGTGGCG AGTGATGCGG GAGCCGCCTA TGTGTTTGTG CGGACTGGCA CGAGCTGGAG TCAGCAGGCC TACCTGAAAG CGTCGCAGGT AGCGGCAGGC GATGGCTTTG GCACCAGTGT GGCGGTGTCC GGCGATATCA TTGTGGTTGG TGCGCCTCAC GAGGATAGCG GTACGGTGGG CGTGCAGAAT GGGGCGACAC CGACAATCGA TGAGGCGGCG AGTGATGCGG GAGCCGCCTA TGTGTTTGTG CGGACTGGCA CGAGCTGGAG TCAGCAGGCC TACCTGAAAG CGTCGCAGGT ATCGGCAGGC GATGATTTTG GTGCCAGCGT GGCGGTGGCA GATACAACAG TTGTCGTTGG GGCACACCAA GAGGATGGTA GTACGACGGG TGTACAGAAC GGGGCCACAC CGACGGTTGA TGAGGCAACG AGTGATGCCG GAGCAGCGTA TGTGTTCGTG CGTGATGGGA GAAGCTGGAG CCAGCAGGCC TACCTGAAAG CTTCGCAGGT TTCGGCAGGC GATATTTTTG GTGCCAGCGT GGCGGTGGCA GATACAACAG TTGTTGTTGG GGCACACCAA GAGGATAGTA GCACCGCCGG GGTTTTCCAT AGCGCCGCGC CAACGGTCGA TGAGGCGGCG AGTGATGCGG GAGCAGTGTA TGTGTTCGTG CGCGATGGCA CAAGTTGGAG TCAGCAGGCC TATCTGAAAG CGTCGCAGGT GTCGGCAGGC GATATCTTTG GCTTTAGTGT GGCGGTGGCA GGGGAGACGC TGGTCGTCGG TGCGCCCTAT GAAGATAGTA GTACGGCGGG TGTTTCCAAC AGTGCCACAC CAACGGTCGA TGAAGATATG ACCAACGCTG GAGCAGTGTA TGTGTTCGTG CGCGATGGCA CAAGTTGGGT TCAGCAAGCC TATCTGAAAG CGTCGCAAGT TTCGGCAGGC GATATCTTTG GCTTTAGTGT GGCGGTGGCG GGCGATACTA TCGTAGTCGG TGCGCCCCAT GAAGATAGTA GTACGTCAGG TGTTTCCAGC AGCGCGACAC CAACGGTCGA TGAAGCCATG AGTGATGCAG GGGCCGCGTA TGTCGTCGTG CGCAATGGCA CAACTTGGGT TCAACAAACC TACCTCAAGG CTGCCCAAAC TTCAGTGTCC GACATCTTTG GCTTTAGTGT GGCGGTGGCA GCAGATACGC TCGTTGTTGG TGTACCCTAC GAGGATAGCA GTACGGCGGG TGTTGATCAT AGCACGCTGC CCACGGTTGA TGAGTTAGCA AGCGATGCCG GAGCGGTCTA TGGGTTTACC AATCTGCCAA CCTTGTATCT GCCTTTCGTC ACAACCAGTC AGCCATTGGT GATTGCGCCA CTCACTCCTG TCGCCGTGCC AACAACTCCA GTTGGTACGC CTGGCATGAT TTTCTTAACC AAAACAATCA CGTTGCCAAC CCCATTATCA AGTAGTGGTC ACTATTGGCT ATCCGCAAGC 
CCGACGGCAC TAGTGCCAGG TTTGGTTGAT GATGCGGTGA TTCTGCGAGT GGGGTCAACT GAAATCCTGC GTCATCACTA TGGGATGACG GGCGAACTTC AAGCCGCCTT GGTGGTAGTC CCGGTCGGTG ATCTGTTGCC ATGGGCAGGC CAAACAATTA CCGTCGATTT TACCGATATT TCTGGCCTCG TGTATAGCAC GACCCCGTTA TATTTGGTGT GGACACCCTA A
|
Protein sequence | MLSSSWLMHW SRRSIWILLF GLLSMGVGFG SKPAPSTAAA LFTATDWRAI RALLPLRQQA YFKPSQVSAD DAMGWSIAVS GDTIVVGAPH EDSSTAGVQN GTTPTVDEVA SDAGAAYVFV RTGTSWSQQA YLKASQVAAG DGFGTSVAVS GDIIVVGAPH EDSGTVGVQN GATPTIDEAA SDAGAAYVFV RTGTSWSQQA YLKASQVSAG DDFGASVAVA DTTVVVGAHQ EDGSTTGVQN GATPTVDEAT SDAGAAYVFV RDGRSWSQQA YLKASQVSAG DIFGASVAVA DTTVVVGAHQ EDSSTAGVFH SAAPTVDEAA SDAGAVYVFV RDGTSWSQQA YLKASQVSAG DIFGFSVAVA GETLVVGAPY EDSSTAGVSN SATPTVDEDM TNAGAVYVFV RDGTSWVQQA YLKASQVSAG DIFGFSVAVA GDTIVVGAPH EDSSTSGVSS SATPTVDEAM SDAGAAYVVV RNGTTWVQQT YLKAAQTSVS DIFGFSVAVA ADTLVVGVPY EDSSTAGVDH STLPTVDELA SDAGAVYGFT NLPTLYLPFV TTSQPLVIAP LTPVAVPTTP VGTPGMIFLT KTITLPTPLS SSGHYWLSAS PTALVPGLVD DAVILRVGST EILRHHYGMT GELQAALVVV PVGDLLPWAG QTITVDFTDI SGLVYSTTPL YLVWTP
|
| |
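The sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any processing. The following is a minimal sketch of that cleanup, followed by a GC fraction calculation and a translation. It builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted. This sketch does not handle alternative start codons, which is adequate here only because the gene sequence begins with ATG. The helper names are hypothetical, and only the first 30 bases of the record are used as sample input.

```python
# Codon table in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein sequence.
GENE_PREFIX = "ATGCTTTCAT CATCGTGGTT AATGCATTGG"
PROTEIN_PREFIX = "MLSSSWLMHW"

assert translate(GENE_PREFIX) == PROTEIN_PREFIX
```

Applied to the full gene sequence, `gc_fraction` and `translate` could be compared against the GC content and protein sequence fields in the record. The gene sequence is shown starting with ATG even though the strand is `-`, which suggests it is already given in coding orientation, so no reverse complement step is included.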