Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Lferr_2308 |
Symbol | |
ID | 6878302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 53993 |
Kingdom | Bacteria |
Replicon accession | NC_011206 |
Strand | + |
Start bp | 2282264 |
End bp | 2284498 |
Gene Length | 2235 bp |
Protein Length | 744 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642790167 |
Product | Haemagluttinin domain protein |
Protein accession | YP_002220716 |
Protein GI | 198284395 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACTT TCAGAAAAAA AATCCTATTC GTGGCCGTAT CCGGCACCTT TGCAATATCC ATGGCGGGGG GGGGCACGGC GGATGCAAGC ACCGTCCAAG GTGGTGTCGG CCTGGTGGCG GCCCTCACCC AAACCCTGGG ACTCGGTTCG GCGGTCACCC AGTACGGTGA CAGCAACAAT ATCGCATCGG GACCCGGCAG TGCCGCCGGT ACCAATGATA CGGCCGTGGG CGTGAATGCG ACGTCCACCG GCACCAACTC CGTGGCCCTC GGCTACAACA GCAGCGATGG TGGGCAAAAC AACGTCGTAG CGGTGGGTAG CGCCACCCAG CAGCGCAAGA TCATCAACGT GGCGCCGGGC ACCCTGAGCC AGACCAGCAC GGATGCGGTC AACGGCAGCC AGTTGTATGC CACAGACCAA CAGCAACTGA CCAATACCAG CAATATCAGC AATCTCCAGA ATCAGCAGAA GATCGATCAA ACCAATATTT CCCACTTGCA ATCGACGGTC AGCAATATCA GCAACCTGAC CTCGGTGGCC GGTGACCTCA CCGCCATCAA GCAGCAGCAG CAGACGGATA TGAGCAATAT TGCCGTCAAC ACTTCCGACA TCAGCAACCT CAAGGGTCAG CAGGGCACCG ACGTCACCAA CATCAGCAAC CTGCAGAAAC AGCAGGCCAC CGACGTCAGC AATATCGCCA ACAATACCAG CAACATTGCC AGCAATACCT CCAATATTGC CGTCAACACT TCCGACATCA GCAACCTCAA GGGTCAGCAG GGCACCGACG TCACCAACAT CAGCAACCTG CAGAAACAGC AGGCCACCGA CGTCAGCAAC ATCGCCAACA ATACCAGCAA CATCAGCAAT CTGAGCAACG TGGTCGGGGG CTTGACGTCA ACGGCAGTCG ATCTCACCAA AATCAAAAAA CAGCAAGCCA CCGATGTCAC CAATATCGCC AGCAACACCA GCAACATCGC CAGCAATACC TCGGACATCA GCAACCTGAA AAATCAGCAG GGCACCGACG TCACCAACAT CAGCAACCTG CAGAAACAAC AGGCCACCGA CGTCAGCAAC ATCGCCGGCA ACACCACCAA TATCGCCAGC AACACCTCCG ACATCAGCAA CCTGAAAAAT CAGCAGGGCA CCGACGTCAC CAACATCAGC AACCTGCAGA AACAACAGGC CACCGACGTC AGCAACATCG CCGGCAACAC CACCAATATC GCCAGCAACA CCTCCGACAT CAGCAACCTG AAAAATCAGC AGGGCACCGA CGTCACCAAT ATCGCCAGCA ACACCAAAGA CATCAAGAAC ATCAAGACGC AACAGGCTAC CGACGTCAGC AACATCGCCA GTAACACCAC CAATATCGCC AGCAACACCT CCGACATCAG CAACCTGAAA ACCCAGCAGG GTACCGACGT CACCAATATC GCCAGCAACA CCAACGACAT CAAGAACGTC AAGACGCAAC AGGCTACCGA CGTCAGCAAC ATCGCCATGA ACACCAGCAA TATCAGCCAG TTGCAAACCA TCGTCAACGG CAAAGTCGCC ACCTGTCAGG TCGTCAACGG CGGCCTGCAG TGCACCTACG CCCAGGCCAA GGGTACGAAC GACGTGGCAG CAGGAAACGG GGCGCTCGCC AACGGCACCA GCAGTATCGC CATCGGTACC AACGCCACGG CAACCTACAA CGGCGCGGTA GCCATCGGTG ATGGTGCCCG GGCGGTGGCC GATCCCGCGA CAGCCATTGG TGCCAACGCT CAGGCCAATG CCAACAATAG TACGGCTATC GGTGCCAACA GCACGGCCAA CGGGATCAAT TCCGTTGCCC TGGGACAAGG TTCCACCGCC AATCGCGCCA ATTCCGTTTC CGTGGGCAAC GCCTCTACCG GGCTGACGCG ACAGATCACC AATGTGGCCC CCGGCACCAC ACCCAATGAT GTGGCCACCG TCGGTCAGTT GCAGGGTGCC GTCGGGCAGG CGCAGCATTA CGCGGCCCAA GTGGGCTCCG TCAATGCCGC AGCCCTGAAC GCCGCCGCAT CCGCCGCATC CGGTCAGGGA CCCAACACGG TGGCCGGCGG TTATGGTGAG TACGATGGAC AGAGCGCATT CGCGTTTACA TATCAGCACC GCTTCAACTG CAACTGGCAG GCGCTGTTGA CGGTGGGCAG CAACGGTTCG GGCAAAAATA CCGAAGTGGG CGCGGGAGCG TCCTACAGTT GGTAA
|
Protein sequence | MSTFRKKILF VAVSGTFAIS MAGGGTADAS TVQGGVGLVA ALTQTLGLGS AVTQYGDSNN IASGPGSAAG TNDTAVGVNA TSTGTNSVAL GYNSSDGGQN NVVAVGSATQ QRKIINVAPG TLSQTSTDAV NGSQLYATDQ QQLTNTSNIS NLQNQQKIDQ TNISHLQSTV SNISNLTSVA GDLTAIKQQQ QTDMSNIAVN TSDISNLKGQ QGTDVTNISN LQKQQATDVS NIANNTSNIA SNTSNIAVNT SDISNLKGQQ GTDVTNISNL QKQQATDVSN IANNTSNISN LSNVVGGLTS TAVDLTKIKK QQATDVTNIA SNTSNIASNT SDISNLKNQQ GTDVTNISNL QKQQATDVSN IAGNTTNIAS NTSDISNLKN QQGTDVTNIS NLQKQQATDV SNIAGNTTNI ASNTSDISNL KNQQGTDVTN IASNTKDIKN IKTQQATDVS NIASNTTNIA SNTSDISNLK TQQGTDVTNI ASNTNDIKNV KTQQATDVSN IAMNTSNISQ LQTIVNGKVA TCQVVNGGLQ CTYAQAKGTN DVAAGNGALA NGTSSIAIGT NATATYNGAV AIGDGARAVA DPATAIGANA QANANNSTAI GANSTANGIN SVALGQGSTA NRANSVSVGN ASTGLTRQIT NVAPGTTPND VATVGQLQGA VGQAQHYAAQ VGSVNAAALN AAASAASGQG PNTVAGGYGE YDGQSAFAFT YQHRFNCNWQ ALLTVGSNGS GKNTEVGAGA SYSW
|
| |