Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1949 |
Symbol | |
ID | 8324049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 2044208 |
End bp | 2046046 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644953076 |
Product | General secretory system II protein E domain protein |
Protein accession | YP_003110526 |
Protein GI | 256372702 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGCC CGCGGCTTCG CGAGCGCGCA GCTTCGCCAC CGCTCGGCGC CGCGCTCGTG GCCGCGGGCC TGCTCGACGA GCACGACCTC GCCGTGGCAC TCGAGCAGCA GACCCGGAGC CGACGACGCC TCGGTGAGGT GCTCGTACGC CGAGGCATCG TCCCACGCCT CGATCTCGCC CGTGTGCTCG CAGAACGCGC TCGCGTGCCC TTCGTGACCC TCGTCGATCG CGAGCCACGA CGGGAGCTGC TCGAGGGGCT CGACCTCGAG GCGTGCGTCC GCGAGCGGCT GGTCCCGATC GACCGAGACG ACGATGGCAC GCTCGTGGTT GCGAGCTCGG AGGTGCCGAC CGACACCGTG CGCGAGAGCG CCGAACGTCT CAGCGGCGCT CCTGTCCGGC TCGTGCTCAC CACCGAGTGG GACCTGCTCC GCTACATCGT GCGCGCAGGC GCCGATCACA TCGGCCGACG AGCGGCCTAC GGTCTCGCGG AGACGGATCT CGACCTGTCG GCAGCCCACG TGCTCACGAG GGCCCAAGCC ATCGGCCTCG TCGTCGTCGC GGTCGTGGTC GTCCTCGTTG GCATCGTCGA CACCCGAGCA CTCCTCGGGG CCGCACTCGC TGGCGCTGCC ACCCTCCTCG CGCTCGTCGT GCTCTTTCGA GCGGTCGTTG CCTTCCGGGG CGCCGGCGTG CCGTGGGTCG TGCCAGAGCG CCTCCTCGAC GACGCAGACC TGCCGACCTA CACCATCCTC GTCCCGTTGT ATCGAGAGGC GGCGGTGGTC CCGGCGCTCA TGACCTCGCT CGCCAACCTC GACTACCCAC CAGAGAAGCT CGAGGCGCTC GTACTCGTCG AGGCCGACGA CGACGCCACC CGGGACGCGC TCGTCGCGGC GCGCCCGCCG AGCTGGGTCA CCGTCGTGAC CGTGCCGCCC GAGGGGCCCG CGACGAAGCC GAAGGCCTTG AACGTCGGGC TCGCGCTCGC CTCCGGTGAG CTGCTCGTGA TCTACGACGC CGAGGATCGA CCCGAACCCG ACCAGCTCCG GATCGTGGCC TCGATCTTCG CCGACGCCGA CCACGACCTT GCGTGCGTCC AGGCAGCGCT GAACTACCAC AACGCCCGCC ACAACCTGCT CACGCGCCTG TTCACCCTCG AGTACTCCCA GTGGTTCGAT TACCTCCTCC CCGGCCTCGA GGCGCTCGAG CTGCCGATCC CACTGGGAGG GACCTCGAAT CACTTCCGCA CGCAGCTGCT GCGCTCGTTG GGAGGCTGGG ATCCGTTCAA CGTCACCGAG GATGCCGATC TCGGCATTCG GGCCGCGGTG AAGGGTGCCC GGGTCGCCAC CGCCGCCTCC ACCACGTGGG AGGAGGCGAC CGCTCGCCCC GGCGCCTTCA TCCGCCAACG GACGCGCTGG ATCAAGGGCT ACGTCCAGAC AGCCCTCGTG CACGCCCGCC ACCCGATCCG GCTGGTCCGA GCGGTCGGGC CGATCCAGGC CCTCGCCTTC GCCGTCCTCA TCGCCGGCAC GCCGATCGCC TTCTGGACGA TCGTGCCCCT CGACCTGACC TTCGTGGCCT CTCTCGTCCT CTCGCCGCAC GTCCTCGTCG ATCTCGTACC GCGCTGGGCG CTCGCCGTCG GATTCGTCGA CCTCGTCATC GGCAACGCCG TGGTGATCTA CCTCTCCATC GTGGCCGTCT TCCGTCGGCG TCAATGGTCG CTCGTGTGGG CCGCCTTGCT CACGCCGGTC TACTGGATCT TGCACTCGCT CGCGGCCTAT CGAGCCCTCG CCCAGCTCGT GACCCGGCCG CACTACTGGG ACAAGACCGA CCACGGAGTC GCGCTGTGA
|
Protein sequence | MESPRLRERA ASPPLGAALV AAGLLDEHDL AVALEQQTRS RRRLGEVLVR RGIVPRLDLA RVLAERARVP FVTLVDREPR RELLEGLDLE ACVRERLVPI DRDDDGTLVV ASSEVPTDTV RESAERLSGA PVRLVLTTEW DLLRYIVRAG ADHIGRRAAY GLAETDLDLS AAHVLTRAQA IGLVVVAVVV VLVGIVDTRA LLGAALAGAA TLLALVVLFR AVVAFRGAGV PWVVPERLLD DADLPTYTIL VPLYREAAVV PALMTSLANL DYPPEKLEAL VLVEADDDAT RDALVAARPP SWVTVVTVPP EGPATKPKAL NVGLALASGE LLVIYDAEDR PEPDQLRIVA SIFADADHDL ACVQAALNYH NARHNLLTRL FTLEYSQWFD YLLPGLEALE LPIPLGGTSN HFRTQLLRSL GGWDPFNVTE DADLGIRAAV KGARVATAAS TTWEEATARP GAFIRQRTRW IKGYVQTALV HARHPIRLVR AVGPIQALAF AVLIAGTPIA FWTIVPLDLT FVASLVLSPH VLVDLVPRWA LAVGFVDLVI GNAVVIYLSI VAVFRRRQWS LVWAALLTPV YWILHSLAAY RALAQLVTRP HYWDKTDHGV AL
|
| |