Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gmet_1201 |
Symbol | |
ID | 3738741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | - |
Start bp | 1348163 |
End bp | 1349710 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637778481 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_384163 |
Protein GI | 78222416 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0514272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.288763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCGT CCAATCCAGC ATTTTCAGGC GACACTGAGC ATCTCAGCGT CCTGCTAAGG GCAGTTGAAC AGTGCGACAT ATCAATCTGG AACAACTGGA GAACGACCAA TCCAGCAATA CGCCCCGCCC TGACAGGTGC TACCCTTGAA GGAGCAGTTC TGTTCGATGC GAATCTGGCA GAGTCAAACA TCGGGAACGC AAATCTTCGC AACTGCCGGA TGATGGGGAC CAATCTGCGG GGCGCCATCC TTGCCGGCAC TGATTTGGAC GGTGCCAACC TTTTCGGTGC AAACTGCCAG GGAGCAAATT TCTGCAACAG CAGTCTTCGG GGAGTAACCC TTCGCTGCGG CATGCAAGGG TGCATTCTCG AAGGTGCGAA CCTTGAGCGC GCCGATATGA TCGGGACCGA TTTCACCGGT GCACGACTGG CAGGAGCATC GTTCCGAAAT GCTGAAGCCT TTGGCGCCCG GTTCCGGAAC GCGATCCTCA CAGGTGCTGA CCTATCAGGA ATCAACTTGG GCACTGCAGA CCTGTTCAAG GCAGAGTGCA ACGAAGCGGT GTTTTCCGAT GCCAGCCTCC TCAGCGCAAC GTTTAGCGAG GCGAGCCTCC GCCATGCATC CTTCTGCCGG GCAGACCTCT CAAATTCCTA TCTCAATAGC GCAACACTCA TCGCCGCCGA TCTCAGCGGA GCTAAACTTG CTTTCGCGAA CCTCATGGAA GCCGATTTGC GCGACGCTAT CTTCCGAGGA ACGAATCTCC AGGGGGCTGT GCTCATGAAT GCGAAACTGA CTGGGGCCGA CCTTCGCAAC GTCAACATGA GTCTATCTAA CCTCACAATG GCCGATCTTG CGGAGGCCAA CCTCACCAAT GCAAATCTGA CGGGCGCCCT AATGGTCGGC ACCGATCTGA GACGCGCCAC CCTCAGGGGA TGCTTGGTCT TCGGGATTTC ACCGTGGGAG GTACTCGCGG AGGATGCCGA CCAACAGGAT CTGGTAGTCA CTAAGTTGAA CCACCCGCAG GTTACGGTAG ACGACATCGA GATGGCCCAG TTTATCTATT TGCTTCTCAA TAACGCCAAG GTACGCCGAG CTGTAGACAC TATAACCTCG AAGATTGTAC TCATCCTCGG CCGTTTCTCT CCAGAGCGAA AATCGGTGCT CGATGCCGTG CGAGATCAGC TTAGAGCCTG CAACTACGTA CCTGTACTCT TTGACTTCAA TCGGCCAGAG TCCCTCGATG AAGTGGAAAC CGTTACACTC CTTGCACGGA TGGCCAGATT CATTATTGCC GACCTGACGG AGCCTCACTG CGTTCCCGAT GAACTACGCT CAATCGTTCC CGATGTAGAA GTGCCTGTTC TGCCAATAAT CGAAGGAGAT AACCCCTACG GAACATTTGG AACTCTCCGC AAATATCCGT GGGTGCTGGA CATACATCGA TATCGGGGCA TTGAAGACCT GCTTGCCTCC TTTGACGAAT CGGTCGTCGT CCCTGCCGAG AACATGACTC GGGAAATTGG AAAGCGTAAA GCACTTCGAA ATAATTGA
|
Protein sequence | MNPSNPAFSG DTEHLSVLLR AVEQCDISIW NNWRTTNPAI RPALTGATLE GAVLFDANLA ESNIGNANLR NCRMMGTNLR GAILAGTDLD GANLFGANCQ GANFCNSSLR GVTLRCGMQG CILEGANLER ADMIGTDFTG ARLAGASFRN AEAFGARFRN AILTGADLSG INLGTADLFK AECNEAVFSD ASLLSATFSE ASLRHASFCR ADLSNSYLNS ATLIAADLSG AKLAFANLME ADLRDAIFRG TNLQGAVLMN AKLTGADLRN VNMSLSNLTM ADLAEANLTN ANLTGALMVG TDLRRATLRG CLVFGISPWE VLAEDADQQD LVVTKLNHPQ VTVDDIEMAQ FIYLLLNNAK VRRAVDTITS KIVLILGRFS PERKSVLDAV RDQLRACNYV PVLFDFNRPE SLDEVETVTL LARMARFIIA DLTEPHCVPD ELRSIVPDVE VPVLPIIEGD NPYGTFGTLR KYPWVLDIHR YRGIEDLLAS FDESVVVPAE NMTREIGKRK ALRNN
|
| |