Gene Information |
Locus tag | GM21_0455 |
Symbol | |
ID | 8135764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 550973 |
End bp | 553522 |
Gene Length | 2550 bp |
Protein Length | 849 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644868073 |
Product | protein of unknown function DUF214 |
Protein accession | YP_003020293 |
Protein GI | 253699104 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTCC TGCTCCGCAC CCTCTCCCTC TCCTACGCCC GCCGGCACTT CGGCAAGACC ATCCTGACCC TCCTGGGGGT GGTCGTCGGC GTCACCACCT TCAGCTCCAT CAAGACCGCG CAGGACGCCC TGGTAAAAGG GATCGGCTCC ACCGTGGACC GCGTCGCCGG CAAGGCGCAC CTGCAGGTGA CCATGGAGGG AGGGGTCCCA GAGGAAATTC AGGAAAAACT GAGATCCCTC CCCGGCATCA GGGCCACCTC GCCGGTGATC GAGCAGGTCG TGGTCCCGGA AAAAGCCGAG CTGGGAAGCC TGATGGTGAT CGGCATCGAC CTTCTGGGCG ACCGCGAGAT GCGCGATTAC GGCTTCGAGG GGGACGACGC CGACCTGGAC GACCCGCTCC TCTTCCTGGC GCAGCCCGAC TCCGCCCTCT TCACTCGCGA CTTCGCCCAA AGGGCCGGGG TCGCCTCGGG AAGCGCGCTC TCGCTCAAGC TCCCGACCGG ATACAAGAAG GTGACGGCGC GGGGGCTGAT GAGCCCCAAA GGTTTCGCCG AGGCCTTCGG CGGCAACCTG ATGGTGGTCG ACGTCTATGC CGCGCAGGAC CTCTTCGGTC GCGGCAGGCG CTTTGACCGG ATCGACGTGC GGCTCCAGGA CGGGGTGACC GTGGCACAGG GGACCGCGAC GCTGCAAAAC GCGCTTGGCC CCGCCTTCCA GGTGGAGACC CCGGCGCGCC GCGGCGAGCA GATGGAGCGC CTGGTGGCCA ACTTCACCGC CGGCTTCAAC GTCTCCAGCG CCTTCGCCCT CGCCATCGGC GTCTTCCTCA TCTTCAACGC CTTCAACGTG GCGGTGAACC GCCGCCGCCG GGACATCGGA ACACTCAGGG CCCTGGGGGC CACGCCGCGC CAGGTCCAGG CCCTATTCCT CGCCGAGGCG CTCGTCCTCG GCATCATGGG CGGGGTGCTT GGGTGTCTCG CCGGGACAGC CTTCTCGCAG GGGCTCCTGG TGAGCATGGG GCAGAGCACC GAGGCGGTCT ACGGCATCAG CGGCTCAGGC GTCGTCCATC TCACCCCGGC CATAATGCTG CAGTCGATCC TCCTCGGGGT CGGCGCCTCG CTCGCCGGTG CCTGGGGCCC GGCCCTCGCC GCCTCGCGCA TCCCCCCGAC GGAGGCCTTC GCCAAGGGCG CCTTCCAGGC CCGCGTAGAG CGCCGGGTAG CGCCGCGGCT GGCGGCCGCG GCTGCGCTTC TCGCGGCGGC CGGCTACTTC GCCCTTTTCG CCGGGCTCAC GGGAAACCAG ATGTTGCTCG CGGTGCTTCT TTTGGGCGGC ACCGGGCTCA TTTTGCTCCT CGGCCCCCTC TCCCGGGCCC TCCTGGTCGC GGCGGCGCCG CTCCTCTCCC GGTTTTTCCC CTCTGCAGGC CCACTCTCCT CGGACGCCCT GCTGGCGAAC CCTCGCAGGA GCGCCGGCAC CATCATGGCC ATGACGCTCT CGCTCACCTT CGTCCTGGGA CTCGGCGGGT ACCTGGGATC CACCAAGGGG ACCATCGTGC GCTGGATGGA CGACGTCCTC ACCTGCGACC TCTTCGTCAG GGCCTCCGCC AACTTCACCC GCGGCGATTT CCTCTATCCA GGCGCGCTTC GCGAGGAGCT GATGCAACTG CCGGGAGTGC GCGCCATCGA GAGCATACGC GCCATCAGGC CGCAGTTTTT GGGGAAGCGG ATCCTGATCA ACTCGGTGGA GATAGGTCAG CTCCTGGACC GCGCCAAGTA CGAGTTCGCG CAAGGTGACG CGCGCGCCAT GCGCGAGGGT 
GCCTCGAAAG GGATGTGCGC CGTGTCCGAA AACTTCTACC GGAACTTCCA TCTCGGGGTG GGGGACCAGG TAGAGCTGAT GACGCCGGGG GGATTGGTCA AATTTCCCAT CTCCGCCGTG GTGCGAGACT ACAGTTCCGA CCAGGGATCG ATACTGCTCG ACCGCCCCGT GTTCCTCAAG CACTGGAACG ACGACCGCGT GGACATCTAC GACGTCTCCG TGCATCCCGG GGTGAACCCC AAGGCGGTCC GGGAGGAAAT CCGGGCCAAA CTGGCGGGGA GGTACCCAGC GCTGGTATCG ACGCGCGCCG AGTTCAAGGC CGAGATAGGG AAGGCCATCG ATGCTTTCTA CGCCGTCATG CGCATCACCG TCTTCCTCGC CCTCGGCGTC GCGTTTTTGG GTATCGTGAC CTCGCTTCTC ATCTCCGTTG CGGAAAGGAC CCGGGATATC GGGATACTCA AGGCGCTGGG CGCCGTCCCG TCCCAGATCG CAGGGAGCAT CGTCATCGAG GCGTTGGTGC TGGCCCTTGC GGGGCTCCTC CTGGCGCTTC CGGCCGGCAA CCTGTTCGCT TCCTTCATGG AGGGGCCGGT CGCCGTCGCC TTCACCGGCT GGAGCATGCC TCACAACTAC CCGTGGGACA CCCTGGGGCA GCTTCTCTTC GCCCTGCCGC TCGTGTCCGC GCTCGCCGCC TGGATTCCGG CCCGGCAGGC CGCGAGGGTC AAGGTGACCG AGGCGATAGA GTACGAATGA
|
Protein sequence | MRFLLRTLSL SYARRHFGKT ILTLLGVVVG VTTFSSIKTA QDALVKGIGS TVDRVAGKAH LQVTMEGGVP EEIQEKLRSL PGIRATSPVI EQVVVPEKAE LGSLMVIGID LLGDREMRDY GFEGDDADLD DPLLFLAQPD SALFTRDFAQ RAGVASGSAL SLKLPTGYKK VTARGLMSPK GFAEAFGGNL MVVDVYAAQD LFGRGRRFDR IDVRLQDGVT VAQGTATLQN ALGPAFQVET PARRGEQMER LVANFTAGFN VSSAFALAIG VFLIFNAFNV AVNRRRRDIG TLRALGATPR QVQALFLAEA LVLGIMGGVL GCLAGTAFSQ GLLVSMGQST EAVYGISGSG VVHLTPAIML QSILLGVGAS LAGAWGPALA ASRIPPTEAF AKGAFQARVE RRVAPRLAAA AALLAAAGYF ALFAGLTGNQ MLLAVLLLGG TGLILLLGPL SRALLVAAAP LLSRFFPSAG PLSSDALLAN PRRSAGTIMA MTLSLTFVLG LGGYLGSTKG TIVRWMDDVL TCDLFVRASA NFTRGDFLYP GALREELMQL PGVRAIESIR AIRPQFLGKR ILINSVEIGQ LLDRAKYEFA QGDARAMREG ASKGMCAVSE NFYRNFHLGV GDQVELMTPG GLVKFPISAV VRDYSSDQGS ILLDRPVFLK HWNDDRVDIY DVSVHPGVNP KAVREEIRAK LAGRYPALVS TRAEFKAEIG KAIDAFYAVM RITVFLALGV AFLGIVTSLL ISVAERTRDI GILKALGAVP SQIAGSIVIE ALVLALAGLL LALPAGNLFA SFMEGPVAVA FTGWSMPHNY PWDTLGQLLF ALPLVSALAA WIPARQAARV KVTEAIEYE
|
| |
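The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, it assumes the start and end coordinates are 1-based and inclusive, and it assumes the gene length includes the stop codon. The small codon dictionary covers only the codons at the two ends of this gene and is not a full translation table 11.

```python
# Hypothetical helpers for sanity-checking the fields of this gene record.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def strip_grouping(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of 10."""
    return "".join(seq.split())


# Partial codon table: only the codons needed for the fragments below.
PARTIAL_CODONS = {
    "ATG": "M", "AGG": "R", "TTC": "F", "CTG": "L", "CTC": "L", "CGC": "R",
    "GAG": "E", "TAC": "Y", "GAA": "E", "TGA": "*",
}


def translate_fragment(dna: str) -> str:
    """Translate an in-frame fragment using the partial table above."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(PARTIAL_CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# Values copied from the record above.
length = gene_length(550973, 553522)
protein = expected_protein_length(length)

# First two displayed blocks and the last twelve bases of the gene sequence.
head = translate_fragment(strip_grouping("ATGAGGTTCC TGCTCCGCAC"))
tail = translate_fragment("GAGTACGAATGA")

print(length, protein, head, tail)
```

Under these assumptions the computed gene length and protein length agree with the Gene Length and Protein Length fields, and the translated ends agree with the start (MRFLLR) and end (EYE, followed by a stop) of the listed protein sequence.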