Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1777 |
Symbol | |
ID | 8137108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2066287 |
End bp | 2067267 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869389 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_003021589 |
Protein GI | 253700400 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4608] ABC-type oligopeptide transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.00000396246 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGCGC CCCTGTTGCA GGCCGAAAAA CTGGTCAAGC GCTTTGCCGT GCGCGGCGGT TTCCTGGCGG AGAAAAGGGA GTTGACCGCC GTAGCCGGCG TCGACCTCGA GATCTTCCCG GGCGAGACGC TCGGCGTCGC CGGGGAGTCG GGCTGCGGCA AGTCCACCGT GGCGAGGCTT CTGACCGGGC TCGTCCCCCC TAGCGAGGGG TCGATCCGCT ACGGCGGCCG CGAACTCTCA GCCATGAACA GGGGGGAGCT CGCCCAGTTC CGCCGCGAAG TGCAGATGAT CTTCCAGGAC CCCTTCTCCT CGCTGAACCC GAGGATGCGC GTGGCCCAGA TCGTCGGGGA GCCGCTCGAG ATCCACGGCA TCGGGAGCCC CGCCGAGCGG CGCGAGCGGG TGGCCCGCCT GATGGAACGG GTGGGGCTTT CCCCGGAGCA GCTCTCGCGC TTTCCGCACC AGTTCTCCGG CGGCCAGCGC CAGCGCATCG GGATAGCGCG CGCTCTCGCG GTCTCCCCCC GGCTCATCAT CGCCGACGAG CCGGTTTCGG CGCTCGACCT CTCGATCCAG GCCCAGATCA TCAACCTGCT CCAGGAAGTG AAAATGGACC TGGGGCTGTC GTTTCTCTTC ATCACCCACG ACCTCTCGGT GTTGAGGCAC CTAAGCGACC GGATCGCCAT CATGTACCTG GGACGGATCG TCGAGTCCGG GAGCCGGGAC GACGTACTGT CGAGGCAACT GCACCCGTAC ACGGAGGCGC TTTTAAGCGC CATACCGAGC ATCGACCCGC GGGAAAAAAG CAGGCACGTC GTAGCGCGCG GGGAACTCCC CTCCCCGCTC TCCCCCCCCC CAGGATGCCC CTTCCATACC CGCTGCCCCT ACGCGGAGGC GATCTGCGGC GAGGAGCGCC CCGAGCTTTT GGAGAAGGAA CCCGGCCACT TGGCCGCCTG CCACTTCAGC AAAAGGATCT ACCGCTCCTA G
|
Protein sequence | MTAPLLQAEK LVKRFAVRGG FLAEKRELTA VAGVDLEIFP GETLGVAGES GCGKSTVARL LTGLVPPSEG SIRYGGRELS AMNRGELAQF RREVQMIFQD PFSSLNPRMR VAQIVGEPLE IHGIGSPAER RERVARLMER VGLSPEQLSR FPHQFSGGQR QRIGIARALA VSPRLIIADE PVSALDLSIQ AQIINLLQEV KMDLGLSFLF ITHDLSVLRH LSDRIAIMYL GRIVESGSRD DVLSRQLHPY TEALLSAIPS IDPREKSRHV VARGELPSPL SPPPGCPFHT RCPYAEAICG EERPELLEKE PGHLAACHFS KRIYRS
|
| |