Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2676 |
Symbol | |
ID | 8138018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3111286 |
End bp | 3112527 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644870280 |
Product | hypothetical protein |
Protein accession | YP_003022470 |
Protein GI | 253701281 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.0000629143 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAAATA GCAAGGGCTT TACGCTGGTA GAACTACTCG TCGCCATGTC CATTTTCATC ATCATCATGA TCATGGCGGG CAACGCTTTC GAGCGGGTGC TATCGACTGC CGGGCAGCAG TCCAGGTCTG CTCAGAGCAA CTTCGAGGGT GTCGTGGGGC TCGAAATGCT CCGGTACGAT ATCGAGCACG CCGGCTACGG TGTTGCCACA GAATTCAATA ACTACACGTC CGACATAAAG TTCAAAAATC TCGGGGGGAC CGCCGACCAA GAATTAGTGG CTGCGGCCAA CGTGCCGCTT GACGGTTTCA ACTCCACTAC CCTGAACGCC AGGCGCGCCA AGAACATCCA ACCCATCGCG GCCGGGACCT CCACGAAAGA GGTCAACCAT ACGGCGGTTG CGAACGCAGC CGGGGGGCCC GACTATCTCG TGATCCGCTC CACCGTATCC TCGCTCGATG ACTCGGCCAG GAAATGGAGC TACGTGAACT ATTCCACCGA CAGTTCGTCC AACAACCTGA GCGAGATGAA GCTGTGGGAC TACGGCCCTA ACCTAACCTC CAACGACAGG ATCATCGCCA TCCGCGACCG CTTCATTGAC GGGAAGGAAC AAAAGACATT GCTTCTTAAC GGCACCGACG GTTTCGAGCT GAGCCTGACT GCATCGGGCG CCATGCCTGC TGGGGAATAT TTCAAGCCGA CCTCCAAGGA GGACATGGTG GTGTTGTACG GGGTCAGATC TGCGGCCGAC TCCGCCTTGC GTATGCCATA CAACAGAACC GACTACTACG TCACGCGTCC GGCGAGCGGG ATGCCCACCC ACTGCAACCC CGGCACCGGC GTCCTCTACA AGTCCTTGGT TAGCCACGCC ACCGGGGGGT TCGACACAGC CTACCCGCTT TTGGACTGCA TCGCGGACAT GCAGGTCGAG TTCGAGTACG ACCCCAACGA TAACGGCATG ATAGTCCCGT TGGATCCTAT AGGCCTGACA GAGAAAACGG CGGACGATAT GCGTCTGCAC CTGAAAAACG TACGCATCTA CATCCTCACC CACGAGGGGA AAATGGACAA AAGCTACCGG TATCCCGGGG ACAGCATCCA TGTCGGAAAC CGTAGAAACG GCACCTCCTC GGGACGCACC CTGAGCGCCA GCGAGATGAA CAGCCTGTTT ACGAGTGATT GGAGAAAATA CCGGTGGAAA ATCTACACCA TGGTCATCCG CCCCAAGAAT CTGGCCCAGT AG
|
Protein sequence | MRNSKGFTLV ELLVAMSIFI IIMIMAGNAF ERVLSTAGQQ SRSAQSNFEG VVGLEMLRYD IEHAGYGVAT EFNNYTSDIK FKNLGGTADQ ELVAAANVPL DGFNSTTLNA RRAKNIQPIA AGTSTKEVNH TAVANAAGGP DYLVIRSTVS SLDDSARKWS YVNYSTDSSS NNLSEMKLWD YGPNLTSNDR IIAIRDRFID GKEQKTLLLN GTDGFELSLT ASGAMPAGEY FKPTSKEDMV VLYGVRSAAD SALRMPYNRT DYYVTRPASG MPTHCNPGTG VLYKSLVSHA TGGFDTAYPL LDCIADMQVE FEYDPNDNGM IVPLDPIGLT EKTADDMRLH LKNVRIYILT HEGKMDKSYR YPGDSIHVGN RRNGTSSGRT LSASEMNSLF TSDWRKYRWK IYTMVIRPKN LAQ
|
| |