Gene Information |
Locus tag | GM21_3831 |
Symbol | |
ID | 8139205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4412823 |
End bp | 4415927 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644871448 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_003023606 |
Protein GI | 253702417 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| |
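The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes a single terminal stop codon; both assumptions match the reported values but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4412823
END_BP = 4415927


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 3105
print(protein_length_aa(length))  # 1034
```

Both results agree with the Gene Length (3105 bp) and Protein Length (1034 aa) rows.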
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.0133068 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGAT ACGCGCGACT GAACCAGGCG CTGAGCGAGA ACCGGGTGGA GCTGGCGAAT GAGATCTGCC ACGAGCTCTT GCGCGCGGAG CCGGAGAACG TGGAGCTCTT GACGCTGGAC GGCCTTCTGG CCTACCGCCG CGGAAATCTG GAGGAGGCGC TGCAGGCCTT TTCCCGAGCC GCCTTCCTGC AACCGGAAAG CGCCGAGCTG CGCAACAACG TAGGGGTGGC CTACCAGGAT CTCAACTGCC ACGACAGCGC GGCGCTCCAC TTCCGGGAGG CCCTCTCGCT CAGGGGGGAG TACCCCGAAG CCCGCTGCAA CCTGGCGACC GCCCTTTTGC ACCTGGGGGA CGCCGAGGAG GCGATCCGCA ACTACTGCGA CGCCATCGCC GCCGCCCCCG GTTATGCGGA CGCCTATCAC CTCCTCGGCA ACGCGCTCCG CCGCCAGGGG GAGTGGGAAG GGGCGGTGCA GTGTTACCAA AAGGCGCTCG AGCTCGATCC CCAGAACCTG AAGACGCTGG TCAACCTGGG GGGATCCCTC TTCACCCTGA ACCGGTTCGA CGAGGCGATC GCGGCCCAGC GCCGGGCGCT TTCGATCGAC CCGGACCACG TCGACGCGCA CTGGAACCTG GCCCTGGTGC TCTTGACCAC CGGAAATTAC CAAGAGGGGT TTCGCGAGTA CCAGTGGCGC CTAAAGGACC CGGCCGCCGG TTTCCCGGAG AGCTGCGCCG GGAAAAAGCC GTGGGACGGC ACCGCGCTTT GCGGCCGCAC CCTCCTTTTG CGCTGCGAGC AGGGGTTCGG CGACACCATC CAGTTCTTCC GCTACGCCCA GTTGCTGGCG CGCCGGGGGG AGCGGGTGGC GCTTGAGTGC CGTAGCGAGC TCCTGCCGCT GTTGGCCTCG CAGGAACCGG CGATGACCTT TTTCGCCGCC AGCGATCCCC CCCCTTCCTT CGACTGCTTC GCCTACCTGA TGAGCCTGCC GCACCTTTTG GGGACCCGCG TCGACACCAT ACCGCCGCAA GAGCCGCCGC TTCGTGCCGA CCCCGTGCGG AGCGTCCGCT GGCGCGACTG GATCCCCGGG GGGAGCACGA AGGTGGGGGT GGTCTGGGCC GGGAGCGCCG GCTACCGGAA CGACCGCTAC CGATCTTTGC CGGCACGGGC GCTGGCATCC CTGGCCGGCA TCCCGGGGGT CAGGCTCTAC AACCTGCAGC TTCCCGCCGC CGCGGACGAC TTGGCCGCCA TCGGTGAGAT CCGCGACCTG ACCGGCCGCA TCAGGGATTT CTCCGACACC GCCGCGCTCA TCGAAGAGTT GGACCTGGTG GTGTCGGTGG ACACCGCCGT GGCGCACCTG GCCGCCGCCA TGGGAAAACC GGTGTGGCTG CTGCTCCCCT TCTCCTGCGA ATGGCGCTGG CTCTCCGGGC GCAGCGATTC CCCCTGGTAC CCCTCGGTCA CCATCTACCG TCAGCCGTCC TTCGGGGACT GGGAGGGTGC CGTCGCGGCG GTGGCTGCCG ACCTCGCGGC ATGGCCCGCG CGCGCTCAAA CCGAAGCGCC CAGCTTGCCG GCAGTCACAC CGAGTGCGGA CGCATCGGAG GCGGCATCCC GATCCCGGCC GACTCACGGT GCGGAAGCGG CACCGGGAGC GGATCCCGCC CAGGCCCCGC TTATAGCGGC GCCCGGGGAA GACGACCCGA ACCAGGAGTT TCGCCGGGCC AACGCCCTGC GCGCGGCGGG CGAGCTTCCC GGCGCCGTGG CGCTCTACCG GCGCCTGCTG GAGCGGCTCC CCGCATGCGC CGAAATCCAC 
AACAACCTGG GGCTCGCCCT GCAGGACCAG GGGCTAGACA CTGAAGCGGA GCAGAGCTTT CGGCGGGCGC TGGAGCTGAA ACCGGAGCTC GCCGACGCCC ACAACAACCT GGGGACCCTC CTGGTCGCGC GGGGCGAGCA CGAAGGGGCG CTCCCCTTTT TCGAGAAGGC GCTTGAGTTG CGCGAGGGCT ACCTCCCCGC CTACGCGAAC CTCGGTTCCT GCCTGCAGGT GCTGGAAGAG CCGGAGCGCG CCGTCGAGCT TTACCGGCGC GCCATCGCGC TCGACCCCGG CTTTTTCGAG GCGCGCATCA ACCTCGGCAC CGCCTACCAG GACCTTATGC AGCCCGAAAA GGCGATCGAG GTGTACCGGG AACTCCTGGA GCTTGCCCCG GAGCACCCGG AGGCCCACTG GAACCTCGCC TTGAGCCTTT TGTCGGTAGG CGATTTCAAG CGGGGGTGGG AGGAGTACGA GTGGCGACTG GCGAGCGGGG AAGCCCCCCT CTCCCCTCTT CCCTACTGGC GGGGGGAGGA GCTCTCCGGC CAAAGCATCC TGGTCGAGTG CGAGCAGGGG CTGGGGGACA CGCTGCAGTT CGTCCGCTAC CTCCCTCTCC TCGCTGAACG CGGCGGGGAG GTGCTGCTCA AGTGCCAGAA CCTCGGGCTC AAACCACTGC TGGAGCGGGT CCCTGGAGTG GCGGCCGTCT TTGTCCCCGG CGAGGAGCCG CCGGCGTGCT CTTTGCGGGT GAAGCTACTG AGCCTGCCGC ACCTTTTCGG CACCACCCTG GAGGCCATGC CTCAATGGGA CCCTTACCTT CTCGCCGACC AGCGCCGCGC AACCCTTTGG GAGCTGTTGC TGGACCAGGG AAGCGACCTC AAGGTGGGGC TTGTCTGGCG GGGAGGGGCG CTCCCTAGAA ACCGCGCCTG CCCCTTCGGC GAATTCGCCC CCTTGCGCGA CCTTAAGGGC GTCAGCTGGT TTTCGCTGCA GTTGGGCGAG GCGCCCGACC CGGGGGTCCT GGAGGCGACA GACCTGGCGC CGCAGATAAA GGATTTCGGG GATTCGGCGG CGATCCTCTC CGGGCTCGAC CTCTTACTGA CGGTGGACAC GGCCGCCGCG CACCTGGCGG GGGGGATGGG GGTGCCGGTG TGGCTCATGC TTCCCTTCTC CTGCGACTGG CGCTGGATGT CCGGGCGCGA GGATTCGCCC TGGTACCCGA CGCTGCGCAT CTTCAGGCAG GGGCGCCCGG GTGATTGGCC GGGGGTAGTG GGGGGGGTCC GCGGGGCGCT GGAGGAGATG CTGAGGCGTC GATAA
|
Protein sequence | MDRYARLNQA LSENRVELAN EICHELLRAE PENVELLTLD GLLAYRRGNL EEALQAFSRA AFLQPESAEL RNNVGVAYQD LNCHDSAALH FREALSLRGE YPEARCNLAT ALLHLGDAEE AIRNYCDAIA AAPGYADAYH LLGNALRRQG EWEGAVQCYQ KALELDPQNL KTLVNLGGSL FTLNRFDEAI AAQRRALSID PDHVDAHWNL ALVLLTTGNY QEGFREYQWR LKDPAAGFPE SCAGKKPWDG TALCGRTLLL RCEQGFGDTI QFFRYAQLLA RRGERVALEC RSELLPLLAS QEPAMTFFAA SDPPPSFDCF AYLMSLPHLL GTRVDTIPPQ EPPLRADPVR SVRWRDWIPG GSTKVGVVWA GSAGYRNDRY RSLPARALAS LAGIPGVRLY NLQLPAAADD LAAIGEIRDL TGRIRDFSDT AALIEELDLV VSVDTAVAHL AAAMGKPVWL LLPFSCEWRW LSGRSDSPWY PSVTIYRQPS FGDWEGAVAA VAADLAAWPA RAQTEAPSLP AVTPSADASE AASRSRPTHG AEAAPGADPA QAPLIAAPGE DDPNQEFRRA NALRAAGELP GAVALYRRLL ERLPACAEIH NNLGLALQDQ GLDTEAEQSF RRALELKPEL ADAHNNLGTL LVARGEHEGA LPFFEKALEL REGYLPAYAN LGSCLQVLEE PERAVELYRR AIALDPGFFE ARINLGTAYQ DLMQPEKAIE VYRELLELAP EHPEAHWNLA LSLLSVGDFK RGWEEYEWRL ASGEAPLSPL PYWRGEELSG QSILVECEQG LGDTLQFVRY LPLLAERGGE VLLKCQNLGL KPLLERVPGV AAVFVPGEEP PACSLRVKLL SLPHLFGTTL EAMPQWDPYL LADQRRATLW ELLLDQGSDL KVGLVWRGGA LPRNRACPFG EFAPLRDLKG VSWFSLQLGE APDPGVLEAT DLAPQIKDFG DSAAILSGLD LLLTVDTAAA HLAGGMGVPV WLMLPFSCDW RWMSGREDSP WYPTLRIFRQ GRPGDWPGVV GGVRGALEEM LRRR
|
| |
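The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TAA even though the record lists the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons are not given special treatment here. The helper names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last 15 bases of the gene sequence above.
head = clean_sequence("ATGGACCGAT ACGCGCGACT GAACCAGGCG")
tail = clean_sequence("CTGAGGCGTC GATAA")
print(translate(head))  # MDRYARLNQA
print(translate(tail))  # LRRR
```

The two outputs match the first block and the final four residues of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content row.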