Gene GM21_3831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3831 
Symbol 
ID8139205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4412823 
End bp4415927 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content69% 
IMG OID644871448 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003023606 
Protein GI253702417 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.0133068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGAT ACGCGCGACT GAACCAGGCG CTGAGCGAGA ACCGGGTGGA GCTGGCGAAT 
GAGATCTGCC ACGAGCTCTT GCGCGCGGAG CCGGAGAACG TGGAGCTCTT GACGCTGGAC
GGCCTTCTGG CCTACCGCCG CGGAAATCTG GAGGAGGCGC TGCAGGCCTT TTCCCGAGCC
GCCTTCCTGC AACCGGAAAG CGCCGAGCTG CGCAACAACG TAGGGGTGGC CTACCAGGAT
CTCAACTGCC ACGACAGCGC GGCGCTCCAC TTCCGGGAGG CCCTCTCGCT CAGGGGGGAG
TACCCCGAAG CCCGCTGCAA CCTGGCGACC GCCCTTTTGC ACCTGGGGGA CGCCGAGGAG
GCGATCCGCA ACTACTGCGA CGCCATCGCC GCCGCCCCCG GTTATGCGGA CGCCTATCAC
CTCCTCGGCA ACGCGCTCCG CCGCCAGGGG GAGTGGGAAG GGGCGGTGCA GTGTTACCAA
AAGGCGCTCG AGCTCGATCC CCAGAACCTG AAGACGCTGG TCAACCTGGG GGGATCCCTC
TTCACCCTGA ACCGGTTCGA CGAGGCGATC GCGGCCCAGC GCCGGGCGCT TTCGATCGAC
CCGGACCACG TCGACGCGCA CTGGAACCTG GCCCTGGTGC TCTTGACCAC CGGAAATTAC
CAAGAGGGGT TTCGCGAGTA CCAGTGGCGC CTAAAGGACC CGGCCGCCGG TTTCCCGGAG
AGCTGCGCCG GGAAAAAGCC GTGGGACGGC ACCGCGCTTT GCGGCCGCAC CCTCCTTTTG
CGCTGCGAGC AGGGGTTCGG CGACACCATC CAGTTCTTCC GCTACGCCCA GTTGCTGGCG
CGCCGGGGGG AGCGGGTGGC GCTTGAGTGC CGTAGCGAGC TCCTGCCGCT GTTGGCCTCG
CAGGAACCGG CGATGACCTT TTTCGCCGCC AGCGATCCCC CCCCTTCCTT CGACTGCTTC
GCCTACCTGA TGAGCCTGCC GCACCTTTTG GGGACCCGCG TCGACACCAT ACCGCCGCAA
GAGCCGCCGC TTCGTGCCGA CCCCGTGCGG AGCGTCCGCT GGCGCGACTG GATCCCCGGG
GGGAGCACGA AGGTGGGGGT GGTCTGGGCC GGGAGCGCCG GCTACCGGAA CGACCGCTAC
CGATCTTTGC CGGCACGGGC GCTGGCATCC CTGGCCGGCA TCCCGGGGGT CAGGCTCTAC
AACCTGCAGC TTCCCGCCGC CGCGGACGAC TTGGCCGCCA TCGGTGAGAT CCGCGACCTG
ACCGGCCGCA TCAGGGATTT CTCCGACACC GCCGCGCTCA TCGAAGAGTT GGACCTGGTG
GTGTCGGTGG ACACCGCCGT GGCGCACCTG GCCGCCGCCA TGGGAAAACC GGTGTGGCTG
CTGCTCCCCT TCTCCTGCGA ATGGCGCTGG CTCTCCGGGC GCAGCGATTC CCCCTGGTAC
CCCTCGGTCA CCATCTACCG TCAGCCGTCC TTCGGGGACT GGGAGGGTGC CGTCGCGGCG
GTGGCTGCCG ACCTCGCGGC ATGGCCCGCG CGCGCTCAAA CCGAAGCGCC CAGCTTGCCG
GCAGTCACAC CGAGTGCGGA CGCATCGGAG GCGGCATCCC GATCCCGGCC GACTCACGGT
GCGGAAGCGG CACCGGGAGC GGATCCCGCC CAGGCCCCGC TTATAGCGGC GCCCGGGGAA
GACGACCCGA ACCAGGAGTT TCGCCGGGCC AACGCCCTGC GCGCGGCGGG CGAGCTTCCC
GGCGCCGTGG CGCTCTACCG GCGCCTGCTG GAGCGGCTCC CCGCATGCGC CGAAATCCAC
AACAACCTGG GGCTCGCCCT GCAGGACCAG GGGCTAGACA CTGAAGCGGA GCAGAGCTTT
CGGCGGGCGC TGGAGCTGAA ACCGGAGCTC GCCGACGCCC ACAACAACCT GGGGACCCTC
CTGGTCGCGC GGGGCGAGCA CGAAGGGGCG CTCCCCTTTT TCGAGAAGGC GCTTGAGTTG
CGCGAGGGCT ACCTCCCCGC CTACGCGAAC CTCGGTTCCT GCCTGCAGGT GCTGGAAGAG
CCGGAGCGCG CCGTCGAGCT TTACCGGCGC GCCATCGCGC TCGACCCCGG CTTTTTCGAG
GCGCGCATCA ACCTCGGCAC CGCCTACCAG GACCTTATGC AGCCCGAAAA GGCGATCGAG
GTGTACCGGG AACTCCTGGA GCTTGCCCCG GAGCACCCGG AGGCCCACTG GAACCTCGCC
TTGAGCCTTT TGTCGGTAGG CGATTTCAAG CGGGGGTGGG AGGAGTACGA GTGGCGACTG
GCGAGCGGGG AAGCCCCCCT CTCCCCTCTT CCCTACTGGC GGGGGGAGGA GCTCTCCGGC
CAAAGCATCC TGGTCGAGTG CGAGCAGGGG CTGGGGGACA CGCTGCAGTT CGTCCGCTAC
CTCCCTCTCC TCGCTGAACG CGGCGGGGAG GTGCTGCTCA AGTGCCAGAA CCTCGGGCTC
AAACCACTGC TGGAGCGGGT CCCTGGAGTG GCGGCCGTCT TTGTCCCCGG CGAGGAGCCG
CCGGCGTGCT CTTTGCGGGT GAAGCTACTG AGCCTGCCGC ACCTTTTCGG CACCACCCTG
GAGGCCATGC CTCAATGGGA CCCTTACCTT CTCGCCGACC AGCGCCGCGC AACCCTTTGG
GAGCTGTTGC TGGACCAGGG AAGCGACCTC AAGGTGGGGC TTGTCTGGCG GGGAGGGGCG
CTCCCTAGAA ACCGCGCCTG CCCCTTCGGC GAATTCGCCC CCTTGCGCGA CCTTAAGGGC
GTCAGCTGGT TTTCGCTGCA GTTGGGCGAG GCGCCCGACC CGGGGGTCCT GGAGGCGACA
GACCTGGCGC CGCAGATAAA GGATTTCGGG GATTCGGCGG CGATCCTCTC CGGGCTCGAC
CTCTTACTGA CGGTGGACAC GGCCGCCGCG CACCTGGCGG GGGGGATGGG GGTGCCGGTG
TGGCTCATGC TTCCCTTCTC CTGCGACTGG CGCTGGATGT CCGGGCGCGA GGATTCGCCC
TGGTACCCGA CGCTGCGCAT CTTCAGGCAG GGGCGCCCGG GTGATTGGCC GGGGGTAGTG
GGGGGGGTCC GCGGGGCGCT GGAGGAGATG CTGAGGCGTC GATAA
 
Protein sequence
MDRYARLNQA LSENRVELAN EICHELLRAE PENVELLTLD GLLAYRRGNL EEALQAFSRA 
AFLQPESAEL RNNVGVAYQD LNCHDSAALH FREALSLRGE YPEARCNLAT ALLHLGDAEE
AIRNYCDAIA AAPGYADAYH LLGNALRRQG EWEGAVQCYQ KALELDPQNL KTLVNLGGSL
FTLNRFDEAI AAQRRALSID PDHVDAHWNL ALVLLTTGNY QEGFREYQWR LKDPAAGFPE
SCAGKKPWDG TALCGRTLLL RCEQGFGDTI QFFRYAQLLA RRGERVALEC RSELLPLLAS
QEPAMTFFAA SDPPPSFDCF AYLMSLPHLL GTRVDTIPPQ EPPLRADPVR SVRWRDWIPG
GSTKVGVVWA GSAGYRNDRY RSLPARALAS LAGIPGVRLY NLQLPAAADD LAAIGEIRDL
TGRIRDFSDT AALIEELDLV VSVDTAVAHL AAAMGKPVWL LLPFSCEWRW LSGRSDSPWY
PSVTIYRQPS FGDWEGAVAA VAADLAAWPA RAQTEAPSLP AVTPSADASE AASRSRPTHG
AEAAPGADPA QAPLIAAPGE DDPNQEFRRA NALRAAGELP GAVALYRRLL ERLPACAEIH
NNLGLALQDQ GLDTEAEQSF RRALELKPEL ADAHNNLGTL LVARGEHEGA LPFFEKALEL
REGYLPAYAN LGSCLQVLEE PERAVELYRR AIALDPGFFE ARINLGTAYQ DLMQPEKAIE
VYRELLELAP EHPEAHWNLA LSLLSVGDFK RGWEEYEWRL ASGEAPLSPL PYWRGEELSG
QSILVECEQG LGDTLQFVRY LPLLAERGGE VLLKCQNLGL KPLLERVPGV AAVFVPGEEP
PACSLRVKLL SLPHLFGTTL EAMPQWDPYL LADQRRATLW ELLLDQGSDL KVGLVWRGGA
LPRNRACPFG EFAPLRDLKG VSWFSLQLGE APDPGVLEAT DLAPQIKDFG DSAAILSGLD
LLLTVDTAAA HLAGGMGVPV WLMLPFSCDW RWMSGREDSP WYPTLRIFRQ GRPGDWPGVV
GGVRGALEEM LRRR