Gene GM21_0276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0276 
Symbol 
ID8135583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp335165 
End bp337039 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content63% 
IMG OID644867896 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003020118 
Protein GI253698929 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATC CAAAAAACGG CAGGATCTAC CGACCCGCGC TGCTACTCCT GGGCCTTTTA 
TCCTGCGGCT TTACCTGGGG CGCTCCGAGC ATGCCCCCCT GCGACAAGGC AAGGGAGGCG
GTCCGGGAGA TCACGCGGCA GAGTTCGGCC GACCAAAGGC TTGAGGCGGA GAAAAAGGTC
GAAAAACTCT GCGCCGACGG CGGCGCCGCA CATTATCTTA AGGGGCTCGC CCTGGAGACG
GCTCAACGAC AGGAAGAGGC GGTCGATGAG TACCGGACCG CCGTGAAAAA AGAACCCAAG
CTTGCCGAGG CGCACGGCAG GCTCGGGTTA CTCCTTTTTG AAAAGGGGGC GCGTGAAGAG
GCATCGGTTG AACTGTTCGA AGCGTCGAAG GCGAACGCGG ATCCAGCCTA CGCGAGGGCC
TTGGGAGACA TCTTCCAGGC AGCGCAGCTT TACGCCCTGG CCCTGTCGCA GTATCAACAG
GCGCTGCCGC AGTACGGAAA AGACGCGAAG CTGCGTGTCG GCATGGCGCG CAGTTATCTC
GGGTTCGGCG AGCGCGCGAA GGCACGGGAT CTGCTCATCG AGGCGCTAAG GCTCGATCCG
GCGAACCTTC CGGCGCGCCT GGAGCTTGCC GGGATTTACA AGGGGGACAA GCGGTACCAA
GAGGCGTTGG AACAGCTGCG GCAGGCAAGC GCCTCCCATC CCGAGGACCG GGACGTCCAC
TTCCGCCTGG CCCGCCTTCT GGACCTGATG GGAGAGGAGA AGCTCGCCGA TGCGCAATAC
CGGCAGGCCG GGATGGAGCG GGCGGCAAGT CCCGAAGAGC ACCTGAAAAA GGCGGCGCTG
TACCGGCAAG GAACCGCCTT TTCGAAGGCG GCGCGGGAGT ACGAGGCCCT GCTTTTGAAG
CAGCCGGACG CGCCGGGGGT CCGCGAGAAA CTGGGGGATG CACTCCTTGC AGCGGGGCAT
GACGGCGAGG CGATAGCCGC CTACGAGGAA GCGTTGCGGC GCAAGGAAGG ATCGAGCGCG
GTTCTCTACA ACCTGGGCAC CCTCTATGAG CGCAAGGGAG ATCTCGACCA GGCGATGCGC
CGCTTTTCCG AGGCGATACG GCTCGACCCG GAATACGGCG ACGCCCGCAG GAGGCTCGCC
GAAATCCACT CGGTGCGCGG CGATCTGAAC GCCGCCATCG CGCAGTACCG GGAGCTCGTC
TCGCGCCACG GGGACAACCC GCTTAGCTAC TACAAGCTGG CCCGGCTCTA CGAGCAAGGC
CGCCAGTACG CCGACGCCAT CGCCGCCTAC TCCAAGGCCA TCGAGCTCGA CCAGGACAGC
GAGGTCGCCC ACCAGGGGAT CGCGCGGCTC TACCTGAAGC GCAAACAGGC GGAGGAGGCG
GAAAAGCACC TCCTCGAAGT GCTGAGGCTC GACCCGAAGC ACGCCGAGGC GAGGGAGCTC
CTCATCTCGC TGTACGTCAA GGCGCGGCGC TACGACGACA CCGAGAAGCT TCTTAAGGCC
TCGGCGGAGC TGAACCCGGA TAGCGCCAAC GACCAGTACC GGCTGGGGGT CATCTACGCC
TTCCGCGGCA ACAACGACGG CGCGCGGGAG CAGTACCAGA AGGCGCTCGA GCTGAAGCCG
GACCACGCCC GGGCGCTTAA TGCGCTGGGT AAGCTCTACC TGCGACTGGG CCAGAAGGAA
AAGGCCCGCG AAGCACTGGC TGCGGCACGC AAAGCCGACC CGGACCTCCT GGAGCCGGTG
GAGCTCCTGA GCAAGCTGGA CCTCAAAAAG GCGCAAAAGA AGCAGGAGTA CAGGAAACAT
AAGAAAAAGA AGGCGAAGAA GGTTTCGAAG AAGCGCAAGG GGAAGTCTAA AAAGAAGAAG
AAAGGCAGGA GATAG
 
Protein sequence
MNYPKNGRIY RPALLLLGLL SCGFTWGAPS MPPCDKAREA VREITRQSSA DQRLEAEKKV 
EKLCADGGAA HYLKGLALET AQRQEEAVDE YRTAVKKEPK LAEAHGRLGL LLFEKGAREE
ASVELFEASK ANADPAYARA LGDIFQAAQL YALALSQYQQ ALPQYGKDAK LRVGMARSYL
GFGERAKARD LLIEALRLDP ANLPARLELA GIYKGDKRYQ EALEQLRQAS ASHPEDRDVH
FRLARLLDLM GEEKLADAQY RQAGMERAAS PEEHLKKAAL YRQGTAFSKA AREYEALLLK
QPDAPGVREK LGDALLAAGH DGEAIAAYEE ALRRKEGSSA VLYNLGTLYE RKGDLDQAMR
RFSEAIRLDP EYGDARRRLA EIHSVRGDLN AAIAQYRELV SRHGDNPLSY YKLARLYEQG
RQYADAIAAY SKAIELDQDS EVAHQGIARL YLKRKQAEEA EKHLLEVLRL DPKHAEAREL
LISLYVKARR YDDTEKLLKA SAELNPDSAN DQYRLGVIYA FRGNNDGARE QYQKALELKP
DHARALNALG KLYLRLGQKE KAREALAAAR KADPDLLEPV ELLSKLDLKK AQKKQEYRKH
KKKKAKKVSK KRKGKSKKKK KGRR