Gene GM21_0943 details


Gene Information

Locus tag: GM21_0943
Symbol:
ID: 8136264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1117060
End bp: 1119021
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 64%
IMG OID: 644868558
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003020767
Protein GI: 253699578
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
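
The start, end, gene length, and protein length values in this record agree with one another, which a short check makes explicit. This is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature with inclusive coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one uncounted terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1117060, 1119021)
print(length, protein_length_aa(length))  # 1962 653
```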


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.0000000395442
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAA GACGCAAGTT AGCAAAGCTT GAGCCGCAAG CCGGGCCGAA GGCACCGCCC 
GCGGCCGAGG AGGAGTTGAA CCTCTGGCGC GATCCCCTGG TTCATGTCCT GCTGGTGCTG
GCGCTGGGCT TCGCGGTCTA CTCGAACATC ATAGCCGCGC CGTTTGTCTT CGACGACCTC
CCTTGCCTGG TCAACAACCC GATCATCAAG GACTTCTCCT TCTTCGCCGA CCCGAAGCAG
GTGTTCGGGC TCCCGATCAA CCCGGACCTG AAGAACAACT TCATCCTGCG CCCGGTAGCC
TACTTCACCT TCGCGCTGAA CCACGCCTTG CACGGGCTGG ACGTGCGGGG TTACCACATC
GTCAACCTCC TGCTGCACAT GGCCGACGCC CTGCTGGTGT ACCTCGTTTC GTGGCTCACC
CTGAGGACGC CGGCGCTGCA ACCGGAGCAG GGTAAAGAGG CGGATCCCCC GACGGAGAAA
TACTTCTATC TCCCGTTTCT GGCGAGCCTT TTGTTCGTCT GCCACCCGCT GCAGACGCAG
TCGGTCACCT ACGTGGTGCA GCGCTTCGTA CCCCTGGTCG CCTTTTTCTA CCTGGGGTCG
CTGGCGCTGT ATGCCGCGGC GAGGCTCTCG GAGACAAAGG GGATTCGGGT CGGCTGCTAC
CTCGGCTCGC TTTTCGCCTG CGTCCTCGCC ATGAAGAGCA AGGAAAACGC CTTCACCCTC
CCGGCGGCGA TCGTGCTGTA CGAGTTCGTC TTCTTCCGGG GCGCGGTCAC CGCCGCCCGG
CTCGCGCGGC TGGTCCCGTT CCTCTTCACC ATGGCGATCA TCCCAGTCAA GCTGATGTCT
CTCTCGGCCA TGGCTGCCAC GGGGGGCAAG GTGGCCGGTG CTGTCAACCT AATCAATTTC
AAACAGACCT CCCCCTGGGA ATACCTGATG ACGCAGTTCG GGGTGATAAC GACCTACTTG
CGGCTGCTCA TCCTGCCGGT CAACCAGAAC TTGGATTATC AGTACCCGCT GCAGAAGGTT
TTCCTCGCCC CGGCCGTAGT CCTGCCGCTG CTTTTGCTGC TGGCGCTGGC AGCCGGGGGT
ATTTATCTTC TGGCGACCTC GCGCAGGGGA GACGACCGGG CCGGCATGCG CGCGCTGGCC
GGCTTCGGCA TCTGGTGGTT TTTCATCACC CTCAGCGTCG AATCGAGCGT GGTGCCGATT
GACGACGTCA TCTTCGAGCA TCGGGCCTAC CTGCCGTCGG TAGGATTCTT CATCGCCCTG
CTCGCCGCGG CGTTCTCCCT CCCCCCCCGC TTCGGCGGGA CCCCGCTTTG CACCTCGCGC
CCAGCGGTCG CCGTTTTCGC CTTCCTGGTA CTCGCCAGTT CAGTCGCCTG CTACCTGCGA
AACGAGGTGT GGACGACGCC GGTGGCCCTG TGGCGGGACA CGGTGCAGAA GAGTCCGGGA
AAGGGGAGGG CGCACTTCTC CCTGGGGTTC GCGCTGGCCA ATACCCTGCC TCCCTGGCAC
ACCGACGACA TCAACGTGAT GCTCCAGCCC ATGGACGCCG CCCAGAACCA GGTGCTGGCG
GAAGCGGTCC GGGAGTTCCG CGCCTCGACG AAGCTCGAAC CCGAATCGGC GGCCGGATAT
TCATTCCTGG GGGCGGCGCT GATGGTGCAA CGGAAGTTCG ACGAAGCGGC GGCCGCATTG
GCGACCGCCG CCGCGCTCGA TCCGAAGGAC GCAAGGACCC GCGCGTTTCT CGGGCAGTTG
AGCGAGGCCC GGGGGGATCT TGCCGCCGCT CGCCTCCAGT ACCGGCAGGC CCTTTCTCTC
AGCCCCCGGG AGCCCTTCCC GCATCTGTTC CTGGCGCTTC TCTCCCTGCG AGAGGGGAGG
CACGCCGAGG CACTTAAGGA GTACGAGATC GCCCACCGGC TCGCTCCCCG CCCCGACCTG
GAGCCGAAGA TGGCGCAGTT GAGATTCATG GTGGGACGAT GA
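
The GC content field can be recomputed from the gene sequence once the ten-base spacing used in this listing is removed. The following is a minimal sketch, assuming the sequence is pasted as plain text containing only A, C, G, and T plus whitespace. `clean_sequence` and `gc_percent` are illustrative names, not part of any existing tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungapped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full listing to check
# the record-level GC content value.
first_line = "ATGAAAAAAA GACGCAAGTT AGCAAAGCTT GAGCCGCAAG CCGGGCCGAA GGCACCGCCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```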
 
Protein sequence
MKKRRKLAKL EPQAGPKAPP AAEEELNLWR DPLVHVLLVL ALGFAVYSNI IAAPFVFDDL 
PCLVNNPIIK DFSFFADPKQ VFGLPINPDL KNNFILRPVA YFTFALNHAL HGLDVRGYHI
VNLLLHMADA LLVYLVSWLT LRTPALQPEQ GKEADPPTEK YFYLPFLASL LFVCHPLQTQ
SVTYVVQRFV PLVAFFYLGS LALYAAARLS ETKGIRVGCY LGSLFACVLA MKSKENAFTL
PAAIVLYEFV FFRGAVTAAR LARLVPFLFT MAIIPVKLMS LSAMAATGGK VAGAVNLINF
KQTSPWEYLM TQFGVITTYL RLLILPVNQN LDYQYPLQKV FLAPAVVLPL LLLLALAAGG
IYLLATSRRG DDRAGMRALA GFGIWWFFIT LSVESSVVPI DDVIFEHRAY LPSVGFFIAL
LAAAFSLPPR FGGTPLCTSR PAVAVFAFLV LASSVACYLR NEVWTTPVAL WRDTVQKSPG
KGRAHFSLGF ALANTLPPWH TDDINVMLQP MDAAQNQVLA EAVREFRAST KLEPESAAGY
SFLGAALMVQ RKFDEAAAAL ATAAALDPKD ARTRAFLGQL SEARGDLAAA RLQYRQALSL
SPREPFPHLF LALLSLREGR HAEALKEYEI AHRLAPRPDL EPKMAQLRFM VGR
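
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the amino-acid assignments of NCBI table 11, which match the standard code (the two tables differ in permitted start codons, which this sketch does not model). `translate` is an illustrative helper, and ambiguous bases such as N would raise a `KeyError`.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) and their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAAAAAAA GACGCAAGTT AGCAAAGCTT"))  # MKKRRKLAKL
print(translate("GTGGGACGAT GA"))                     # VGR
```

The two printed fragments match the first ten and last three residues of the protein sequence listed above.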