Gene GM21_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3024 
Symbol 
ID8138370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3511366 
End bp3512604 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content66% 
IMG OID644870625 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003022811 
Protein GI253701622 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones132 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGC TCTCCTTCCT CGCCTTTATC CTCATCCTGG CGTTCGCCGC CACGGCGGCC 
GCCGTGGATC CCGTCACCTA TGCCGCCAAG GCCCGGCAGA TGATCGAGCG CGGCGAATAC
GAGCAGGCGC TCGACGAGCT GAAGCGGGCC TCCTCGCTCT TTCCCTACGA GCAGTCGCTC
AAGATCAACC TGGCCCTGAC CTACACCGAG ATCGCCAAGC GCGACATGAA CGCGGGGCGC
TACGCCAAGG CGGCCCGCAG CCTTTCCGAG GCGCGGGAAC TCTTGCCGGA GAACAGGGAG
CTGCGGCTCA TGCGCGGCGT GGCCCTCTAC CTGGACAAGG ATTACGCGAC GGCGGCAAGC
GAGTTCCAGG AAGCAGGCGA CGGCGTCGAG CCGCTCATCT ACCTGGGGAA AATCGGCTAC
GACACCGGCG ACCTGCAGGG TGCGCTTTCC TATTGGCGCC GCGCCCGCGA ATTGGAACCT
GACAACAAGA TGCTGGGGAC CCTGATCGCG AAGGCGGAGC GGGAGCTTCC GGTTGAGTCC
CGCATGGACA AGGGGTTCAG CTCCATGTTC GACCTGAGCT TCGACGCTGA ACTCCCCCCG
GGGCTCTCGG CCGAGGTGCT GGACGCCTTG GAGAGCGCCT ACAACTCGGT GGGGGCCGAT
CTCGGGGTTT TCCCGACCGC CCGCATCCCG GTGCTCCTCT ACACCAAGCG CGACTACAGC
AGCGTGACCG CGGGCCCCGA CTGGTCCGGA GGGCTCTACG ACGGCAAGAT CCGGCTCCCG
ATAGGGGGGA TAACCAGGAT CACCCCGCAA CTAGCCGCCG TCATCTTCCA CGAATACACC
CACGTGCTGA TTGCGGAGAT CACCCACGGC AACGTCCCCA CCTGGCTCAA CGAGGGGCTG
GCGGAGATCG AGGGGCGCAA GGAGTTCGTC CACCCCGGCC GCAACCAGAA CCTGGTCGAC
GCCTCGCATC AGTTGCCCCT GGTCACCCTC TCCGGTCCCT TCACCTCCAT GGACGGTCAG
CAGGCGGGCC TCGCCTACCA GCAGAGCTAT TCCATGGCCC AGTTCATGGT GAACCGCTAC
GGGTGGTATG CCGTGCAGAA CGTGCTCAAG AACCTGGGGG AGCGGGCCAC CATGGAGAAG
GCGGTGGCCA ACGCCCTTTC CGACTGGTCG CTCGACCTGC CGGGACTGCT GCGCGAATGG
CAGCAAGCGC TGCCGACCTC CGCCGGCAAG GCTCAGTGA
 
Protein sequence
MKKLSFLAFI LILAFAATAA AVDPVTYAAK ARQMIERGEY EQALDELKRA SSLFPYEQSL 
KINLALTYTE IAKRDMNAGR YAKAARSLSE ARELLPENRE LRLMRGVALY LDKDYATAAS
EFQEAGDGVE PLIYLGKIGY DTGDLQGALS YWRRARELEP DNKMLGTLIA KAERELPVES
RMDKGFSSMF DLSFDAELPP GLSAEVLDAL ESAYNSVGAD LGVFPTARIP VLLYTKRDYS
SVTAGPDWSG GLYDGKIRLP IGGITRITPQ LAAVIFHEYT HVLIAEITHG NVPTWLNEGL
AEIEGRKEFV HPGRNQNLVD ASHQLPLVTL SGPFTSMDGQ QAGLAYQQSY SMAQFMVNRY
GWYAVQNVLK NLGERATMEK AVANALSDWS LDLPGLLREW QQALPTSAGK AQ