Gene GM21_2659 details


Gene Information

Locus tag: GM21_2659
Symbol:
ID: 8138001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3094807
End bp: 3095976
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 63%
IMG OID: 644870263
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003022453
Protein GI: 253701264
COG category:
COG ID:
TIGRFAM ID:
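
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3094807, 3095976)
print(length, protein_length_aa(length))  # prints: 1170 389
```

Both results agree with the Gene Length and Protein Length fields of the record.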


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.0000575717
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTAGAGG ACGCTTTATC CTTCTGGACG GAGATACAGC GCTACGAGGA CATGCTGGCA 
GCCGACGCCA AAAGCTTGTG CTTCGCGCCC CTCTCGGAGC TGTACCGAAA GCTCGGCCTT
CTTGACGATG CCATCATGGT GGCCGAGAAA GGGTGCGCCG CACATCCCGA CCTTCCTGCC
GGTTTCCTGG CCCTTGGCAC CGCCTGTTAC GCGAAAGGGC TCACCGGCCA GGCGCGCAGC
GCTCTTGAGC GTGCGGTGGC GCTCCAGCCG AATCATCTTG AGGCCCTGAA ACTCCTGGGA
CAGCTCTATG TCGAACAAAA CGAGGTGGGT CTGGCGCGCA AGGTGCTGGA GCAGGTGCTA
CAGCAGGACC CGGACGACCT GGAGAGTTCG CTTTTGCTGA ACTCGGTCGC CGTCCCTCCC
TCCTACGAGG AGCCTGAGGA ACTGCTTGAG GATCTCGAGA TCATAGAGGA ACTTGAAGAG
GTGGTGGAAG AGGTGGTGGA ACCGGCCCGC CCCCAGGCGT CTGCTGCCGC GGCACCCCTG
GAAGACGACG ACATCTGGGC CATCGAGGAC CTGGAAGAGG TCGAGGTCGA GCCTCTTTCT
CCGCAACGCA GCGAGGCGGC CACGCCCGAT CCGCTGACCA CTGCGACCCT GGCGGAACTG
TACGTGTCGC AAGGGTTCAT CGAGAAGGCG CTGGGGATCT ACCGGGAACT CATCACCGCG
CACCCGGCCA ACTCGCAGTA CCGGTTACGC TGCGCCGAAC TTCAGGAAAT GCTGGAATTG
CAGCAGATGC CTGCCGGGCC CGCAGGTTCG GCAAAGGGAC TGGCTGCCGC GACAGCGCCT
GCCGTGGCGG AACTGCAGGA GGAAACGGAC GAGCTCGAAA CGGGATGGGA TGTCCCTGTC
GAGACCGGTG AACCGGTCGA GCCGGAAGGC GTGCCTGAAA CGGAGCGGAG CTCCAAATTG
GTGATACCCG CCGCCCTGGA GACGCCGGCG ACGCTTGAAT TTCCGGCAAC CGCAGAGGCG
GCGTTTGCCA TCGATTCACC GGCCGTAATG GAAGAGCCTG CCTTGGAGGC ATTGGAGACG
AAGAGCGCGC CGTCCGGTGG CGCCGGCGAA GTCGAGGTCG AGCTGCAACG CTGGCTGGAA
AACATAAGGA GAAGAAGAGA TGGGGTTTAA
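
The gene sequence above is laid out in space-separated blocks of 10 bases. A short sketch of how such a listing might be joined into a single string and its GC content computed is given here. The helper names are hypothetical, and only the first line of the listing is used as sample input; running it on the full sequence would give the figure to compare with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied as formatted in this record.
first_line = "ATGGTAGAGG ACGCTTTATC CTTCTGGACG GAGATACAGC GCTACGAGGA CATGCTGGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```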
 
Protein sequence
MVEDALSFWT EIQRYEDMLA ADAKSLCFAP LSELYRKLGL LDDAIMVAEK GCAAHPDLPA 
GFLALGTACY AKGLTGQARS ALERAVALQP NHLEALKLLG QLYVEQNEVG LARKVLEQVL
QQDPDDLESS LLLNSVAVPP SYEEPEELLE DLEIIEELEE VVEEVVEPAR PQASAAAAPL
EDDDIWAIED LEEVEVEPLS PQRSEAATPD PLTTATLAEL YVSQGFIEKA LGIYRELITA
HPANSQYRLR CAELQEMLEL QQMPAGPAGS AKGLAAATAP AVAELQEETD ELETGWDVPV
ETGEPVEPEG VPETERSSKL VIPAALETPA TLEFPATAEA AFAIDSPAVM EEPALEALET
KSAPSGGAGE VEVELQRWLE NIRRRRDGV
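
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA). It builds the codon table from the compact NCBI-style amino acid string. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, so the standard assignments are used here. The `translate` helper is hypothetical and does not handle ambiguous bases.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence listed above.
print(translate("ATGGTAGAGG ACGCTTTATC"))  # prints: MVEDAL
```

The output matches the first six residues of the protein sequence in this record.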