Gene GM21_3825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3825 
Symbol 
ID8139199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4407566 
End bp4409335 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content70% 
IMG OID644871442 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003023600 
Protein GI253702411 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.0624707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGTACC AGGAGAGCGC GGAGAGCCAG GCGCGGCTTA AGTACGTGGA CCTGCTGCTC 
AAGAAGGGGA TGCAGGAGGA GGCCCGGACC CAGTTGCAGC AGCTGGTGCA GGAAGACCCC
GGCTGCGCGC GCGGCTGGTA CCTCCTCGCC GTGCTGGTGG GGGAGCAGGG ACACCCGGAC
CAGGCGGCGA AGCTCCTGAG ACAGGCGCTT AGGGCCGAGC CGGAGAACGT CAAGGCGCTG
AACGCGCTCG GGGTGGCCCT GCAGCAGATG GGGGAGCGGG ACCAGGCCGC GGCATGCTAC
GGCGAGGCGC TCCGCATCGA TCCCCGGTTC CAGGAGGCGC GGGTGAACCT CGCCCTCTTC
CTCAAGGTGG GGATGAGGCT TGCCGAGGCC GAGGCGCTCC TCTCCCGGGG GATCGCGCTC
GAGCCCGCAT CGGTGCGGCT TCGCTACAAC TACGCCAACG TGCTCCATTA CCAGGGGAGA
AGCCTGGAGG CGGCCGGCGC CTACCGGGAG GTGCTCCGCC TGGACCCGCA GCACCTGGAC
GCCAGGCAGA ACCTCCTTTT CGCGCTGCAC TACTCTCCGC AGTTCTCCGA CCGCCGGATC
TTCGCCGAAC ACCTGCGCGC CGCGAGAAGC GCCCCCTTCC GCCTCCCCCC CTCCCCTTCC
GTCCCGCGCC GAGGCGGGCG CATCAGGATC GGCTACCTCT CCCCCGACTT TCGCAGCCAC
GCCGTCGCCT CGTTCATCGA GCCGGTGCTC AAGGCACACG ACCGGGAGCG CTTCGAGATC
TTCTGCTACG CGAACCTCCC CCGCCCCGAC CGGGTCACCG AGAGGGTGAA GGCTTTGAGC
GAGCACTGGC GCGACCTCTA CAACATCCCG GACCAGATCG CCGCGCTGAT GATCGCCGCC
GACGCCCTCG ACGTCCTGAT CGACCTGGCC GGCCACACCT CCGGGAACCG GCTCCCGCTT
TTCGCGCGCA GGCCCGCTCC CCTGCAGATC ACCTGGATCG GCTACCCCGA CACCACCGGG
CTCAAGCAGA TGGATTACCG GATCACCGAC CGCCATGCCG ACCCGCCCGG GAAAAGCGAG
CGCTACCACA CCGAGACGCT GCTCAGGCTC CCGCGCAGCT TCAGCTGCTT TCTCCCGCCG
CAGGAAGCCC CCGAGGTGGC ACCTGTCCCG TGCCTTGCGA CCGGCGCGGT CACCTTCGGC
TCGTTCAACA ACCTGGCCAA GGTCACCCCC GAGACGATCG CCCTCTGGTG CCGGGTGCTC
GATGCCGTCC CCGGCTCGCG CCTGCTGTTG AAGGGGAGGC CCTTCGCCGA CAGCGGGGTA
CGGGAGAGGA TCGCATCCCT GTTCGCCAGA GGAGGGATCG CGGGGGAGCG GGTCGAGCTA
CACCCGGGCG AGCCGGAGAA TTCGGCGCAC CTGGCGCAGT ACGGGCGGGT CGACATCGCC
CTCGACACCT TCCCCTACAA CGGCACCACC ACCACCTGCG AGGCGCTCTG GATGGGGGTC
CCGGTGGTGA CCCTCGCGGG TACGAGGCAC GCGGCGCGGA CCGGCGCGAG CATCCTTACG
AACTGCGGGC TCGATGAGCT GGTGGCCGAG GACGAGGGGG AATACCTGGA GATCGCCCGG
CGGCTGGCGG CGGATCGGGG GAGGCTTTCG GAGTTCAGGA AGGGGGCGCG GGAAAGGCTC
GCGGCGTCGC CGCTACTGGA CGCGGCGGGG GTGACGCGGG AGTTGGAAGC GGCCCTGGAG
GGGGTCCTCA AGGAGCGCGG GGTGCGTTAG
 
Protein sequence
MGYQESAESQ ARLKYVDLLL KKGMQEEART QLQQLVQEDP GCARGWYLLA VLVGEQGHPD 
QAAKLLRQAL RAEPENVKAL NALGVALQQM GERDQAAACY GEALRIDPRF QEARVNLALF
LKVGMRLAEA EALLSRGIAL EPASVRLRYN YANVLHYQGR SLEAAGAYRE VLRLDPQHLD
ARQNLLFALH YSPQFSDRRI FAEHLRAARS APFRLPPSPS VPRRGGRIRI GYLSPDFRSH
AVASFIEPVL KAHDRERFEI FCYANLPRPD RVTERVKALS EHWRDLYNIP DQIAALMIAA
DALDVLIDLA GHTSGNRLPL FARRPAPLQI TWIGYPDTTG LKQMDYRITD RHADPPGKSE
RYHTETLLRL PRSFSCFLPP QEAPEVAPVP CLATGAVTFG SFNNLAKVTP ETIALWCRVL
DAVPGSRLLL KGRPFADSGV RERIASLFAR GGIAGERVEL HPGEPENSAH LAQYGRVDIA
LDTFPYNGTT TTCEALWMGV PVVTLAGTRH AARTGASILT NCGLDELVAE DEGEYLEIAR
RLAADRGRLS EFRKGARERL AASPLLDAAG VTRELEAALE GVLKERGVR