Gene GM21_1931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1931 
Symbol 
ID8137265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2241220 
End bp2242260 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content63% 
IMG OID644869545 
Productphosphoesterase RecJ domain protein 
Protein accessionYP_003021742 
Protein GI253700553 
COG category[R] General function prediction only 
COG ID[COG0618] Exopolyphosphatase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCA GTACGAAGCC GCCCGACCTT TTGGCCTACA CCGATTCCCT GCTGACCTGG 
GTCAGCGGCA GGGGGCGCAT CCTGATAGTG GTGCACGACA ACCCCGACCC CGATTCGCTC
GCCTCCGCCA TGGCGCTGCG GCACTTCTTC GCGGTAAAGC TGAACCGCGA GGCGGTGATC
GCCTTCTCCG GCATGATCGG CAGGAGCGAA AACCTCGCCA TGGCGAAACT GCTCCAGATT
CCGCTCACCC CGTTGCCCCT TATCGACCTC AAGTCCTTCC AGGTGGTCTG TCTGCTGGAC
ACGCAACCCG GCACCGGGAA CAACTCGCTC CCCGCAGGGA CGCGCACCGA CATCGTCATC
GACCACCACC CCATGCGGGA ATTCAGCGCC GCCTGCCGCT GGGTCGATAT CCGCCCCGAT
TACGGGACCA CGGCCACCAT CCTCTACGAG TACCTGAAGG TGCAGGGGGT CTCGATCGGG
ACCAAGATGG CCACGGCACT CTTCTACGCC ATCAAGTCCG AGACGCAGGA TCTGGGGCGT
GAGGCGAGAA GGGCGGACCG CGACGCTTAT CTCGATCTTT TCCCGCTGGC CAACAAGACC
CTTTTGAACA GCATCACCCG CCCGAGCCTT CCCCGCGAAT ACTTCATCTC GCTGCATAGC
GCGCTGGAAC ACGCGGCGCT CTACGGCAAC GTGCTGGTGG CGTCGCTCAA GGGTATCCAG
TTCCCCGAGG TGGTCGCCGA GCTCGCCGAT CTCCTGGTTC GGCTGGAAGG GACTGAGACG
GTACTCTGCC TGGGGCATTA CAGCGCCGAA CTGGTCCTCT CCATCAGGAC CTCGAACGAG
GAGATAAATG CCGGCGAACT GATCCGCAAG CTGGTCGCCG GTATCGGCTC TGCGGGCGGC
CACGGCATGA TGGCCGGAGG CAAAATCGAC TTGAGCGACA ATTCGGAGGA GGCCATTCGC
GAACTGGAGA ACCTCCTCAC CGAACGGCTG CTCGCCGAGA TGAAGATTTC CGATCCAAAA
CCGGTGCCGC TGGTCCCCTA A
 
Protein sequence
METSTKPPDL LAYTDSLLTW VSGRGRILIV VHDNPDPDSL ASAMALRHFF AVKLNREAVI 
AFSGMIGRSE NLAMAKLLQI PLTPLPLIDL KSFQVVCLLD TQPGTGNNSL PAGTRTDIVI
DHHPMREFSA ACRWVDIRPD YGTTATILYE YLKVQGVSIG TKMATALFYA IKSETQDLGR
EARRADRDAY LDLFPLANKT LLNSITRPSL PREYFISLHS ALEHAALYGN VLVASLKGIQ
FPEVVAELAD LLVRLEGTET VLCLGHYSAE LVLSIRTSNE EINAGELIRK LVAGIGSAGG
HGMMAGGKID LSDNSEEAIR ELENLLTERL LAEMKISDPK PVPLVP