Gene Gmet_2041 details


Gene Information

Locus tag: Gmet_2041
Symbol:
ID: 3739821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 2284638
End bp: 2285858
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 46%
IMG OID: 637779335
Product: putative glycosyl transferase
Protein accession: YP_384995
Protein GI: 78223248
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
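
The start, end, gene length and protein length values listed above are arithmetically linked. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2284638, 2285858)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.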


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.221622
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0364064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAATTC TTGCCTTCCC CTATACTCAT ACACTGTCTC ATCTCAGCCG CGTTCTGGCA 
GTCGCTCTTG AGTTACGAAG GATGGGCCAT GAGGTAGTTT TTGCCGGCGA GAGCGCCAAG
GTTTCATTTG TATCGCAGCA GGGCTTCGAT GTGGTACCCA TTCATGAACC TGATCCTGAG
ATGCTTTTTG GCAACATTCG TTCCGGTAAG CTTCGATTTG TTGAAGATGC CGAACTGTTG
CAAATGCTGA CGGCAGATAT AGAGGTAATT AGGTCCTTGA AACCTGATTT GGTGCTTTCA
GACGGTAGAT TCAGTGCGCC CTTGTCTACT CATCTGACCA ATGTCAGGCA CGCTGCTATT
GTGAATGCTT CATCAACGGA GTACCGAGCA CTCCCCTATG TGCCTTTTTT TGATTGGATG
CCCCCCTGGC TGATTAGTCG CGATGCGATG ATCTGGAAAG CATTGGTCCG GCTGAATCTT
TTTCTCGAAA TGAAGTTGTT TGACAATGTT ATGAAGGTAT TCAAGAGGTT AAGCCGGGAA
TTGAACACAA ACCGAACTGT TACGGCGACA AATTGCCTTA CTGGAAAGGA TATAACACTC
CTTGCGGATA TTCCGGAGTA TTTCCCATCG CGTAATCTGC CGGCTTCTTA TCATTATGTG
GGACCATTAA CCTGGAAAAG TGTTCTTGCT CCCCCGGCAT GGTGGCCGCT TGATATTCCT
TCATCTCCGC TGGTTTATGT GACAATGGGC ACAACGGGAG TTTCCGAATT TTTTTCAAAA
CTTGGCCCAA GTCTCTCTAC ATCTTTTTTT TCGTCAATTG TAACCACTGG TGGCCAATCA
TCAGAGCTCA AGCCGATGCC AGGAAAGGTT TATGTGGAAA GTTACCTCGA TGGCGATCTG
GTCATGGAGC GTAGTGATGT TGTAATTTGT CATGGAGGCA ATGGCACCAT ATACCAGGCT
CTTTCTCACG GCAAGCCAGT GATCGGCATT CCAACCATAC CTGATCAAAA ATTCAATATG
CGCCGTGTTG AGGCAATGGG ATTTGGAAAG TCACTTGACT TAAAACAATT TTTGGAAAAG
CCATCATTGC TTGCTGACAC GGTTAAACAA GTACTGTCTG ATCATTCGTT CCGAAATAGT
GCCCAAAAAA TTCAAGCTGT CCTGAAATCT TATAATGCTG CAACCACCAG CGCCAAAATT
CTCATTGATA GCATTTTATA G
 
Protein sequence
MRILAFPYTH TLSHLSRVLA VALELRRMGH EVVFAGESAK VSFVSQQGFD VVPIHEPDPE 
MLFGNIRSGK LRFVEDAELL QMLTADIEVI RSLKPDLVLS DGRFSAPLST HLTNVRHAAI
VNASSTEYRA LPYVPFFDWM PPWLISRDAM IWKALVRLNL FLEMKLFDNV MKVFKRLSRE
LNTNRTVTAT NCLTGKDITL LADIPEYFPS RNLPASYHYV GPLTWKSVLA PPAWWPLDIP
SSPLVYVTMG TTGVSEFFSK LGPSLSTSFF SSIVTTGGQS SELKPMPGKV YVESYLDGDL
VMERSDVVIC HGGNGTIYQA LSHGKPVIGI PTIPDQKFNM RRVEAMGFGK SLDLKQFLEK
PSLLADTVKQ VLSDHSFRNS AQKIQAVLKS YNAATTSAKI LIDSIL
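
The sequences above are printed in groups of ten characters, so they need to be joined before any length or composition check. The following is a small sketch of how that might be done, along with a GC fraction calculation comparable to the GC content field. The helper names are hypothetical, and the demo input is only the first line of the gene sequence, so its GC fraction is not expected to match the value reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped sequence block by removing spaces and line breaks."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence block, used only as a short demo input.
demo = "ATGCGAATTC TTGCCTTCCC CTATACTCAT ACACTGTCTC ATCTCAGCCG CGTTCTGGCA"
seq = clean_sequence(demo)
print(len(seq))                    # six groups of ten bases
print(f"{gc_fraction(seq):.0%}")   # GC fraction of the demo line only
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` would give a figure to compare against the GC content field.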