Gene GSU3144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3144 
Symbol 
ID2688423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3451680 
End bp3452888 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content64% 
IMG OID637127837 
Producthypothetical protein 
Protein accessionNP_954185 
Protein GI39998234 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGCTC TACGCTGCGT CGTCGGCCCT GAATCGGTGA GAATGCTGCA GTTGGGGCAT 
CCCTGGATCA TAGCCGACGC CTATACGAAA AAGTGGCCCG CCGGCTCGGC CGGCGATCTG
GTGGAACTGG TGGATGCCTC GGGGCGTTTT CTCGCAACGG CCCTGCTGGA CCCGGGCGAG
CGCATTGTGG CGCGGGTACT GGGAGGGAAA GGACTCCGCC TTGGAGAAAG GTGGCTGGCA
CAACGGTTGG AAGACGCCCT GGAGCTGCGG CGGTCCCATG TCCCCCTTGA GGAGACCGAT
GCCTACCGGC TTGTGAACGG CGAAGGGGAC GGTCTGCCGG GCATCACGGT GGACCGCTAC
GGCGACTACC TGATGGTTCA GCTTTACTGC GGGGGGTGGC GACCGCACCT GCCGGAACTG
ACCCGGGTGC TGGCCGGGGC CCTGAAACCG GCCGGCATCT ATGAAAAAAC CCGCCCCCGC
AACACGCGGG AGCTGGAAGC GGTTAGCGAC ACCAAGCGTT ATGGGCGGCT CCTGGCCGGG
ACATCCGCAC CGCCACGGCT GCCCGTGCGG GAAAACGGTC TGACCTTTCT GGTCGAACTG
GAGCGGGGTC TGAACACCGG CCTTTTCCTG GACCAGCGCG CCAACCGCCG CCAGTTAATG
GCCCGGACCG CCGGCAAGCG GGTGCTCAAT CTCTTCGCCT ATACCGGCGC CTTTTCCGTG
GCAGCCGCGG CGGGCGGAGC GAGCCGCGTG ACAAGCGTGG ATGCCTCTCC CACCTACACC
GACTGGGCAA AGGCCCATTT CGAGGCAAAC CGCCTCAATC CCAAGCGCCA CGAATTCATT
GTGGGCGACT GCATGGACAC CCTGGCGGAG CTCGCCCGCC GGCAGGAGCG TTTCGACATC
GTCCTGATGG ACCCCCCTTC CTTCTCAACC ACGTCGAAAA GCCGCTTCAC CACGCGCGGC
GGCACCTCTG ACCTTGTTGC CGCTTCGCTT CCCCTGCTGG CGGAAGGTGG GCTCCTGATC
ACCTCATCCA ACCACCAGAA GGTGGATATG GCCGACTACC TCAAGGAATT GCGCCGCGGA
GCGCTCCAGG CAGGCAGCAC CGTGCGCGTG ATCTCCCTGG CCGGCCAGCC TGAAGATTTC
CCCTACCCGG TCACTTTTCC CGAAGGGCGC TATCTGAAAT ATGTCGTCAG TGTCAAGGGA
AGGACGTAA
 
Protein sequence
MGALRCVVGP ESVRMLQLGH PWIIADAYTK KWPAGSAGDL VELVDASGRF LATALLDPGE 
RIVARVLGGK GLRLGERWLA QRLEDALELR RSHVPLEETD AYRLVNGEGD GLPGITVDRY
GDYLMVQLYC GGWRPHLPEL TRVLAGALKP AGIYEKTRPR NTRELEAVSD TKRYGRLLAG
TSAPPRLPVR ENGLTFLVEL ERGLNTGLFL DQRANRRQLM ARTAGKRVLN LFAYTGAFSV
AAAAGGASRV TSVDASPTYT DWAKAHFEAN RLNPKRHEFI VGDCMDTLAE LARRQERFDI
VLMDPPSFST TSKSRFTTRG GTSDLVAASL PLLAEGGLLI TSSNHQKVDM ADYLKELRRG
ALQAGSTVRV ISLAGQPEDF PYPVTFPEGR YLKYVVSVKG RT