Gene GSU1957 details


Gene Information

Locus tag: GSU1957
Symbol:
ID: 2688286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 2146106
End bp: 2147206
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 50%
IMG OID: 637126648
Product: glycosyl transferase, group 1 family protein
Protein accession: NP_953006
Protein GI: 39997055
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
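
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 2146106
END_BP = 2147206
GENE_LENGTH_BP = 1101
PROTEIN_LENGTH_AA = 366


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 2147206 - 2146106 + 1 = 1101 bp, and 1101 / 3 - 1 = 366 aa.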


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTG CGTATGTTAT AGATTACTTG TACAGCGTAA ACGGAGGGAC TGAACGACAG 
CTCTATATGC TCATAGAGGG TATGGTGTCC CGTGGACATA CTGTTGATCT TTATGTGTTC
AGAGATACGG AATTTACTAA AAATCTGCCG GATTTTCCCT GCCCGGTTCA TTGCTTGAAT
GTTGAGTCGG TTCTGTCGCC AGGCGGGCTT ATTCGACTGA TTCAGTTCAG AAAGCGGATC
ATCGCCGATA ACGTTGATGT TCTGCATGGT TTTTTCAATG ATGTTGCCCT GTCATTACCG
CCGTTGATGC TGGGTTCAAA CGTAAAGACT TTCACCTCGC GAAGAGACAT GGGGATATGG
TATTCCCCGG CCAAGTTGCT GTTTTTAAGA CTTTTCAGGT TTTCAAGCAT CCGGTTGATC
TGCAATAGCA TTGCGGTGGC GAAGTTTACG TTCGAGCAGG AATGGAAGGC CAAGGAGTCG
ATACGGGTGA TATACAACGG CATGAATCGC TTCAATGTCG ACCCTTCGTC CTGCTCCTGT
GACTGGGCTC CGGAGAAAGG AAAGAACATC AACATCATCC TCGTCGCCAA CGTGCGGCCC
GTCAAGCGTG TAGAAGATCT CATCAGGGCG GCGAGCCTGA TCGTCGAGCA TGGCTACCAT
CCTCAGTACT ATGTCGTCGG GCATCTGCAG AGTGACGGTT ACACAGACTC TCTCCGAGAG
TTGCTGAAAC GGCATCACCT TGAAGCGGAT TTCCATTTTA CCGGTCCCGT ATCTGAGCCA
CGGGGAGCTT TGGAAAGATT TGATATCGGG GTGCTGACCT CAGCCTCTGA GGGGTTCTCC
AATACACTGA TGGAATATCT CGACGCCGGC TTGCCGGTCG TAGCTTCAAA GGTCGGAGGC
AATCCGGAAT TGGTGGACGA CGGAGAGACC GGTTTTCTGT ACGAAGCCGG AGATGTGAAT
GCCCTGGCCG ACTGCATTCT CAAGCTTATT GCAGATGACC GGACGAGGAA CCTGTTTGCC
GCCAATGCGA AACAGATGAT AACCCGCTTT GACCGGGCAA CGATGATTGA GTCCCACGAA
AAGGAATATC TGCGTGCTTA G
 
Protein sequence
MKIAYVIDYL YSVNGGTERQ LYMLIEGMVS RGHTVDLYVF RDTEFTKNLP DFPCPVHCLN 
VESVLSPGGL IRLIQFRKRI IADNVDVLHG FFNDVALSLP PLMLGSNVKT FTSRRDMGIW
YSPAKLLFLR LFRFSSIRLI CNSIAVAKFT FEQEWKAKES IRVIYNGMNR FNVDPSSCSC
DWAPEKGKNI NIILVANVRP VKRVEDLIRA ASLIVEHGYH PQYYVVGHLQ SDGYTDSLRE
LLKRHHLEAD FHFTGPVSEP RGALERFDIG VLTSASEGFS NTLMEYLDAG LPVVASKVGG
NPELVDDGET GFLYEAGDVN ALADCILKLI ADDRTRNLFA ANAKQMITRF DRATMIESHE
KEYLRA
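
The sequences are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that, compute GC content, and translate the nucleotide sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons). The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the codon-to-amino-acid
# assignments are identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    seq = clean_sequence(text)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAAAATTG CGTATGTTAT AGATTACTTG"
print(translate(first_blocks))  # MKIAYVIDYL
```

The translated fragment matches the first block of the protein sequence listed above. Passing the full gene sequence to `translate` should give the listed protein followed by `*` for the terminal TAG codon.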