Gene GSU1454 details


Gene Information

Locus tag: GSU1454
Symbol:
ID: 2687763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 1592452
End bp: 1593870
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 56%
IMG OID: 637126128
Product: glycosyl transferase, group 2 family protein
Protein accession: NP_952505
Protein GI: 39996554
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCATA ACATCGTTGC CTTCCTGCTG GCGATCCAGC CGGTGATCTT CATTTATTTC 
ATCGTGCTGA ACGGGTTCTA CACCCTGTTC ACCATCATCT CCCTGCGTGA CATCCGCAAC
TATCTGAATG CCGTCACGAG CCAGAGCATC GACAACGTGC TCAACGGGAT GTTCTATCGG
CCCCTGTCTA TCCTCGTGCC CGCCTACAAC GAGGAAAAGA CCATCGTTTC CTCGGTGAAG
TCGCTGCTCG CCCTGCGGTA TCCCGAGTAC GAAGTCATCG TCATCAACGA CGGGTCCACC
GACGGCACTC TTGAAAGTCT CATCAACGAA TTCCGTCTTG TCCGGATCGA CAAGCCCATC
AGCCTGCATG TTCCCCATCG GCCGATCATT GCCAAGTACG TTTCCGTCGA CCATCCCTAC
CTCTTCGTGC TGGACAAGGA GAACGGCGGC AAGGCCGATG CCCTCAACGC GGGGATAAAC
GCTTCCCAGT TTCCCCTGTT CTGCTCGATC GATGCCGACT CCGTGCTTGA GGACGATGCG
CTCATCCGTG CGACGCGGCT TTTCGTGGAA GACCGGGAGG TCGTTGCCAC CGGCGGTATC
GTGCGGGTCC TTAACGGTTG CGAGGTGGAG GACGGCATCG TCAAGGGGGT CCGCGCTCCC
CGGGGGATGC TGGAGTGCTT CCAGACTGTG GAGTACACCA AGGGGTTCCT CTCGGGGCGG
ACATCGTGGA ATTACTTCAG GAGCTTGCTG ATCATCTCGG GCGCCTTCGG CATCTTCCGC
AAGGACATGG TGATGGCCGT CAAAGGGTAC CGCGAGAGCG TGGGCGAGGA CATGGACTTG
GTGGTGCGGC TCCACCGCCA TTGCCGCCAG AACCGGATTC GCTACAAGGT AGTGTTTGTG
CCCGATCCTG TCTGTTGGAC CCAGGTTCCT TCGGATATGG CATCGCTCCT CAAACAGCGC
AATCGCTGGC ACCGTGGCCT CATTGACAGC CTTTGGCACA ACAGGGGCAT GTTCCTCAAC
CCCCGCTACG GTACGGTCGG ACTGTTCGGG TTCCCCTATT TTGTTACGGT TGAACTACTC
GGGCCGGCAG TTGAATTTAT TGGCTATTTC GGCTTCGTGC TTCTCTTTTT CCTGGGTCAG
GTGAATCGCG AGTTTGCAAT TCTCTTTTTC CTCCTGGCTG TTCTCTGGGG AACTTGGATT
AATCTCGGCT CCATCTTTCT CGATAACCTC ATTTACAAGC GCTACAAGGG GTTGGGCGAC
GTCCTGAAGC TTTGCCTCTT CGGTCTACTC GAATTTTTCG GGTACCGGCA GATCATCGTG
GTTGAACGGC TCATCGCCAC GTTCATGTTC TGGAAAAAGG GGTGGGGCAA GCCCAAGCGA
AAGGAGATCG ATGGTGAAGT GTCTGGTTCG GTTGCCTAG
 
Protein sequence
MLHNIVAFLL AIQPVIFIYF IVLNGFYTLF TIISLRDIRN YLNAVTSQSI DNVLNGMFYR 
PLSILVPAYN EEKTIVSSVK SLLALRYPEY EVIVINDGST DGTLESLINE FRLVRIDKPI
SLHVPHRPII AKYVSVDHPY LFVLDKENGG KADALNAGIN ASQFPLFCSI DADSVLEDDA
LIRATRLFVE DREVVATGGI VRVLNGCEVE DGIVKGVRAP RGMLECFQTV EYTKGFLSGR
TSWNYFRSLL IISGAFGIFR KDMVMAVKGY RESVGEDMDL VVRLHRHCRQ NRIRYKVVFV
PDPVCWTQVP SDMASLLKQR NRWHRGLIDS LWHNRGMFLN PRYGTVGLFG FPYFVTVELL
GPAVEFIGYF GFVLLFFLGQ VNREFAILFF LLAVLWGTWI NLGSIFLDNL IYKRYKGLGD
VLKLCLFGLL EFFGYRQIIV VERLIATFMF WKKGWGKPKR KEIDGEVSGS VA
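
Consistency check (illustrative sketch)

The lengths listed under Gene Information can be cross-checked against the coordinates with a few lines of Python. This is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the stated gene length, and that the final codon is a stop codon that is not counted in the protein length. The helper name join_blocks is hypothetical and only strips the spaces that group the residues into blocks of ten.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(text.split())


# Coordinates as listed in the Gene Information section.
start_bp = 1592452
end_bp = 1593870

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

# First and last lines of the gene sequence, copied from the Sequence section.
first_line = join_blocks(
    "ATGCTTCATA ACATCGTTGC CTTCCTGCTG GCGATCCAGC CGGTGATCTT CATTTATTTC"
)
last_line = join_blocks("AAGGAGATCG ATGGTGAAGT GTCTGGTTCG GTTGCCTAG")

print("gene length (bp):", gene_length)
print("protein length (aa):", protein_length)
print("starts with ATG:", first_line.startswith("ATG"))
print("ends with stop codon TAG:", last_line.endswith("TAG"))
```

Both computed lengths agree with the Gene Length and Protein Length fields above, and the sequence begins with ATG and ends with TAG, a stop codon in translation table 11.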