Gene GSU1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1842 
Symbol 
ID2688629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2008304 
End bp2010649 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content62% 
IMG OID637126532 
Productpolysaccharide biosynthesis/export domain-containing protein 
Protein accessionNP_952892 
Protein GI39996941 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.749239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTATA ACGCAGATCG CAAGCTGGTG CTTTCCTCCG AGGAGGTAAG GGTGATACCC 
ACGGCCCCGG ATGCGTCCGC TTCGTCCCTG GAGCGTGCCT TTGCCCGCCA GAGCCCGACG
CTGCTGGACA AGCTCGAACC GACTCCCCTC AACAGATCGC TCCGTCAGTT CGGGTATGAG
TTTTTTAAAA ACAGTCTCGC AAGTCTGGCC GTAACTGAAA ATCTGCCCGT GGGCCCCGAC
TATGTTGTTG GTCCGGGCGA TTCCATCAGA ATTGATGTGT GGGGGAGCTT CACCGCCCGC
TATGAGCCGA CTGTGGACCG CAACGGCGAG ATCCTGATCC CGCGCATTGG TCCGGTAAAG
CTCTGGGGGC TGACCTACGG CCAGGCTCGG GAAGCCATAG ACAAAGCACT GGCCCGCTAC
TACAAGGGGT ACGAGCTGAA CGTGACCCTG GGGAGCCTCA GGACCATCCA GGTTTACGTG
GTGGGTGAGG TGGAAACGCC CGGGGTGTAC AGTGTCAGCT CCCTGGCCAC GGTGGTCAAT
GCCCTGGCCG CCGCCGGGGG GCCTTCGAAG AACGGGAGCC TGCGGTCGAT CAGGGTGTCG
AGACCGGGAG TCGAACCCCG TTCGATCGAT CTGTACGACA TGTTTCTCAC TGGTGACCGC
AGCAATGACG TGAGGCTGCA AAACGGCGAC ACGGTATTTG TGCCGGTGAT CGGGCCGGTT
GCTGCCGTGG CGGGCGAAGT GCGGCGTCCC GGCATCTACG AGTTGAAGGG CGCGACGCCG
CTCGCCCGGC TGGTGGCCAT GGCGGGGGGG ATCACTGCAG CGGGAGATAC GGGGCGTATC
CAACTGGAGC GGATCGAGGG GAACAGCGCA CGGATAGTGC TCGATTATGA GCCCAAAGGG
GGCGACCTGG AGGCCGAACT CGCCCGGGTG GAACTGAAGG ACCGTGACAT GGTCACCGTG
TTCCCGGTGT TCGATGCCGT GCGCAAGGTG GTCACCCTCA CGGGGAACGT GACGCGGCCC
GGTGCCTATC AGTTGAAGGA GGGAATGCGG GTCAGGGATA TTCTTCCTGA TCCGTCGGTC
CTTCTCCCCG AATCGTATCT GGAGTCGGCC GAGATTACCC GTCTTGCTCT TCCCGAGTAC
CGCCGGGAAG TGGTCACCTT CAATCTGCGG GCCGCCATGC AGGGGGATCC CAAGGAGAAC
CTTCCCCTTC AGGAGCAGGA CACGGTGCGG GTGTTCTCCC GGGCCGAGAT GATTGAGAAG
CATACGGTGT CCATCAGCGG GGCGGTGCTC AATCCGGGCA GCTATGAGTA TTTTCCGCGC
ATGACGGTGC GCGACCTGGT GACCGTCGCC GGCAGTCCCA AGCGAAATGC TTTTCTGGGC
AGTGCGGAGC TGACCAGGAT CAATGTCAAC GGCGACGGAG CGCGGGCGAG TAGGCAGGAT
ATCAATCTCG AAAAGGCCAT GGCGGGCGAT CCCGATCATA ACCTCCCCCT GCAGACCGAC
GACGTCCTCA TCGTCCGGAG CATCGAGAAC TGGCTTGAGG CCAGCGACCG GTTCGTGACC
CTGCGGGGCG AGGTGAAGTT TCCCGGCACC TATTCCATTG CCAAGGGAGA ACGACTGAGT
TCGGTCATTG CCCGGGCCGG GGGCTATACC GAACATTCCT GCCTGAAGGG GGCCAAGTTC
ACCCGCCGGT CGGTCAGAGA GGAGCAGCAG CGCCGCATGG ACGAGGTGAT CGCCCGGACC
GAGCAGGATG TCTACCGCAA GCAGGCGGAA CTCTCGTCGG TGGCTACATC ACGCGAGGAG
CTGGAGGCCA CCAAGGCCGC CCTCGACGGT CTGTTGCGGA GTCTGGCGAA GCTGAAGACC
ACCAGGGCCG AGGGGCGGGT GGTGATCCGG CTGGCGCCGG TCGAGGCGCT TGCCCACAGC
TCCTATGACC TTGAACTGGA AGGCGGGGAT GAACTCTCCA TACCCCCAAC CCCGAGCGTG
GTGTCGGTCA TGGGGTCCGT CTACAACCCC ACGTCGTTCC TGCACATTTC AGAGCGGGAC
GTGGCCTATT ACCTGGAACG TTCCGGCGGG GCTACCCGTG ATGCCGAATT GGATGACATG
TACATTATCA AGGCCGACGG ATCGGTCTTC AGCAGGCAGC AATCGTCCTT CGGCATTCGC
TGGGACGACT ATGCCCGGCG CTGGACGTTC GGCGGATTCC TGTCGTCCCC CCTGGAGCCC
GGAGATACGC TCGTGGTGCC GCAGAAGCTC GAGCGAACCG CCTGGATGCG CGAGATCAAG
GACATAACAA CCATCCTGTC GCAGGTAGCC ATAACCGCCG GCGTGATCAT CGCGGCAGGT
CTCTAG
 
Protein sequence
MGYNADRKLV LSSEEVRVIP TAPDASASSL ERAFARQSPT LLDKLEPTPL NRSLRQFGYE 
FFKNSLASLA VTENLPVGPD YVVGPGDSIR IDVWGSFTAR YEPTVDRNGE ILIPRIGPVK
LWGLTYGQAR EAIDKALARY YKGYELNVTL GSLRTIQVYV VGEVETPGVY SVSSLATVVN
ALAAAGGPSK NGSLRSIRVS RPGVEPRSID LYDMFLTGDR SNDVRLQNGD TVFVPVIGPV
AAVAGEVRRP GIYELKGATP LARLVAMAGG ITAAGDTGRI QLERIEGNSA RIVLDYEPKG
GDLEAELARV ELKDRDMVTV FPVFDAVRKV VTLTGNVTRP GAYQLKEGMR VRDILPDPSV
LLPESYLESA EITRLALPEY RREVVTFNLR AAMQGDPKEN LPLQEQDTVR VFSRAEMIEK
HTVSISGAVL NPGSYEYFPR MTVRDLVTVA GSPKRNAFLG SAELTRINVN GDGARASRQD
INLEKAMAGD PDHNLPLQTD DVLIVRSIEN WLEASDRFVT LRGEVKFPGT YSIAKGERLS
SVIARAGGYT EHSCLKGAKF TRRSVREEQQ RRMDEVIART EQDVYRKQAE LSSVATSREE
LEATKAALDG LLRSLAKLKT TRAEGRVVIR LAPVEALAHS SYDLELEGGD ELSIPPTPSV
VSVMGSVYNP TSFLHISERD VAYYLERSGG ATRDAELDDM YIIKADGSVF SRQQSSFGIR
WDDYARRWTF GGFLSSPLEP GDTLVVPQKL ERTAWMREIK DITTILSQVA ITAGVIIAAG
L