Gene GSU0036 details

Gene Information

Locus tag: GSU0036
Symbol:
ID: 2685741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 46259
End bp: 47314
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 64%
IMG OID: 637124698
Product: capsule biosynthesis protein, putative
Protein accession: NP_951098
Protein GI: 39995147
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
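
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 46259, 47314
reported_gene_length = 1056
reported_protein_length = 351

computed_gene = gene_length(start_bp, end_bp)
computed_protein = protein_length(computed_gene)

print(computed_gene == reported_gene_length)
print(computed_protein == reported_protein_length)
```

Under these assumptions both comparisons agree with the reported values.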


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCATCT TCCCCGCCGC ATTGACCCTA GTGCTCGTAC TGCTCGCAGT CCCTGCGGGG 
GCTCTCGCCG AGCGCATCAG CCTTTCTTTT GTGGGTGATG TGATGCTGGC CGGCAGCGCA
ACTGACACCC TCCAGCGGTA CGGCTACAGC TATCCGTTTT CCGCTACCGC CGCAGAGCTG
CGACGTAGCG ACCTGGTGGT GGGCAACCTG GAAGCCCCCC TCACCGACGG GGGGCGCGAG
TTCCGCGCCA AACGTTTCCG CTTCAAGGCT TCTCCCGTCG CGGCAGCGGC CCTGAAACGG
GCCGGCTTCT CGGTCATGAC ACTGGCCAAT AACCACATGA TGGACTTTGG CGCCGATGGG
TTGAGCGACA CGATTCACCA TCTCAATCGT AACGGCATCG CCTTCGCAGG CGCTGGGCCG
TCAATTGCTG ATGCCCGACG CGAAGCATCC GTGACGGTCA GGGGGCAGAC CGTCGCCTTC
CTGGCCTACT CGCTCACCCA GCCGATCGAA TTCTTCGCCA CCGAGGGACG TCCTGGCACC
GCACCCGGCT ACGCAGGCCA CTATCTCGCG GATATCCGAC GGGTCCGCAG CAGCGCCGAT
CATGTGGTCG TCTCCTTCCA CTGGGGACAG GAGCGCGCCG CGCTACCATC GCCCTACCAG
ATCGAAACAG CCCATCGCGC TATCGATGCA GGGGCTGACA TCGTCATCGG CCACCATCCC
CATGTCCTCC AGGGAATCGA AATCTATCGC GGCAGCCCGA TCTTTTACAG CCTCGGTAAC
TTTGCCTTCG GCAGCCGGAG CCCCTCGGCA GACCGGAGCA TCATCGCCCG GGTGACCCTC
GGCGAAGGAC CGCCGATTGT GGAGGTCATC CCTCTCAACG TTCTTTTCCG CGAGGTACGC
TTTCAACCGG CCATCCTCAC GGGCCGTAAG GCGGCGGACG TAGTGGACCG GCTGAATCGT
CTGTCAGCCC CCTTTAGCAC GGTCATCACC TCCACCGCTG GCAGCCACCT CGTCGCGCCT
GCGGAGGCTA ATGCCCGGCT TGCCCGCAGG CAGTGA
 
Protein sequence
MTIFPAALTL VLVLLAVPAG ALAERISLSF VGDVMLAGSA TDTLQRYGYS YPFSATAAEL 
RRSDLVVGNL EAPLTDGGRE FRAKRFRFKA SPVAAAALKR AGFSVMTLAN NHMMDFGADG
LSDTIHHLNR NGIAFAGAGP SIADARREAS VTVRGQTVAF LAYSLTQPIE FFATEGRPGT
APGYAGHYLA DIRRVRSSAD HVVVSFHWGQ ERAALPSPYQ IETAHRAIDA GADIVIGHHP
HVLQGIEIYR GSPIFYSLGN FAFGSRSPSA DRSIIARVTL GEGPPIVEVI PLNVLFREVR
FQPAILTGRK AADVVDRLNR LSAPFSTVIT STAGSHLVAP AEANARLARR Q
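
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction; it is an illustration only, the helper names are hypothetical, and the sample string is just the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two groups of the gene sequence above, used as a short sample.
sample = "ATGACCATCT TCCCCGCCGC"
seq = clean_sequence(sample)

print(len(seq))
print(round(gc_fraction(seq), 2))
```

Pasting the full gene sequence block into `sample` would allow the reported Gene Length and GC content fields to be compared with the computed values.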