Gene GSU0302 details

Gene Information

Locus tag: GSU0302
Symbol:
ID: 2686915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 331170
End bp: 333134
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 64%
IMG OID: 637124968
Product: hypothetical protein
Protein accession: NP_951362
Protein GI: 39995411
COG category:
COG ID:
TIGRFAM ID:
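
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(331170, 333134)
print(length_bp)                  # 1965
print(protein_length(length_bp))  # 654
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.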


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCATC GCCTCCACCT GGAACAGTTC GGCCCCGTCC ATGCCCTCCC CGTTCTCCAC 
TATCGCCTGG AGTTCGCACA TCTCGTCCGC GAGGCAGTGC GCCGCGTCAA GCCCGACTGC
ATCGCCATCG AGTTGCCGTC CACGATCGAG GCGCCGTTCC TGCGGGCAGT GGAGCGCCTG
CCCGAGATAT CGGTCATCCA CTACGAAGGT CGGCAGCGCC GCGATGGCGC CGAATCGGTC
TACCTCCTGG TGGAACCGGC CGATCCCCTG GTCGAAGGGG CGAGACTGGC CCTGGAGCGG
CGGATTCCCC TGCGGCTCGT GGATGTGGAC ACCGACTCCT ATCCCCGCCA TGTGGAAGCG
CTTCCCGACT CCTACGCCAT TCACCGGATT GGCCTCACCC CCTACTATGA GGAGTACCGG
CGTGCGGCCG CCTCTGTGGC GCCGGGCCGG GAGGATCGGC GGCGCGAGCG GGGAATGGCC
TGGCGGCTCC AGGAACTGGC AAAAGAGCAT GGCAGCATTC TCTTTGTCTG TGGCATGTAC
CATCTGGAGC GGATCAAGGA CGATTTCGGG CGGCCCCAGG CCGCTCCCCT GGAGAGGGTG
CGGCGCCAGG GGGTGAGGCT GTTCAATCTG CATCCCGACT CCTGCCGGGA GATCCTCGAC
GAGTTTCCCT TTATTTCGGC GGTCCATGAA CTGCGTCGTG GTCCTCTGCC GCCTGAACCC
GACGATCGCG GCGAGACGCT CCGCAAGCGA TTCAGTGCGT TCGAGCTAAT TGTCGGCGGG
AGGAAGGACC TCCCGGCGGA GGAGCTTTTG CGCCATGCGG TAGAGCGGGG TGCCCGGCAT
GCGGGGCGGG GGGAGGAGTT CCCCGACCGG CAGCGGATCA TCTTCCGGCT CTTCCAGGAG
GCGGCCCGCC ACTACCGGCA GGAGACGGGC GACCCGGTCC ACCTCTGGCA GAAGCGGGCC
TTTTTCCGTT TTGCCCGCAA TTATGCCCTT GCCTCGGGCG CACTCCTGCC CGATCTGTTT
CAGCTGCTCA TGGCGGCACG GGGGTGTGTG GACGATAACT TCGCCTACGC CCTGTGGCGC
CTTGCCACCT TCTACCACTG GCAGCGGGCC GAGGCAGACA TCCCGACCAT CAGTATCTCT
CCCGAGGAAA TCTGGGGCGG GAGCCGCCGC ATCCGTTTCC GCCCACGGGA GCGGCGCCGG
AAAGGGCTGT CGCATTTGGG CTTTCTCAAG CGCAAGAAGG AAAAGCGCCC CGGAGAATGG
CTCGAAGGAT TCACTGACCC GAGCATCTGC TCCTATCCGC CCGAGGATGT GCTGATTGAG
GAGTATGGCC GCTTTCTCAA GAAAAAAGGG GCCATGCAGC TTTCCGAGGA ACTCTCCCGT
ACGGAGCCGT TCACCTCGTC ACTTCTGGAT GGGATCGATC TGCGGGAAAC ACTGCGCAAC
GTTGCCGACG GGCGGGTCTA TGTTCGGGAA AGCCAGCGAG CCAAGGGGGG CGTGGGCTCG
GTGGTCGTCA TCTTCGACGA AGACCGGGAA AACGGTAACT ATCCCTACCG CACGACCTGG
CTGGGCGAGC ACGAGCAGGA ATCAGACATG GCCTTCTATG CGACGCCGCC GGAGGACAAC
ATCGTCGGTC CCGGCATCTG CCGCTGCGAG TACGGCGGGT TTCTTCTCTC CTACCCGCCG
CGCCGGATGA TGGATGTCTG GCGCGACCCG GACTATGTCT TTGCCCGGTC AAAGCCGGAG
GTGCTTCTCC TGGCCGCTCT CGACTATTCG CCCGAAAAGC ATGTGGTCCA CGTGGCTGCC
CGGCCACCCC GCAGCATCTT CCGGCAGATC GCCGCACGAA TGGGGAAAAA GATTGTCCAC
ATCCCCTTGG GCTCCCTTTC GTCGGTAAAG CTCAAATCCA TCCGGGTGCT TCATATCCTG
CACGGTCACG ACAAGCGTCA GGTGGCCAAG GACTACATCT GGTGA
 
Protein sequence
MPHRLHLEQF GPVHALPVLH YRLEFAHLVR EAVRRVKPDC IAIELPSTIE APFLRAVERL 
PEISVIHYEG RQRRDGAESV YLLVEPADPL VEGARLALER RIPLRLVDVD TDSYPRHVEA
LPDSYAIHRI GLTPYYEEYR RAAASVAPGR EDRRRERGMA WRLQELAKEH GSILFVCGMY
HLERIKDDFG RPQAAPLERV RRQGVRLFNL HPDSCREILD EFPFISAVHE LRRGPLPPEP
DDRGETLRKR FSAFELIVGG RKDLPAEELL RHAVERGARH AGRGEEFPDR QRIIFRLFQE
AARHYRQETG DPVHLWQKRA FFRFARNYAL ASGALLPDLF QLLMAARGCV DDNFAYALWR
LATFYHWQRA EADIPTISIS PEEIWGGSRR IRFRPRERRR KGLSHLGFLK RKKEKRPGEW
LEGFTDPSIC SYPPEDVLIE EYGRFLKKKG AMQLSEELSR TEPFTSSLLD GIDLRETLRN
VADGRVYVRE SQRAKGGVGS VVVIFDEDRE NGNYPYRTTW LGEHEQESDM AFYATPPEDN
IVGPGICRCE YGGFLLSYPP RRMMDVWRDP DYVFARSKPE VLLLAALDYS PEKHVVHVAA
RPPRSIFRQI AARMGKKIVH IPLGSLSSVK LKSIRVLHIL HGHDKRQVAK DYIW
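
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, and the GC content is the fraction of G and C bases. A minimal sketch follows, using only the first and last lines of the gene sequence above as input. The helper names are illustrative, the amino-acid string is the standard NCBI listing for table 11 with codons enumerated over T, C, A, G, and alternative start codons are not handled.

```python
from itertools import product

# Amino acids for NCBI translation table 11, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCCATC GCCTCCACCT GGAACAGTTC GGCCCCGTCC ATGCCCTCCC CGTTCTCCAC"
last_line = "CACGGTCACG ACAAGCGTCA GGTGGCCAAG GACTACATCT GGTGA"

print(translate(first_line))  # MPHRLHLEQFGPVHALPVLH
print(translate(last_line))   # HGHDKRQVAKDYIW
print(round(gc_content(first_line), 2))
```

The two translated fragments match the start and end of the protein sequence listed above. Reproducing the 64% GC content figure would require passing the full 1965 bp sequence to `gc_content`.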