Gene GSU0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0133 
Symbol 
ID2688035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp147502 
End bp148668 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content62% 
IMG OID637124800 
Producthypothetical protein 
Protein accessionNP_951195 
Protein GI39995244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.957824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATTC TTTCAACAAC AACAGCCATA TGCCAGTTCC GGGTCGCGGG GGATCTCCCT 
GCCGGTGACC TCTATCCCTG GATTGCCGAA CATCTCGCCC GACAGGCCTT TCAGTCCATC
GATCAAGGGG TCGCCGAGCA GTCCGTGGGG TGGGTTCATC TGGATGACCA TCGGCAGATG
AGCTTCGACA TCCCCGCCGC CTTCTGGCGT GACCATTACG TGGCCTTCAC CCTGCGCCGC
GACCAGCGCA AGCTTCCGGC GGCGCTGGTA AAGGCCTATC TTCAGGTGGC CGAGCATGAG
TACCTCTCGG CCCATCCCGG CCTCAACCGG GTGCCCAAGC AGAAGCGCGA GGAGCTGAAA
GAGGCGGTGC GCCTCAACCT CCTGGCCAAG ACCTTGCCGG TTCCCTCCAC CTGGGATGCG
GTCTGGGACA CCCGCACCGG TATCGTCACC TTTACCTCCC TGTCGGCTCC CATTATCGAG
CTGTTCGAGA CCCAGTTCAA GAAGACCTTC GAGGGGACGC GCCTGGTGGC GATCCATCCC
TATGCCCGGG CAGAGGCCGT GGGAGGCGAA GGGCTGAAGC CTGCCCTCGA ACAGGCCAAC
CTCGCCACGA GCGATGCCGC CATCGATCTG ATCAGGAGCA ACCAGTGGCT CGGGTGGGAT
TTCCTCCTCT GGCTTCTCCA CCGGACCATG ACCGATTCTT CCGAGTACTG CGTGGGGCAG
CCCGGCCCGG CTCTGGCAGG CGAGCCCTTC GTGGCTTACC TGAACGATCG CCTGGTCCTC
GTGAGCGCCG GGGAGGCAGG AACCCAGAAA ATCACCGTGG CCGGTCCCCA GGACCACTTC
CGAGAAGCCC GCACTGCCCT TGCCCACGGC AAGCGGATCA CCGAATCGAC TCTCTACCTG
GAAAAGGAGG AGCATGTCTG GAAGTTGACC CTCAAGGGGG AACTCTTCCA TTTTGCTTCC
CTCAAGTCCC CCAAGGTGGC CATCGAAAAG GGCGAGCACG TGGACGAAGG GAGCGAACGG
GAAGCGGCCT TTTACGAGCG GATGTATGTT CTGGAACAGG GACTCCAGCT CTTCGACAGC
CTGTACGGCG AATTCCTCAC GGTGCGTCTG GGTGCCGGAT GGGGCGAAGA ACTGGCTCGG
ATCGAAGGGT GGCTGGCAGG GGAGTAA
 
Protein sequence
MGILSTTTAI CQFRVAGDLP AGDLYPWIAE HLARQAFQSI DQGVAEQSVG WVHLDDHRQM 
SFDIPAAFWR DHYVAFTLRR DQRKLPAALV KAYLQVAEHE YLSAHPGLNR VPKQKREELK
EAVRLNLLAK TLPVPSTWDA VWDTRTGIVT FTSLSAPIIE LFETQFKKTF EGTRLVAIHP
YARAEAVGGE GLKPALEQAN LATSDAAIDL IRSNQWLGWD FLLWLLHRTM TDSSEYCVGQ
PGPALAGEPF VAYLNDRLVL VSAGEAGTQK ITVAGPQDHF REARTALAHG KRITESTLYL
EKEEHVWKLT LKGELFHFAS LKSPKVAIEK GEHVDEGSER EAAFYERMYV LEQGLQLFDS
LYGEFLTVRL GAGWGEELAR IEGWLAGE