Gene GSU1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1149 
Symbol 
ID2685486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1241046 
End bp1242281 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content58% 
IMG OID637125823 
ProductEAL domain-containing protein 
Protein accessionNP_952202 
Protein GI39996251 
COG category[T] Signal transduction mechanisms 
COG ID[COG3434] Predicted signal transduction protein containing EAL and modified HD-GYP domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAAG AAAAGTTTTT TCTGGGCAGA CAACCAATCC TTGATCGGGA GCAACGGCTC 
TACGGCTTCG AACTTCTGTT CCGGTCAGCC GACTCTCTCC ATGCCAATGT GACCGACTAT
CTCCAGGCCA GCGCCAGCGT CATTTTCGAT GCGCTCTCCA GTTTCGGCTT CCGCGAGATT
CTCGGCAAGC ACAAGGGGTT TATCAACGTC AACGCCGATG TCCTCATGAG CGAGGCACTG
GAGCTTCTTC CTCCCGAAAA GGTGGTCATC GAGCTTCTGG AGCATGTGCC GATTACCGAA
ACCGTTATCA GCCGTTGCCA CGAGTTGCGA GAGAAGGGTT TTTCCCTGGC CCTTGACGAC
CATATTTACG AACCGGTTTA TGAACCACTC TACCATCTCG TCGATGTGAT CAAGGTTGAC
CTGTTCAGAA CCGGCATGGA AAACCTTGCC TCGGAAGTGG AAACCCTGAG GAAGTGGAAC
ACCAAGCTCC TGGCCGAAAA GGTGGAGACG GTGGAACAGT ACGAGCGATG CGCCGACCTC
GGCTTTACCT ATTTCCAGGG CTACTATTTC GCCCGCCCGG TGGTGCTCAC CCGCACCCGT
GTCGACGTGG GGCGGCTGGC TATCGTCAAG CTTTTCAACC AGTTGGTGGC GGAGGTTGAA
CTGGGCGAAC TGGAGGAGAC CTTCAAGCAG AACCCGAATC TGGTCCTCAA TCTCCTGCGA
CTCGTTAATT CCGTCGCCGT TGGCCTTAAG GACAGGATCA CCTCGCTTCG CCACGCGATC
ATGGTGCTCG GCTACCACCA GTTGCGGCGC TGGGCCATGA TGGCGCTCTT CGCCAACAAC
GCCCAGGGGG GAGGGGACAA CCCTCTGCTG GTCATGGCTG CTACCCGTGC CCGTCTCATG
GAACTCATCA TCTCGGAGCA GCCCGCTGCC CGGCTGGATC GGGACTACCC CGACCGCGCC
TTCATGACCG GTATCCTGTC CTTGGCCGAC GCCCTGCTCA AGGTATCCAT GGACGAGGTC
CTTGCCCCTC TCAATCTGGA TAACGACGTG CGGGAAGCTC TGCTTTCCCG GACCGGGGAA
CTGGGAGAGC TCCTTACCTT GGTGGAGAAG ATCGAAAAGG ACGATTTCGA GAACATCGCT
TCCAGCTTTG AGCAGTTCCA GCTCTCCATG GAGCGTCTCG CCCCTGTCCA GATGGATGCC
TGCAACTGGG CAACCCAACT GGGCGAAGTC GCCTGA
 
Protein sequence
MAEEKFFLGR QPILDREQRL YGFELLFRSA DSLHANVTDY LQASASVIFD ALSSFGFREI 
LGKHKGFINV NADVLMSEAL ELLPPEKVVI ELLEHVPITE TVISRCHELR EKGFSLALDD
HIYEPVYEPL YHLVDVIKVD LFRTGMENLA SEVETLRKWN TKLLAEKVET VEQYERCADL
GFTYFQGYYF ARPVVLTRTR VDVGRLAIVK LFNQLVAEVE LGELEETFKQ NPNLVLNLLR
LVNSVAVGLK DRITSLRHAI MVLGYHQLRR WAMMALFANN AQGGGDNPLL VMAATRARLM
ELIISEQPAA RLDRDYPDRA FMTGILSLAD ALLKVSMDEV LAPLNLDNDV REALLSRTGE
LGELLTLVEK IEKDDFENIA SSFEQFQLSM ERLAPVQMDA CNWATQLGEV A