Gene GSU3329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3329 
Symbol 
ID2687648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3657542 
End bp3658837 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content65% 
IMG OID637128023 
Productradical SAM domain-containing protein 
Protein accessionNP_954369 
Protein GI39998418 
COG category[R] General function prediction only 
COG ID[COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAAC CAACTCTGGC TGAAAAACTC GAAATCCTCG CCGACGGCGC CAAGTACGAC 
GTTTCATGCG CCTCCAGCGG CAGCTCCCGC AAAGGCAAGG GTGGGATCGG AAACGCCGCC
CAGTGCGGCA TCTGCCACTC CTGGACCGCC GACGGCCGCT GCGTATCCCT CCTGAAGATC
CTTCTCACCA ACGTCTGCAT CTATGACTGT GCCTACTGCG TGAACCGTCG CTCCAACGAC
ATCCGGCGGG TGATGCTCAC TCCCGCCGAG GTGGCGGAGC TCACCATCGG CTTTTACCGC
CGCAATGCCA TCGAAGGGCT CTTCCTCTCC ACCGGGGTCA TCAGGAATCC CGACTACACC
ATGGAGCTTC TCATCGAAGC GACCCGGCAG CTGCGCGAGG AGTACCGCTT CAACGGCTAC
ATCCACGTAA AGGTGGTGCC CGGCGCGGAC CTCGCCCTGG TGGAGCGACT CGGTCGCCAT
GCCGACCGGG TCAGTATCAA CATGGAACTC CCGTCGCGCG AGAGCCTTGC ACTCCTGGCG
CCCGACAAAA GCAGGGAGTC GATCGTGGGG CCCATGAAGC GAGTGGGGGA GCTGATCGTC
CAGACCCGGG AGGAGCGCAA GGTTTCCCGG AAGATGCCCC CCTTTGCCCC GGCCGGGCAG
AGCACCCAGC TCATCGTCGG CGCATCGGGG GAGACTGACC TGCAGATCAT TTCGCTGGCG
GCGGGGCTTT ACGGAAGGCT TTCCCTGAAG CGGGTCTACT ACTCTGCCTT CATCTCCGTG
AACCGGGACG AGCGCCTTCC CGCCGTGGTG GGCACCCCAC CGCTCGCGCG GGAACACCGC
CTCTACCAGG CCGACTGGCT CATGCGCTAC TATGGCTTCG CCGCCGGCGA GCTCCTGGAC
GAAGAGCGCC CCAACCTGGA TCTCTCGCTG GACCCCAAGG CTGGGTGGGC GCTCCGAAAT
CTTCACCTGT TCCCCGTCGA GGTGAATCGG GCCGACTACG AGGCGCTCCT TCGGGTGCCG
GGAATCGGGG TCCGCTCGGC CCAGCGGATC GTCTTGGCGC GGCGGGGCAG CCATCTGTCC
CTGGATGACC TGCCCAGGCT CGGGGTGGTG ATGAAGCGCG CCCGCTACTT CATTACCGCC
CGAGGGAGGT TCGCCGCCGA TCTGACTCCC GATGCGGCAG GCCTCCGGCT GCGTCTCACC
GAAAAACCAC TTCGGCGCGA GCGCTGGAGC CAGCCGTCCC TGTTCGACGG TGGGGCCGGC
ATGGACATTC GGTCCACCAT CACGGGTGAG TTGTGA
 
Protein sequence
MAEPTLAEKL EILADGAKYD VSCASSGSSR KGKGGIGNAA QCGICHSWTA DGRCVSLLKI 
LLTNVCIYDC AYCVNRRSND IRRVMLTPAE VAELTIGFYR RNAIEGLFLS TGVIRNPDYT
MELLIEATRQ LREEYRFNGY IHVKVVPGAD LALVERLGRH ADRVSINMEL PSRESLALLA
PDKSRESIVG PMKRVGELIV QTREERKVSR KMPPFAPAGQ STQLIVGASG ETDLQIISLA
AGLYGRLSLK RVYYSAFISV NRDERLPAVV GTPPLAREHR LYQADWLMRY YGFAAGELLD
EERPNLDLSL DPKAGWALRN LHLFPVEVNR ADYEALLRVP GIGVRSAQRI VLARRGSHLS
LDDLPRLGVV MKRARYFITA RGRFAADLTP DAAGLRLRLT EKPLRRERWS QPSLFDGGAG
MDIRSTITGE L