Gene GSU1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1708 
Symbol 
ID2685571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1872319 
End bp1873581 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content60% 
IMG OID637126389 
ProductAtz/Trz family chlorohydrolase 
Protein accessionNP_952759 
Protein GI39996808 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0266273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCTCT ACGCCGCCTC ATACCTTGTT CCCATTTCAT CGCCCCCTGT CGCCGGCGGC 
GCACTAGCCG TCGACAATGG CCGGATAGTC GATACGGGTA CGTTTGCCGA GCTTCGTGCC
CGGTACGGCT GTCCCGTTCA CGACTTTCCG GGCTGCACCA TTCTTCCCGG CCTCGTCAAC
GCCCACACGC ACCTGGAGCT TACCCATTTC CCGTCATGGA AAATCCGCAA GGGAATCGAT
TACTCACCCC GGACGTACGT TGATTGGATC ATCCAGGTAA TAAAGATCCG CCGGGCCCTG
ACGCGGCAGG AACAGGAACT TTCCGTACGG GAAGGGCTGC GCATCTGCCT TGAAGCGGGG
ACCACGTCCA TCGGCGAAAT TCTCACCGAC CGGTCGCTCC TCCCTCTGTA TGCCGATTCC
GGCCTGGGGG GGCGGCTTTT TCTCGAGGCC ATAGGCCACG ACCCGGTCCG CAGCGCCGAG
CTCATCTCGG AGCTTGGTGC CGCCGTAGCT TCTTTCCCTG CTGGAGACTT ACTGCCGGGC
CTCTCGCCCC ATGCCCCCCA CACTGTTTCT GAGCAGCTCC ATCAGAATGT TCGCCGGCTG
GCGGAAGAGT ACAACGTGCC GCGAATCATC CATCTGGCCG AATCCCGGGA AGAGAGTGAT
TTCTTCTTCG ATTCGACCGG AAAAATTGCT GAGCTTCTCT ATTCCCACGT CAGGTGGGAG
TCGTACCTTC CCGCCCCGAG ACGTGCTACC GCAACCGCCT GGCTTGACGG ACTCGGCGTC
CTCAACGGTG CCATCTCGGC GGTCCACTGC GTTCATCTTA CGCCGTCGGA TGCCGAAACC
CTGGCCAAAC GCGGAGTCGG CATTGTGCTT TGCCCCCGGA GCAACGAAAA GCTTGCTGTG
GGGCGCGCAC CTGTCGCCTA TTTGAAGAAA CTTGGTATTC CCCTTGCCTT GGGCACCGAC
TCGCTGGCCA GCAACGATTC CCTCTCACTC TGGGATGAAA TGCGTTACCT CCTCGATCTC
TTCCCGGGCG TCTTCACCCC GTCAGAGGCC CTTGCCATGG CAACCATCGG TTCGGCACGA
CAGCTCGCTC TCGCCGATCG GGTAGGGTCC ATCGAGAAGG GCAAACGCGC CGATCTGCTT
GTCATGAAAC TACCTGGTCC GCAGAGCACC GGCGAAGGAC TCCACGAAGC CGTTATCGGC
AGCGGAGAAC TTCTCCATGT GATTCTGTCC GGCAGGTTCA CCGACCGGAC AGAACCCCGC
TAA
 
Protein sequence
MELYAASYLV PISSPPVAGG ALAVDNGRIV DTGTFAELRA RYGCPVHDFP GCTILPGLVN 
AHTHLELTHF PSWKIRKGID YSPRTYVDWI IQVIKIRRAL TRQEQELSVR EGLRICLEAG
TTSIGEILTD RSLLPLYADS GLGGRLFLEA IGHDPVRSAE LISELGAAVA SFPAGDLLPG
LSPHAPHTVS EQLHQNVRRL AEEYNVPRII HLAESREESD FFFDSTGKIA ELLYSHVRWE
SYLPAPRRAT ATAWLDGLGV LNGAISAVHC VHLTPSDAET LAKRGVGIVL CPRSNEKLAV
GRAPVAYLKK LGIPLALGTD SLASNDSLSL WDEMRYLLDL FPGVFTPSEA LAMATIGSAR
QLALADRVGS IEKGKRADLL VMKLPGPQST GEGLHEAVIG SGELLHVILS GRFTDRTEPR