Gene Information |
Locus tag | GSU1708 |
Symbol | |
ID | 2685571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1872319 |
End bp | 1873581 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637126389 |
Product | Atz/Trz family chlorohydrolase |
Protein accession | NP_952759 |
Protein GI | 39996808 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
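The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue; both conventions are assumptions about this record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1872319
end_bp = 1873581

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not
# contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the 1263 bp and 420 aa listed in the record.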
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0266273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCTCT ACGCCGCCTC ATACCTTGTT CCCATTTCAT CGCCCCCTGT CGCCGGCGGC GCACTAGCCG TCGACAATGG CCGGATAGTC GATACGGGTA CGTTTGCCGA GCTTCGTGCC CGGTACGGCT GTCCCGTTCA CGACTTTCCG GGCTGCACCA TTCTTCCCGG CCTCGTCAAC GCCCACACGC ACCTGGAGCT TACCCATTTC CCGTCATGGA AAATCCGCAA GGGAATCGAT TACTCACCCC GGACGTACGT TGATTGGATC ATCCAGGTAA TAAAGATCCG CCGGGCCCTG ACGCGGCAGG AACAGGAACT TTCCGTACGG GAAGGGCTGC GCATCTGCCT TGAAGCGGGG ACCACGTCCA TCGGCGAAAT TCTCACCGAC CGGTCGCTCC TCCCTCTGTA TGCCGATTCC GGCCTGGGGG GGCGGCTTTT TCTCGAGGCC ATAGGCCACG ACCCGGTCCG CAGCGCCGAG CTCATCTCGG AGCTTGGTGC CGCCGTAGCT TCTTTCCCTG CTGGAGACTT ACTGCCGGGC CTCTCGCCCC ATGCCCCCCA CACTGTTTCT GAGCAGCTCC ATCAGAATGT TCGCCGGCTG GCGGAAGAGT ACAACGTGCC GCGAATCATC CATCTGGCCG AATCCCGGGA AGAGAGTGAT TTCTTCTTCG ATTCGACCGG AAAAATTGCT GAGCTTCTCT ATTCCCACGT CAGGTGGGAG TCGTACCTTC CCGCCCCGAG ACGTGCTACC GCAACCGCCT GGCTTGACGG ACTCGGCGTC CTCAACGGTG CCATCTCGGC GGTCCACTGC GTTCATCTTA CGCCGTCGGA TGCCGAAACC CTGGCCAAAC GCGGAGTCGG CATTGTGCTT TGCCCCCGGA GCAACGAAAA GCTTGCTGTG GGGCGCGCAC CTGTCGCCTA TTTGAAGAAA CTTGGTATTC CCCTTGCCTT GGGCACCGAC TCGCTGGCCA GCAACGATTC CCTCTCACTC TGGGATGAAA TGCGTTACCT CCTCGATCTC TTCCCGGGCG TCTTCACCCC GTCAGAGGCC CTTGCCATGG CAACCATCGG TTCGGCACGA CAGCTCGCTC TCGCCGATCG GGTAGGGTCC ATCGAGAAGG GCAAACGCGC CGATCTGCTT GTCATGAAAC TACCTGGTCC GCAGAGCACC GGCGAAGGAC TCCACGAAGC CGTTATCGGC AGCGGAGAAC TTCTCCATGT GATTCTGTCC GGCAGGTTCA CCGACCGGAC AGAACCCCGC TAA
Protein sequence | MELYAASYLV PISSPPVAGG ALAVDNGRIV DTGTFAELRA RYGCPVHDFP GCTILPGLVN AHTHLELTHF PSWKIRKGID YSPRTYVDWI IQVIKIRRAL TRQEQELSVR EGLRICLEAG TTSIGEILTD RSLLPLYADS GLGGRLFLEA IGHDPVRSAE LISELGAAVA SFPAGDLLPG LSPHAPHTVS EQLHQNVRRL AEEYNVPRII HLAESREESD FFFDSTGKIA ELLYSHVRWE SYLPAPRRAT ATAWLDGLGV LNGAISAVHC VHLTPSDAET LAKRGVGIVL CPRSNEKLAV GRAPVAYLKK LGIPLALGTD SLASNDSLSL WDEMRYLLDL FPGVFTPSEA LAMATIGSAR QLALADRVGS IEKGKRADLL VMKLPGPQST GEGLHEAVIG SGELLHVILS GRFTDRTEPR
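The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a nucleotide fragment into protein. It is an illustration rather than the method used to produce this record: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). Only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11
# uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean_sequence(text)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(text):
    """Translate complete codons; '*' marks a stop, 'X' an unknown codon."""
    seq = clean_sequence(text)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First three blocks of the gene sequence from the record above.
fragment = "ATGGAGCTCT ACGCCGCCTC ATACCTTGTT"
print(translate(fragment))
print(gc_fraction(fragment))
```

Translating this 30-base fragment yields MELYAASYLV, which matches the first block of the protein sequence listed above. Running the same functions on the full gene sequence would be the natural way to check the reported GC content and protein length.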
| |