Gene GSU1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1865 
Symbol 
ID2688523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2041368 
End bp2042390 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content65% 
IMG OID637126556 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionNP_952914 
Protein GI39996963 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.211148 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGTAC TTGCCATCGA AACATCGTGT GACGAAACCG CTGCCGCCTT GGTGCGCGAC 
GGGCGCTCCA TCCTGTCAAG CGTGGTTTCC TCCCAGGTGA AGGATCACGC CGTCTACGGC
GGGGTTGTAC CCGAAATTGC CTCGCGCAAA CACCTGGAGA CCATTCCCGC GGTTATCGGC
GAGGCCCTGC GCTTGGCGGA CGTGACCCTC GATCACGTGG AAGGGGTTGC GGTGACCCAG
GGGCCCGGAC TGGCCGGCGC TCTCCTGGTC GGCCTGTCCG TGGCCAAATC CATCGCCTTT
GCCAGGAGAC TGCCCCTGGT AGGAGTCAAT CACATTGAAG CGCACCTTGC CGCTATTTTT
CTTGAACGCG AGGTCGCCTA TCCCTATCTG GCCCTGGTGG TGTCGGGGGG GCATTCCCAC
CTGTACCGGG TCGACGGTAT CGGCCGGTGC ACCACCCTCG GCCAGACCCT GGACGATGCT
GCTGGCGAAG CCTTCGACAA GGTGGCCAAG CTGCTGGGCC TCCCCTATCC GGGCGGCATC
GAGATTGATC GCCTGGCCTC CGCCGGTGAC CCGGATGCCA TCGCCTTTCC GCGGCCTCTG
CTCCACGACG GCAGCTTCAA CTTCAGCTTC AGCGGCCTCA AGACCGCGGT GCTCTCTGCG
GTGAAGAAGC AGGGGCTTCC CGAGGGAAAA TCCCTGGCCG ACTTCTGCGC TTCGTTCCAG
AAGGCCGTCT GTCATGTGTT GGTGGAAAAG ACCTTTCGTG CCGCGGAGGC GGCAGGTATT
GACCGGGTCG TGGTGGCAGG TGGGGTGGCC TGCAACAGTG CGCTGCGGCG GGAAATGGCC
CATGCCGCCG CTGCGCGGGG CGTGGAGCTC ATGATCCCGT CGCCGTCGCT ATGCGGAGAC
AATGCCGCCA TGATCGCCGT GCCGGGTGAC TATTACCTGC GCTGTGGGGA GCAGGGCGGT
CTCGCGCTTG ACGCACGGGT GAACTGGCCA CTTGACCTGC TCGGGTCGGG GAGGGAGGGG
TGA
 
Protein sequence
MLVLAIETSC DETAAALVRD GRSILSSVVS SQVKDHAVYG GVVPEIASRK HLETIPAVIG 
EALRLADVTL DHVEGVAVTQ GPGLAGALLV GLSVAKSIAF ARRLPLVGVN HIEAHLAAIF
LEREVAYPYL ALVVSGGHSH LYRVDGIGRC TTLGQTLDDA AGEAFDKVAK LLGLPYPGGI
EIDRLASAGD PDAIAFPRPL LHDGSFNFSF SGLKTAVLSA VKKQGLPEGK SLADFCASFQ
KAVCHVLVEK TFRAAEAAGI DRVVVAGGVA CNSALRREMA HAAAARGVEL MIPSPSLCGD
NAAMIAVPGD YYLRCGEQGG LALDARVNWP LDLLGSGREG