Gene SeD_A3564 details


Gene Information

Locus tag: SeD_A3564
Symbol: gcp
ID: 6874081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3419862
End bp: 3420875
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 58%
IMG OID: 642786552
Product: putative DNA-binding/iron metalloprotein/AP endonuclease
Protein accession: YP_002217189
Protein GI: 198243102
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
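
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both are assumptions, but they are consistent with the values listed. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3419862
END_BP = 3420875


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1014, matching "Gene Length"
print(protein_length(length_bp))  # 337, matching "Protein Length"
```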


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000516178
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value: 0.199213
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGTAC TGGGTATTGA AACATCCTGC GATGAAACCG GCATCGCTAT TTACGACGAC 
AAAAAAGGTC TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTACATGC TGACTACGGC
GGCGTAGTGC CTGAACTGGC TTCCCGCGAT CATGTGCGTA AAACCGTGCC GCTGATTCAG
GCGGCATTAA AAGAAGCCGG TCTGACGGCG AGCGATATCG ACGCGGTGGC CTATACCGCA
GGCCCGGGCC TGGTCGGCGC GCTGCTGGTC GGCGCAACCG TCGGGCGTTC GCTGGCATTT
GCCTGGAATG TGCCGGCCAT TCCTGTACAC CATATGGAAG GTCATCTGCT GGCGCCAATG
CTGGAAGATA ATCCCCCGGA ATTCCCGTTT GTGGCGCTAC TGGTCTCCGG CGGACATACG
CAGCTCATTA GCGTGACCGG AATCGGTCAG TACGAACTGC TGGGAGAGTC GATTGACGAT
GCCGCCGGTG AAGCGTTTGA TAAAACCGCC AAATTGTTGG GGCTGGATTA TCCTGGCGGC
CCGATGCTGT CGAAAATGGC GTCGCAGGGG ACGGCGGGAC GTTTTGTCTT TCCGCGCCCG
ATGACCGATC GCCCGGGGCT GGATTTTAGT TTTTCCGGTC TGAAAACCTT TGCCGCTAAC
ACCATTCGTA GTAATGGCGG CGACGAACAA ACTCGCGCTG ATATCGCGCG CGCTTTTGAA
GATGCGGTCG TGGATACGCT GATGATCAAG TGCAAGCGCG CGCTGGAAAG CACCGGTTTT
AAGCGTCTGG TCATGGCGGG CGGCGTCAGC GCTAACCGCA CGCTGCGCGC GAAGCTTGCC
GAAATGATGC AAAAACGCCG CGGCGAAGTG TTCTATGCGC GTCCGGAGTT TTGTACTGAC
AACGGGGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AGGCGGGCGT TACGGCGGAT
CTTGGCGTAA CGGTACGTCC GCGCTGGCCG CTGGCCGAGC TGCCGGCGGC GTAA
 
Protein sequence
MRVLGIETSC DETGIAIYDD KKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
AALKEAGLTA SDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM
LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG
PMLSKMASQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRSNGGDEQ TRADIARAFE
DAVVDTLMIK CKRALESTGF KRLVMAGGVS ANRTLRAKLA EMMQKRRGEV FYARPEFCTD
NGAMIAYAGM VRFKAGVTAD LGVTVRPRWP LAELPAA
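
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 uses the same amino-acid assignments as the standard genetic code), strips the 10-character spacing used in this record, and translates the first 60 bases of the gene. The helper names `clean`, `translate` and `gc_percent` are hypothetical. Pasting the full gene sequence into `clean` would allow the whole protein and the reported GC content to be checked the same way.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and first 20 residues of the protein,
# copied from this record.
gene_start = clean(
    "ATGCGTGTAC TGGGTATTGA AACATCCTGC GATGAAACCG GCATCGCTAT TTACGACGAC"
)
protein_start = clean("MRVLGIETSC DETGIAIYDD")

print(translate(gene_start))                   # MRVLGIETSCDETGIAIYDD
print(translate(gene_start) == protein_start)  # True
```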