Gene Information |
Locus tag | SeD_A3564 |
Symbol | gcp |
ID | 6874081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3419862 |
End bp | 3420875 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642786552 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002217189 |
Protein GI | 198243102 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
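
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 3419862
END_BP = 3420875


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
print(gene_length)                           # 1014, matches "Gene Length"
print(expected_protein_length(gene_length))  # 337, matches "Protein Length"
```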
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000516178 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.199213 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACATCCTGC GATGAAACCG GCATCGCTAT TTACGACGAC AAAAAAGGTC TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTACATGC TGACTACGGC GGCGTAGTGC CTGAACTGGC TTCCCGCGAT CATGTGCGTA AAACCGTGCC GCTGATTCAG GCGGCATTAA AAGAAGCCGG TCTGACGGCG AGCGATATCG ACGCGGTGGC CTATACCGCA GGCCCGGGCC TGGTCGGCGC GCTGCTGGTC GGCGCAACCG TCGGGCGTTC GCTGGCATTT GCCTGGAATG TGCCGGCCAT TCCTGTACAC CATATGGAAG GTCATCTGCT GGCGCCAATG CTGGAAGATA ATCCCCCGGA ATTCCCGTTT GTGGCGCTAC TGGTCTCCGG CGGACATACG CAGCTCATTA GCGTGACCGG AATCGGTCAG TACGAACTGC TGGGAGAGTC GATTGACGAT GCCGCCGGTG AAGCGTTTGA TAAAACCGCC AAATTGTTGG GGCTGGATTA TCCTGGCGGC CCGATGCTGT CGAAAATGGC GTCGCAGGGG ACGGCGGGAC GTTTTGTCTT TCCGCGCCCG ATGACCGATC GCCCGGGGCT GGATTTTAGT TTTTCCGGTC TGAAAACCTT TGCCGCTAAC ACCATTCGTA GTAATGGCGG CGACGAACAA ACTCGCGCTG ATATCGCGCG CGCTTTTGAA GATGCGGTCG TGGATACGCT GATGATCAAG TGCAAGCGCG CGCTGGAAAG CACCGGTTTT AAGCGTCTGG TCATGGCGGG CGGCGTCAGC GCTAACCGCA CGCTGCGCGC GAAGCTTGCC GAAATGATGC AAAAACGCCG CGGCGAAGTG TTCTATGCGC GTCCGGAGTT TTGTACTGAC AACGGGGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AGGCGGGCGT TACGGCGGAT CTTGGCGTAA CGGTACGTCC GCGCTGGCCG CTGGCCGAGC TGCCGGCGGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD KKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ AALKEAGLTA SDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PMLSKMASQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRSNGGDEQ TRADIARAFE DAVVDTLMIK CKRALESTGF KRLVMAGGVS ANRTLRAKLA EMMQKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGVTAD LGVTVRPRWP LAELPAA
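
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the blocks can be joined and the nucleotide sequence translated to compare it with the protein row. The example uses only the first three blocks of the gene sequence. The helper names are hypothetical. The codon table is the standard one, which is assumed here to give the same amino acid assignments as translation table 11; special handling of alternative start codons is not modeled.

```python
# First three 10-base blocks of the gene sequence, copied from the record.
GENE_PREFIX_BLOCKS = "ATGCGTGTAC TGGGTATTGA AACATCCTGC"

# Standard codon table, built in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO_ACIDS)
)


def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


prefix = join_blocks(GENE_PREFIX_BLOCKS)
print(translate(prefix))              # MRVLGIETSC, the first protein block
print(round(gc_fraction(prefix), 3))  # 0.467 for this 30-base prefix only
```

Running the same functions on the full gene sequence would be the way to check the reported GC content and protein sequence; the GC value printed here applies only to the 30-base prefix.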
|
| |