Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B3394 |
Symbol | gcp |
ID | 6794878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 3295741 |
End bp | 3296754 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642777533 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002148138 |
Protein GI | 197248190 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000436621 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGTAC TGGGTATTGA AACATCCTGC GATGAAACCG GCATCGCTAT TTACGACGAC AAAAAAGGTC TGTTAGCCAA CCAATTGTAT AGTCAGGTGA AATTACATGC TGACTACGGC GGCGTAGTGC CTGAACTGGC TTCCCGCGAT CATGTGCGTA AAACCGTGCC GCTGATTCAG GCGGCATTAA AAGAAGCCGG TCTGACGGCG AGCGATATCG ACGCGGTGGC CTATACCGCA GGTCCGGGCC TGGTCGGCGC GCTGCTGGTC GGCGCAACCG TCGGGCGTTC GCTGGCATTT GCCTGGAATG TGCCGGCCAT TCCTGTACAC CATATGGAAG GTCATCTGCT GGCGCCGATG CTGGAAGATA ACCCTCCGGA GTTCCCGTTT GTGGCGCTAC TGGTCTCCGG CGGACATACG CAGCTCATTA GCGTGACCGG AATTGGTCAG TACGAACTGC TGGGCGAGTC GATTGACGAT GCCGCCGGCG AAGCGTTTGA TAAAACTGCC AAATTGTTGG GGCTGGATTA TCCTGGTGGC CCGATGCTGT CGAAAATGGC GTCGCAGGGG ACGGCGGGAC GTTTTGTCTT TCCGCGCCCG ATGACCGATC GCCCGGGGCT GGATTTTAGT TTTTCCGGTC TGAAAACCTT TGCCGCTAAC ACCATTCGTA GTAATGGCGG CGACGAACAA ACTCGCGCTG ATATCGCGCG CGCTTTTGAA GATGCGGTCG TGGATACGCT GATGATCAAG TGCAAGCGCG CGCTGGAAAG CACCGGTTTT AAGCGTCTGG TCATGGCGGG CGGCGTCAGC GCTAACCGCA CGCTGCGCGC GAAGCTTGCC GAAATGATGC AAAAACGCCG CGGCGAAGTG TTCTATGCGC GCCCGGAATT TTGTACCGAC AACGGGGCGA TGATCGCCTA TGCCGGAATG GTGCGGTTTA AGGCGGGCGT TACGGCGGAT CTTGGCGTAA CGGTACGTCC GCGCTGGCCG CTGGCCGAGC TGCCGGCGGC GTAA
|
Protein sequence | MRVLGIETSC DETGIAIYDD KKGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ AALKEAGLTA SDIDAVAYTA GPGLVGALLV GATVGRSLAF AWNVPAIPVH HMEGHLLAPM LEDNPPEFPF VALLVSGGHT QLISVTGIGQ YELLGESIDD AAGEAFDKTA KLLGLDYPGG PMLSKMASQG TAGRFVFPRP MTDRPGLDFS FSGLKTFAAN TIRSNGGDEQ TRADIARAFE DAVVDTLMIK CKRALESTGF KRLVMAGGVS ANRTLRAKLA EMMQKRRGEV FYARPEFCTD NGAMIAYAGM VRFKAGVTAD LGVTVRPRWP LAELPAA
|
| |