Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3035 |
Symbol | |
ID | 6872500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2932853 |
End bp | 2933761 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642786065 |
Product | baseplate assembly protein J |
Protein accession | YP_002216711 |
Protein GI | 198242076 |
COG category | [R] General function prediction only |
COG ID | [COG3948] Phage-related baseplate assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.805786 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.000161965 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCAGTCA TTGACCTTTC CCAGTTGCCT GCGCCGCAGA TAGTGGACGT GCCGGATTTT GAGACGCTGC TGGCTGAGCG CAAGGCCGCT TTTGTGCTCC TTTATCCGGC GGATGAACAG GACGCGGTGC GGCGCACACT GGCGCTGGAA TCTGAACCCG TCACCAAGCT GCTGCAGGAA AGTACATACC GCGAAATCCT GCTGCGCCAG CGTATTAACG AGGCTGCGCA GGCGGTCATG GTGGCCTATT CGATAGGAAA TGATCTTGAG CAGCTGGCAG CCAACTGCAA CGTGAAACGT CTGACGGTAG TGCCTGCTGA TAATGATGCA GTACCGCCGG TCGCCGCAGT GATGGAAGAT GATGATGCGC TGCGCCAGCG CATCCCTGCA GCATTTGAGG GACTGTCCGT TGCTGGCCCG ACGGGAGCCT ATGAATTTCA CGCCAGAAGT GCGGACGGAC GTGTGGCAGA TGCCAGCGCA ACCAGTCCGG CCCCTGCAGA GGTGGTACTT ACCGTACTGA GCCGGGAGGG TGACGGTACA GCAGTAAAAG ACCTGCTGGA TGTGGTTGAA AAAGCCCTGA ACAGTGAGAG TGTACGCCCG GTGGCTGACC GTCTGACGGT TCGTAGTGCG GAGATCATAC CGTACCGGGT GGAGGCTACC ATTTTTCTTT ATCCGGGGCC GGAAGCGGAG CCTGTTATGG CGGCGGCAAA AGCCAGCCTG CAGAAGTACA TCGCCAGTCA GACGAGGCTG GGACGTGATA TCCGCCGCAG CGCCATTTAT GCCGCGCTGC ACGTGGAGGG CGTCCAGCGT GTGGAGCTAA CGTCCCCTCT GGAGGATGTG GTGCTGGATA AGACGCAGGC GGCATCCTGT ACTGAATGGA GCGTTACCAA CGGGGGCACG GATGAATAG
|
Protein sequence | MAVIDLSQLP APQIVDVPDF ETLLAERKAA FVLLYPADEQ DAVRRTLALE SEPVTKLLQE STYREILLRQ RINEAAQAVM VAYSIGNDLE QLAANCNVKR LTVVPADNDA VPPVAAVMED DDALRQRIPA AFEGLSVAGP TGAYEFHARS ADGRVADASA TSPAPAEVVL TVLSREGDGT AVKDLLDVVE KALNSESVRP VADRLTVRSA EIIPYRVEAT IFLYPGPEAE PVMAAAKASL QKYIASQTRL GRDIRRSAIY AALHVEGVQR VELTSPLEDV VLDKTQAASC TEWSVTNGGT DE
|
| |