Gene Information |
Locus tag | SeD_A2255 |
Symbol | |
ID | 6872470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2154605 |
End bp | 2155945 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642785355 |
Product | tail/DNA circulation protein |
Protein accession | YP_002216017 |
Protein GI | 198243666 |
COG category | [R] General function prediction only |
COG ID | [COG4228] Mu-like prophage DNA circulation protein |
TIGRFAM ID | |
|
|
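The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record. The helper names `cds_lengths` and `gc_percent` are hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


def gc_percent(dna):
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


gene_length, protein_length = cds_lengths(2154605, 2155945)
print(gene_length, protein_length)  # 1341 446
```

Passing the full gene sequence from the Sequence section to `gc_percent` would give a value to compare against the GC content field.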
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.000334711 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTTTTT TCTCCTCAAC TGGCTGGCGC GGGCGCCTGC GTGATGCATC ATTTCGTGGA GTGCCTTTCT CCGTTGAAGA TGATGAAAGC ACCTTTGGAC GCCGCGTACA GGTACATGAA TATCCGAACA GGGATAAGCC CTGGACGGAG GATTTAGGTC GCGCCACGCG CCGCCTGACG ATAAATGCTT ATCTTGTCGG TGATGATTAC GCAGACAGGC GGGATCGTCT TATTGGTGCC ATTGAAACCG CAGGCCCTGG TACGCTGGTC CATCCGCAGT ATGGCGAAAT GCAGGGCAGC ATTGACGGAC AGGTCAGGAT CACTCACAGC AGTACAGAAG GGCGCATGTG TCGTGTCTCC TTTCAGTTTG TGGAAAGTGG GGAACTTTCT TTTCCGGTGG CAGGAATGGC AACGGCGAAG CGCCTGGAAA CGTCAGGCGG GCTTTTCGAC GATGCGATTG ACAGTATGTT TTCCACATTC TCGTTGTCAG GTATTTCTGA TTTTATCCAG AACGATGTCA TTGCCGATGC AGCCTCCATG CTGGGCGATG TTGCCGATGC TTTCAGGATG GTTGACTCCG GCGTGTCTGC CGCAATGCGG CTGTTACAGG GGGATTTGTC TGTCATTCTG ATGCCACCGG GCGCCGCAAG TGATTTCGTT AACGCACTGC AAAAAGCCTG GCGCTCAGGT GACAGGCTCA GGGGCAGTAC ATCGGATCTG GTCACGATGA TAAAAACGAT GTCAGGTATC ACGCTTGATC CCGGTCTTTC CCCTCGAGGC ACCTGGCCCA CTGACTCCGG ATCTGCTGCG AAACAGAAAA TGCAACGCAA TATGATCGCA GCCGCCATCA GGACAACAGC CATCAGCACA GCCGTCCACG CCGTGACAAC ACTGAAGCAG CCGCGTGATG TACCTGGTGT CCGGGGCGTA AATCAGCCTG CAGGAACAGG CCGTGACTCA GACATTATCA CTGTCATGCA CCCGGCGCTG GATGGTGTAC AGACAGTCAG TAATGGCAGC TCTCCACCGA ATTATGAGGA TCTGAAAGCT ATCCGGACCG CGCTCAATGC TGCGATTGAC CAGGAGCAGT TGCGTATCCG GGATGATGTG CTTTTCCAGC AAATTTCCGT TATGCGGACG GATCTCAATC GCGATATTTC TGCACGACTG GCACAGGTTG AACGTACTGC ATTGCGAACG CCTGATGATG TTCTGCCTGC ACTGGTACTG GCTGCAGCCT GGTATGACGA CGCCGGGCGG GAATCTGACA TCCTCACTCG TAATCCCGTT CCCCATCCAG GATTTATCCC GGTTGAGCCG CTGAGGGTTC CGGTACGATG A
|
Protein sequence | MAFFSSTGWR GRLRDASFRG VPFSVEDDES TFGRRVQVHE YPNRDKPWTE DLGRATRRLT INAYLVGDDY ADRRDRLIGA IETAGPGTLV HPQYGEMQGS IDGQVRITHS STEGRMCRVS FQFVESGELS FPVAGMATAK RLETSGGLFD DAIDSMFSTF SLSGISDFIQ NDVIADAASM LGDVADAFRM VDSGVSAAMR LLQGDLSVIL MPPGAASDFV NALQKAWRSG DRLRGSTSDL VTMIKTMSGI TLDPGLSPRG TWPTDSGSAA KQKMQRNMIA AAIRTTAIST AVHAVTTLKQ PRDVPGVRGV NQPAGTGRDS DIITVMHPAL DGVQTVSNGS SPPNYEDLKA IRTALNAAID QEQLRIRDDV LFQQISVMRT DLNRDISARL AQVERTALRT PDDVLPALVL AAAWYDDAGR ESDILTRNPV PHPGFIPVEP LRVPVR
|
| |
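The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the gene sequence is already written 5' to 3' on the coding strand (it begins with ATG and ends with TGA, although the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard code, so the standard amino-acid string is used. The function name `translate` is hypothetical, and only the first and last few codons of the record are shown here.

```python
from itertools import product

# Codons in the conventional TCAG order, paired with the amino acids
# of the standard code (identical codon assignments in table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as listed in the record.
print(translate("ATGGCTTTTT TCTCCTCAAC TGGCTGGCGC"))  # MAFFSSTGWR
# Last four codons of the gene sequence, including the TGA stop.
print(translate("CCGGTACGATGA"))  # PVR
```

Both outputs match the corresponding ends of the listed protein sequence.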