Gene Information |
Locus tag | SNSL254_A4389 |
Symbol | |
ID | 6485998 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 4260657 |
End bp | 4261826 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642739629 |
Product | late control gene D protein |
Protein accession | YP_002043323 |
Protein GI | 194442964 |
COG category | [R] General function prediction only |
COG ID | [COG3500] Phage protein D |
TIGRFAM ID | |
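
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for SNSL254_A4389
length_bp = gene_length(4260657, 4261826)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the results agree with the Gene Length and Protein Length fields (1170 bp and 389 aa).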
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.725346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 0.0652944 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACGG GCATGACCAT TGACGCCGGT GCCAGCTTTG CACCGGCATT TATGCTGACG CTGAACAGCC AGGACATTAC CAGCAATTTT AGTGACCGGC TGATTTCTCT CACCATGACC GACAACCGGG GTTTTGAGGC TGACCAGCTC GACATTGAGC TCGACGACAC CGACGGTAAA GTTGAGTTAC CCCTGCGCGG GGCGGTGCTG ACGCTGTGGC TTGGCTGGCA GGGCTCGGCA TTGCTGAATA AAGGCGATTT CACGGTCGAT GAGATTGAGC ACCGGGGCGC ACCTGATACC CTGACCATTC GGGCGCGTAG TGCAGATTTT CGCGGAACGC TCAATTCACG GCGTGAGGAG TCATGGCACG ACACCACCCT CGGTGAGCTG GTCAGCACCA TTGCAAAGCG CAATAAACTG ACGGCCAGTG TCGCGGATTC GCTGAAAAAA ATACCGGTAC CGCATATCGA CCAGTCGCAG GAGTCCGACG CCGTATTTCT GACCCGGCTG GCTGACCGCA ATGGGGCGGC GGTGTCAGTG AAAGCGGGTA AACTCCTGTT TCTGAAAGCC GGTAGTGCGA TGACGGCCAG TGGCAAGCCC GTCCCGCAAA TGACCCTGAC CCGCAGCGAT GGCGACCGTC ATCAGTTTGC CATTGCTGAC CGTGGTGCTT ACACCGGCGT AACAGCAAAA TGGTTGCACA CCAAAGACCC GAAGCCGCAA AAGCAAAAAG TGACGCTGAA ACGTAAGCCA AAAGAGAAGC ACCTGCGCGC ACTGGAGCAT CCGAAAGCAA AGCTGGTCAG CAAAAAGACA AAGGCCAAAA AAGAGCCGGA AGCGCGTGAG GGTGAGTATA TGGCCGGTGA GGCTGATAAC GTGCTGGCGC TGACGACGGT CTACGCATCA AAGGCGCAGG CGATGCGCGC AGCTCAGGCT AAATGGGATA AGCTGCAGCG AGGCGTTGCG GAGTTTTCAA TTACGCTGGC GCTTGGTAGG GCTGATTTAT TCCCTGAGAC ACCGGTGCGT GTGTCGGGCT TTAAGCGTGT CATAGACGAG CAGGCATGGT TAATCAGTAA AGTGACTCAC AGCCTGAATA ATAGTGGCTT CACGACGGGC TTAGAGCTTG AGGTTAAGCT CTCTGACGTA GAGTATAAAG CGGAAGATGA TGATGGGTGA
|
Protein sequence | MITGMTIDAG ASFAPAFMLT LNSQDITSNF SDRLISLTMT DNRGFEADQL DIELDDTDGK VELPLRGAVL TLWLGWQGSA LLNKGDFTVD EIEHRGAPDT LTIRARSADF RGTLNSRREE SWHDTTLGEL VSTIAKRNKL TASVADSLKK IPVPHIDQSQ ESDAVFLTRL ADRNGAAVSV KAGKLLFLKA GSAMTASGKP VPQMTLTRSD GDRHQFAIAD RGAYTGVTAK WLHTKDPKPQ KQKVTLKRKP KEKHLRALEH PKAKLVSKKT KAKKEPEARE GEYMAGEADN VLALTTVYAS KAQAMRAAQA KWDKLQRGVA EFSITLALGR ADLFPETPVR VSGFKRVIDE QAWLISKVTH SLNNSGFTTG LELEVKLSDV EYKAEDDDG
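
The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares with table 1 for internal codons, and it ignores alternative start codons because this gene begins with ATG. The helper names `translate` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 18 bases and last 12 bases of the gene sequence in this record
print(translate("ATGATTACGG GCATGACC"))
print(translate("GATGATGGGT GA"))
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field if the assumptions above hold, and `gc_fraction` can be compared against the GC content field in the same way.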