Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1832 |
Symbol | |
ID | 6484555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1797870 |
End bp | 1799039 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642737207 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_002040959 |
Protein GI | 194446469 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00100254 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 2.05101e-30 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTTGGAGT TGTTATTTCT GCTGTTGCCT GTAGCCGCTG CCTATGGGTG GTATATGGGT CGCAGAAGTG CGCAACAAAC AAAACAGGAT GAAGCTAACC GCCTGTCGCG CGATTATGTC GCAGGGGTTA ACTTCCTGCT GAGTAACCAA CAAGATAAAG CGGTGGATCT GTTCCTCGAT ATGCTTAAAG AGGATACGGG CACCGTTGAG GCTCATCTCA CTCTCGGTAA TCTGTTTCGC TCACGCGGCG AAGTCGATCG CGCCATTCGT ATTCATCAAA CGCTCATGGA AAGCGCTTCA TTGACCTATG AACAGCGTTT ACTGGCTGTT CAGCAACTGG GGCGCGACTA TATGGCCGCC GGTTTATATG ACCGCGCGGA AGATATGTTT AACCAACTTA CCGACGAAAC GGAATTTCGC GTAGGCGCGT TACAACAGCT CTTGCAAATC TATCAGCTAA CCAGCGACTG GCAAAAGGCG ATCGAAGTAG CAGAACGGCT GGTGAAACTG GGCAAAGATA AACAGCGTAT CGAAATCGCC CATTTTTACT GTGAGTTAGC TTTACAGCAG ATGGGCAACG ACGACATGGA TCGCGCGATG GCGTTGCTGA AAAAAGGTGC CGCCGCAGAT AAAAATAGCG CCCGGGTGTC TATCATGATG GGGCGCGTTT ATATGGCGAG AGGGGATTAC GCCAAAGCGG TCGAAAGCCT GCAACGTGTG ATCGTTCAGG ATAAAGAGCT GGTCAGCGAA ACGCTGGAGA TGCTGCAAAC CTGTTATCAA CAGCTCGGTA AAAATGCCGA GTGGGCGGAG TTTTTACGTC GCGCCGTTGA GGAGAATACC GGTGCTGGCG CTGAGTTAAT GCTTGCCGAT ATTCTGGAAG CACGTGAAGG TAGTGACGCA GCTCAAGTCT ATATCACGCG TCAGCTACAG CGACATCCTA CCATGCGGGT GTTCCATAAG CTGATGGATT ACCATCTCAA CGAGGCGGAA GAAGGGCGAG CGAAAGAAAG CCTGATGGTA CTGCGTGATA TGGTTGGCGA GCAGGTGCGC AGTAAACCGC GGTATCGTTG TCAGAAATGC GGTTTTACCG CCTATACCTT GTACTGGCAC TGTCCGTCCT GCCGGGCATG GTCGACCATT AAACCTATTC GCGGACTTGA TGGGCAGTAG
|
Protein sequence | MLELLFLLLP VAAAYGWYMG RRSAQQTKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYEQRLLAV QQLGRDYMAA GLYDRAEDMF NQLTDETEFR VGALQQLLQI YQLTSDWQKA IEVAERLVKL GKDKQRIEIA HFYCELALQQ MGNDDMDRAM ALLKKGAAAD KNSARVSIMM GRVYMARGDY AKAVESLQRV IVQDKELVSE TLEMLQTCYQ QLGKNAEWAE FLRRAVEENT GAGAELMLAD ILEAREGSDA AQVYITRQLQ RHPTMRVFHK LMDYHLNEAE EGRAKESLMV LRDMVGEQVR SKPRYRCQKC GFTAYTLYWH CPSCRAWSTI KPIRGLDGQ
|
| |