Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2227 |
Symbol | |
ID | 6484258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 2133895 |
End bp | 2134905 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642737575 |
Product | hypothetical protein |
Protein accession | YP_002041317 |
Protein GI | 194446088 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG2096] Uncharacterized conserved protein [COG3193] Uncharacterized protein, possibly involved in utilization of glycolate and propanediol |
TIGRFAM ID | [TIGR00636] ATP:cob(I)alamin adenosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 89 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATTT ATACCCGAAC AGGTGACGCT GGCACCACAT CACTGTTTAC CGGCCAGCGG GTGAGCAAAA CCCACCCGCG GGTTGAAGCC TACGGCACGC TGGATGAGCT GAATGCCGCG CTGAGCCTGT GCGCCTGCGC CGCTGCGGAT GAAAATCACC GCACCTTACT CGAGGCCATC CAGCAGCAAC TTTTTTGGTT TAGCGCAGAG CTGGCCAGCG ACAGCGAGCA GCCGTCGCCC AAACAGCGCT ACATCAGCAG CGAAGAGATT TCGGCCCTGG AAGCCGCTAT CGATCGGGCA ATGGCCCGCG TCGAACCGCT GCACAGCTTT ATATTACCCG GACGCTGCGA AGCCGCGAGC CGCTTACATT TTGCCCGCAC GCTGGCGCGC CGCGCCGAAC GCCGTCTGGT TGAACTGGCA ACTGAAGTCA ACGTACGCCA GGTGCTGATG CGCTACATCA ACCGCTTATC GGACTGCCTG TACGCCCTGG CCCGCGCGGA AGATAGCGAT GCGCACCAGG CCAACATCAT CCGTGAAGTT AGCAAGCGCT ATCTGGCTGC CAGCCAGCCG ACCCGCAGCA AGGAGACAAC GCCCGTGGCC CTCTCATTCC ACGATCTGCA CCAGCTCACC CGCGCCGCCG TTGAACGCGC GCAGCAACTG CAGGTTCCGG TAGTCGTCAG CATCGTTGAC GCGCACGGCA CGGAAACTGT GACCTGGCGG ATGCCGGACG CCCTGCTGGT CAGCAGCGAG CTGGCGCCGA AAAAGGCCTG GACCGCAGTG GCGATGAAAA CGGCGACCCA TGAGCTGAGC GATGTCGTTC AGCCGGGCGC CGCGCTTTAC GGTCTGGAAA GTCATTTACA GGGAAAAGTG GTCACCTTTG GCGGCGGTTA CGCCCTGTGG CGCGGCGGCA TATTAATTGG GGGTCTTGGC ATCAGCGGCG GCAGCGTTGA GCAGGACATG GACATAGCAC AGACCGCCAT CGCGGCTATT AACGTGGGAA CTCATCAATG A
|
Protein sequence | MAIYTRTGDA GTTSLFTGQR VSKTHPRVEA YGTLDELNAA LSLCACAAAD ENHRTLLEAI QQQLFWFSAE LASDSEQPSP KQRYISSEEI SALEAAIDRA MARVEPLHSF ILPGRCEAAS RLHFARTLAR RAERRLVELA TEVNVRQVLM RYINRLSDCL YALARAEDSD AHQANIIREV SKRYLAASQP TRSKETTPVA LSFHDLHQLT RAAVERAQQL QVPVVVSIVD AHGTETVTWR MPDALLVSSE LAPKKAWTAV AMKTATHELS DVVQPGAALY GLESHLQGKV VTFGGGYALW RGGILIGGLG ISGGSVEQDM DIAQTAIAAI NVGTHQ
|
| |