Gene Information |
Locus tag | SeD_A1039 |
Symbol | |
ID | 6871809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1038673 |
End bp | 1040433 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642784224 |
Product | hypothetical protein |
Protein accession | YP_002214898 |
Protein GI | 198245525 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain |
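The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1038673, 1040433)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (1761 bp) and Protein Length (586 aa).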
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.548133 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.203105 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCAAA CATTTATTCC CGGCAAAGAC GCCGCGCTGG AAGACTCCAT CGCCCGCTTC CAGCAAAAGT TACTCGACCT CGGCTTTCAC ATCGAAGAGG CCTCCTGGCT GAACCCGGTG CCAAACGTCT GGTCAGTGCA TATTCGCGAT AAAGAGTGCG CGTTATGCTT TACCAACGGA AAAGGCGCGA CCAAAAAAGC GGCGCTGGCC TCGGCGCTGG GCGAATACTT CGAGCGTCTG TCAACCAACT ACTTCTTTGC TGATTTCTGG CTTGGCGAAA CGGTCGCCAA TGGGCCATTC GTGCATTACC CGAACGAAAA GTGGTTCCCG CTGACTGAAA ATGACGACGT ACCGGAAGGC TTGCTTGATG CCCGTCTGCG CGCGTTTTAC GATCCGGAAA ATGAACTCAC CGGAAGCCAG TTAATTGATC TTCAGTCCGG CAATGAAGCT CGCGGCGTCT GCGGCCTGCC ATTTACCCGT CAGTCCGATA ACCAGACCGT GTATATTCCG ATGAATATCA TCGGCAACCT GTACGTCTCT AACGGAATGT CCGCCGGCAA TACGCGTAAT GAAGCCCGCG TTCAGGGACT GTCGGAAGTC TTCGAGCGTT ATGTGAAAAA TCGCATCATT GCGGAAAGTA TCAGTCTGCC GGAGATTCCC GCAGAGGTGA TGGCGCGTTA TCCGGCGGTA ATGGAGTCAA TCGCCACGCT GGAAGCCGAG GGTTTCCCGA TTTTCGCCTA TGACGGCTCG CTGGGCGGTA AGTATCCGGT TATCTGCGTC GTGCTGTTCA ACCCGGCTAA CGGTACCTGC TTTGCTTCTT TTGGCGCCCA TCCTGACTTT GGCGTTGCGC TGGAGCGTAC AGTGACCGAG CTACTCCAGG GACGCGGTCT GAAAGATCTT GATGTCTTCA CGCCACCAAC GTTCGATGAT GAAGAAGTCG CGGAGCACAC TAATCTGGAG ACCCACTTCA TCGACTCCAG CGGCCTGATT TCCTGGGATC TGTTCAAACA GGACGCCGAT TATCCGTTCA CGGACTGGAG TTTTTCCGGC ACTACCGAAG AAGAATTCGC CACGCTGATG GCCATCTTTG CTGCTGAAGA TAAAGAAGTT TACATTGCCG ATTACGAGCA TCTCGGCGTA TACGCCTGTC GTATTATCGT ACCGGGAATG TCTGATATTT ATCCTGCCGA AGATCTGTGG CTGGCCAACA ACAATATGGG TAGCCATCTT CGTGAGACTC TGCTTTCGCT GCCCGGTAGC GCCTGGAATA AAGAAGATTA TCTCAATCTG ATTGAACAAT TGGATGAAGA AGGTTTTGAC GATTTCACCC GCGTGCGTGA ACTGTTGGGT CTGGCGACCG GAGCGGACAA TGGTTGGTAT ACACTGCGCG TCGGCGAATT AAAAGCAATG TTAGCGTTAG CGGGCGGCGA TTTGGAGCAG GCGCTAATCT GGACAGAATG GACGATGGAG TTCAATTCGT CGGTCTTTAG TCCGACACGC GCAAACTATT ACCGTTGCCT GCAAACTCTG CTGCTCCTGT CGCAAGAAGA TGCGCGTCAG CCACTGCAAT ATCTCAATGC TTTTATAAAA ATGTATGGCG CAGAGGCTGT AGAGGCCGCC AGCGCCGCGC TTAGCGGTGA AGCGGCTTTT TATGGACTAC CGGCTGTCGA CCACGATCTA CAAGCGTTCC CGGCGCATCA GTCCTTGTTA AAAGCGTATG ATAAATTACA GCGCGCGAAA GCGGCATACT GGTCAAAATA A
|
Protein sequence | MTQTFIPGKD AALEDSIARF QQKLLDLGFH IEEASWLNPV PNVWSVHIRD KECALCFTNG KGATKKAALA SALGEYFERL STNYFFADFW LGETVANGPF VHYPNEKWFP LTENDDVPEG LLDARLRAFY DPENELTGSQ LIDLQSGNEA RGVCGLPFTR QSDNQTVYIP MNIIGNLYVS NGMSAGNTRN EARVQGLSEV FERYVKNRII AESISLPEIP AEVMARYPAV MESIATLEAE GFPIFAYDGS LGGKYPVICV VLFNPANGTC FASFGAHPDF GVALERTVTE LLQGRGLKDL DVFTPPTFDD EEVAEHTNLE THFIDSSGLI SWDLFKQDAD YPFTDWSFSG TTEEEFATLM AIFAAEDKEV YIADYEHLGV YACRIIVPGM SDIYPAEDLW LANNNMGSHL RETLLSLPGS AWNKEDYLNL IEQLDEEGFD DFTRVRELLG LATGADNGWY TLRVGELKAM LALAGGDLEQ ALIWTEWTME FNSSVFSPTR ANYYRCLQTL LLLSQEDARQ PLQYLNAFIK MYGAEAVEAA SAALSGEAAF YGLPAVDHDL QAFPAHQSLL KAYDKLQRAK AAYWSK