Gene SeD_A1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1967 
Symbol 
ID6874185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1899023 
End bp1900024 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content58% 
IMG OID642785086 
ProductErfK/YbiS/YcfS/YnhG family protein 
Protein accessionYP_002215752 
Protein GI198245212 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00349723 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0000000102388 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGTG CGTCTTTCAT TACGCTAACT ATTATCGGCG CGTATAGCGC GTTACAGGCA 
GCCTGGGCGG TTGATTATCC ATTACCGCCC GAAGGCAGCC GTCTTATTGG TCAGAATCAA
ACCTATACCG TACAGGAAGG TGATAAAAAC CTGCAGGCTA TCGCCCGGCG TTTTGATACG
GCGGCGATGC TCATTCTTGA AGCGAACAAT ACGATTGCGC CGGTGCCTAA GCCCGGTACG
CTAATTACCA TTCCCTCGCA GATGCTATTG CCGGATGCGC CCAGGGAAGG CGTTATCGTC
AACCTCGCGG AGCTACGGCT GTATTATTAC CCGCCGGGAG AAAACCGCGT TCAGGTGTAT
CCCATCGGCA TCGGTTTGCA GGGACTGGAA ACTCCCGTCA TGGACACCCG GATAGGGCAA
AAGATCCCCA ACCCGACGTG GACGCCGACG GCGGGCATAC GCCAACGTTC GCTTGAGCGG
GGGATCACGC TGCCGCCGGT GATCCCTGCC GGGCCAAATA ACCCGCTGGG ACGCTATGCG
CTGCGTCTGG CGCACGGTAA TGGGGAATAT CTCATTCATG GCACCAGCGC GCCGGATAGC
GTAGGTCTGC GCGTGAGTTC CGGTTGCATT CGCATGAACG CGCCGGACAT CAAAGCGTTA
TTTGCGCAGG TCAGAACGGG GACGCCGGTA AAAGTGATTA ACCAGCCGGT GAAATTCTCT
GTCGAGCCGA ATGGCATTCG TTATGTAGAG GTGCACAGGC CGCTATCGCC GGAAGAAGAG
CAAAACGTGC AGACGATGCC CTACGTGTTG CCTGCGGAAT TTACCGCGTT CAGAAACGCG
CAAGGGGTGG ATAGTCGCCT GGTCGATAAG GCGCTATACC GGCGAGCCGG GTATCCAGTC
AGCGTGAGCG CCGGGCAGAC GCCAGCGGTA AATACGACCG CAGTCGAATC CGCTCAGAAC
GGTTTTGTCG GGGAAGAGGG GCAAACGCGC GCGACGCAGT AG
 
Protein sequence
MKRASFITLT IIGAYSALQA AWAVDYPLPP EGSRLIGQNQ TYTVQEGDKN LQAIARRFDT 
AAMLILEANN TIAPVPKPGT LITIPSQMLL PDAPREGVIV NLAELRLYYY PPGENRVQVY
PIGIGLQGLE TPVMDTRIGQ KIPNPTWTPT AGIRQRSLER GITLPPVIPA GPNNPLGRYA
LRLAHGNGEY LIHGTSAPDS VGLRVSSGCI RMNAPDIKAL FAQVRTGTPV KVINQPVKFS
VEPNGIRYVE VHRPLSPEEE QNVQTMPYVL PAEFTAFRNA QGVDSRLVDK ALYRRAGYPV
SVSAGQTPAV NTTAVESAQN GFVGEEGQTR ATQ