Gene SeD_A4017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4017 
Symbol 
ID6875896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3860505 
End bp3862196 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content52% 
IMG OID642786970 
Productphosphoethanolamine transferase 
Protein accessionYP_002217597 
Protein GI198245589 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.363242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATACA TTAAATCGAT GACGCAACAG AAACTTAGTT TCTTGCTTGC GCTCTATATC 
GGTCTGTTTA TGAATTGCGC CGTGTTTTAC CGCCGTTTCG GCAGTTATGC GCAAGAATTT
ACCATTTGGA AAGGCCTCTC CGCAGTTGTC GAACTGGGCG CCACGGTGCT GGTCACTTTC
TTCTTACTTC GTCTTCTTTC ACTGTTTGGC CGACGCGTCT GGCGTGTGCT GGCCACGCTG
GTGGTGCTGT TTTCCGCTGG CGCCAGTTAT TACATGACCT TCCTGAACGT GGTGATTGGC
TACGGCATTA TTGCGTCTGT TATGACCACC GATATCGATC TCTCGAAAGA GGTGGTGGGG
CTGCACTTTG TATTGTGGCT GATTGCCGTG AGCGTGCTTC CGCTCATCTT TATCTGGAGT
AACCACTGTC GCTACACGTT GTTGCGCCAG CTACGTACGC CGGGGCAGCG TTTTCGCAGC
GCCGCTGTAG TGGTACTCGC AGGCGTAATG GTGTGGGCGC CTATCCGCCT GCTGGATATA
CAGCAAAAAA AGGTTGAACG GGCGACAGGC ATCGACTTAC CCAGCTATGG CGGCGTGGTG
GCGAACTCCT ATCTGCCTTC AAACTGGTTA TCTGCGTTAG GGCTGTATGC CTGGGCGCAG
GTAGATGAAT CGTCGGACAA TAATTCGTTA ATAAACCCGG CCAGGAAATT TACCTATGTT
GCGCCGAAAG ATGGGGATGA CACCTACGTC GTTTTCATTA TCGGTGAGAC GACCCGTTGG
GATCACATGG GGATTTTCGG CTACGAGCGT AATACCACGC CGAAGCTGGC GCAGGAAAAA
AATCTGGCGG CATTCCGCGG CTATTCCTGC GATACCGCGA CGAAGCTTTC TTTACGCTGT
ATGTTTGTAC GGGAAGGTGG GGCGGATAAT AACCCGCAGC GTACGTTAAA AGAGCAGAAT
GTTTTTGCCG TACTCAAACA GCTCGGATTC AGTTCCGATC TGTACGCCAT GCAGAGCGAG
ATGTGGTTTT ATAGCAATAC CATGGCGGAT AATATCTCCT ACCGCGAGCA GATTGGCGCC
GAGCCGCGTA ACCGTGGTAA AACGGTTGAT GACATGCTGT TGATTGATGA GATGCAGAAC
TCGCTTGCCC AGAACCCGGA GGGTAAACAT CTGATTATCC TGCATACTAA GGGATCGCAT
TTTAACTATA CGCAACGTTA TCCGCGCAGC TACGCTCAGT GGAAGCCCGA ATGTATTGGG
GTCGATAGCG GCTGCACGAA AGCGCAGATG ATCAACTCTT ACGATAACTC CGTGACCTAT
GTTGATCACT TTATTACCAG TGTGTTCGAT CAGCTACGTG ATAAAAAAGC GATTGTGTTC
TACGCCGCTG ACCACGGCGA GTCGATTAAC GAACGTGAAC ATTTGCACGG TACGCCGCGC
AATATGGCGC CGCCGGAACA ATTCCGTGTT CCGATGCTGG TATGGATGTC GGATAAATAT
CTTGCCAGTC CGCAACATGC GCAGATGTTT GCTCACCTGA AACAGCAGGC GGAGATCAAA
GTGCCGCGTC GTCATGTGGA ACTGTACGAT ACGATAATGG GCTGCCTGGG GTATACATCG
CCGAATGGCG GCATTAACCA GAACAACAAC TGGTGCCATA TTCCCGATGC GCAGAAAGTC
GCCGCGAAGT AG
 
Protein sequence
MRYIKSMTQQ KLSFLLALYI GLFMNCAVFY RRFGSYAQEF TIWKGLSAVV ELGATVLVTF 
FLLRLLSLFG RRVWRVLATL VVLFSAGASY YMTFLNVVIG YGIIASVMTT DIDLSKEVVG
LHFVLWLIAV SVLPLIFIWS NHCRYTLLRQ LRTPGQRFRS AAVVVLAGVM VWAPIRLLDI
QQKKVERATG IDLPSYGGVV ANSYLPSNWL SALGLYAWAQ VDESSDNNSL INPARKFTYV
APKDGDDTYV VFIIGETTRW DHMGIFGYER NTTPKLAQEK NLAAFRGYSC DTATKLSLRC
MFVREGGADN NPQRTLKEQN VFAVLKQLGF SSDLYAMQSE MWFYSNTMAD NISYREQIGA
EPRNRGKTVD DMLLIDEMQN SLAQNPEGKH LIILHTKGSH FNYTQRYPRS YAQWKPECIG
VDSGCTKAQM INSYDNSVTY VDHFITSVFD QLRDKKAIVF YAADHGESIN EREHLHGTPR
NMAPPEQFRV PMLVWMSDKY LASPQHAQMF AHLKQQAEIK VPRRHVELYD TIMGCLGYTS
PNGGINQNNN WCHIPDAQKV AAK