Gene SeD_A3878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3878 
Symbol 
ID6874739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3706882 
End bp3707808 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content52% 
IMG OID642786840 
Producthypothetical protein 
Protein accessionYP_002217468 
Protein GI198245712 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCT CAACAACATC CACGCCGCAT GACGCGGTAT TCAAACAGTT TCTATGCCAC 
CCCGATACTG CACGGGATTT TTTGGAAATT CATCTCCCGA CGACATTACG TCAAATCTGT
AATCTGAATA CGTTACGGCT GGAGTCCGGT AGCTTTATTG AAGAGGATTT ACGCCCCCAT
TATTCCGATA TTCTTTGGTC GCTGGAAACA AGTGAAGGTG AAGGTTACAT TTACGTGGTT
ATTGAACATC AGAGTACGCC GGACGCGCAT ATGGCATTTC GGCTGATGCG TTACGCAATG
GCTGCAATGC AACGGCACCT GGAGGCCGGG CATAAGACGT TGCCGTTAGT GGTGCCAATG
CTGTTTTATC ACGGAAACCG CAGCCCGTAT CCGTTTTCAT TATGCTGGCT GGATGAATTT
GCCGACCCGG TGATGGCGCG TAAGCTATAC GCCACCGCCT TTCCTCTGGT CGATATTACG
GTCGTGCCGG ACGACGAGAT TATGCGGCAC CGACGGGTCG CGCTGCTGGA ACTCATACAA
AAACACATCC GCCAGCGTGA TCTGATGGGG CTTGTCGAAC AGCTGGTCGC CCTGCTGGTT
AAGGGATACG CTAATGACAC CCAGCTTCAA AGTTTGTTTA ATTACATGAT GCACACTGGC
GACGCCGCGC GCTTCAATAC GTTTATCCGC CAGGTGGCTA TGCGTATCCC ACAGCATAAG
GAGAAGATCA TGACTATCGC AGAAAGATTA CGTCAGGAAG GACATCGTAA CGGGTTACAG
AAAGGGCTAC AGCAAGGCAA GCAGGAAGGC CAACGGCTCG CCGCATTGCG CATTGCCCGC
TCGATGCTAA ACGATGGTTT CGATCGCGAT ACCGTGCTTA GGGTTACCGG GCTGGCGCCT
GCCGATCTGG CGTCTGAAAA CCATTAA
 
Protein sequence
MATSTTSTPH DAVFKQFLCH PDTARDFLEI HLPTTLRQIC NLNTLRLESG SFIEEDLRPH 
YSDILWSLET SEGEGYIYVV IEHQSTPDAH MAFRLMRYAM AAMQRHLEAG HKTLPLVVPM
LFYHGNRSPY PFSLCWLDEF ADPVMARKLY ATAFPLVDIT VVPDDEIMRH RRVALLELIQ
KHIRQRDLMG LVEQLVALLV KGYANDTQLQ SLFNYMMHTG DAARFNTFIR QVAMRIPQHK
EKIMTIAERL RQEGHRNGLQ KGLQQGKQEG QRLAALRIAR SMLNDGFDRD TVLRVTGLAP
ADLASENH