Gene SeD_A3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3050 
Symbol 
ID6873601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2941621 
End bp2943387 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content56% 
IMG OID642786080 
Productterminase, ATPase subunit 
Protein accessionYP_002216726 
Protein GI198245759 
COG category[S] Function unknown 
COG ID[COG5484] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.00590407 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACACCA CACTAACATC CGCAGAGCTC GATCCCCGTC GGCAGGCCAT GCTGCTGTAC 
TTTCAGGGAT ACCGCGTAGC CCGCATTGCT GAAATGCTGG GCGAGAAAGT TGCAACCGTT
CACAGCTGGA AAAAACGCGA CAAGTGGGGT GACTATGGGC CGCTGGATCA GATGCAGCTC
ACCACCGCCG CACGCTACTG CCAGCTCATT ATGAAGGAGC ACAAAGAAGG GAAAGATTTC
AAAGAGATTG ACCTGCTGGC GCGCCAGTCG GAGCGCCACG CGCGCATCGG CAAGTTTAAC
AATGGCGGTA ACGAAGCCGA CTTAAACCCA AACGTCGCCA ACCGCAACAA AGGCCCGCGC
CGTCAGCCGG AAAAGAATGT TTTCACCGAT GAACAGATTG AGAAGCTGGA AGAAATCTTC
CATTCCTCCA TGTTCAACTA CCAGCGCCAC TGGTGGGAAG CTGGAAAAAC CAACCGCATC
CGCAACCTGC TGAAGTCACG CCAGATCGGC GCGACCTTTT ACTTTGCCCG TGAAGCCCTG
ATTGACGCTC TGCTTACCGG ACGTAACCAG ATTTTTCTTT CCGCCAGCAA GGCACAGGCT
CACGTCTTTA AGCAGTACAT CATAGACTTC GCCAAAGAAG TCGAGGTGGA GCTGAAAGGC
GATCCGATGG TGCTTCCTAA CGGAGCCACG CTTTACTTCC TCGGCACCAA TGCCCGCACG
GCCCAGAGTT ACCACGGCAA CCTGTATCTG GATGAATATT TCTGGATACC GAAATTCCAG
GAGCTGCGCA AAGTGGCTTC CGGTATGGCT ATTCACAAAA AATGGCGACA AACCTATTTT
TCCACGCCAT CCAGCCTGAC ACACAGTGCT TATCCGTTCT GGTCCGGTGC GCTGTTCAAC
CGTGGGCGCA ATAAAGCCGA TAAGGTGGAC ATCGACCTGT CCCACAGCAA TCTGGCCCCC
GGCCTGCTGT GCGCAGACGG GCAATACCGC CAGATAGTCA CCGTGGAAGA TGCAGTGCGC
GGCGGATGTA ACCTTTTCGA CCTCGATCAG TTGCGCATGG AGTACAGCCC GGACGAATAC
CAGAACCTGC TGATGTGCGA GTTTGTGGAC GATCTCGCGT CCGTGTTCCC GCTCAGCGAG
CTGCAGGCGT GCATGGTGGA CAGTTGGGAA GTCTGGACCG ACTTTCATGC ACTGGCCCTG
CGCCCGTTTG GCTGGCGCGA AGTGTGGATC GGTTATGACC CGGCAAAAGG TACGCAAAAC
GGCGACAGCG CCGGGTGCGT GGTGGTGGCA CCGCCAGCCG TGCCGGGCGG TAAGTTCCGC
ATTCTTGAGC GTCACCAGTG GCGCGGGATG GACTTCCGCG CCCAGGCTGA CGCCATCAAA
AAACTGACTG AACAGTACAA CGTGACATAC ATCGGTATCG ACTCAACCGG CGTCGGTCAC
GGGGTTTACG AGAACGTGAA AGCGTTTTTT CCTGCCGTCC GGGAGTTTGT CTACAACCCC
AACGTTAAAA ACGCCCTGGT ACTCAAGGCC TACGACATTA TTAGCCACCG CCGCCTGGAG
TTTGACGCCG GGCACACCGA CATAGCGCAG TCCTTTATGG CAATCCGTCG CGCCACCACT
GCCAGCGGCA ACCGCCCGAC CTATGAAGCC AGCCGCAGCG AAGAAGCCAG CCACGCCGAT
CTGGCCTGGG CAACAATGCA CGCACTGTTT AACGAGCCGC TGCAGGGCGA GTCCGCCAAT
ACCAGCAATA TTGTGGAGAT TTTTTGA
 
Protein sequence
MNTTLTSAEL DPRRQAMLLY FQGYRVARIA EMLGEKVATV HSWKKRDKWG DYGPLDQMQL 
TTAARYCQLI MKEHKEGKDF KEIDLLARQS ERHARIGKFN NGGNEADLNP NVANRNKGPR
RQPEKNVFTD EQIEKLEEIF HSSMFNYQRH WWEAGKTNRI RNLLKSRQIG ATFYFAREAL
IDALLTGRNQ IFLSASKAQA HVFKQYIIDF AKEVEVELKG DPMVLPNGAT LYFLGTNART
AQSYHGNLYL DEYFWIPKFQ ELRKVASGMA IHKKWRQTYF STPSSLTHSA YPFWSGALFN
RGRNKADKVD IDLSHSNLAP GLLCADGQYR QIVTVEDAVR GGCNLFDLDQ LRMEYSPDEY
QNLLMCEFVD DLASVFPLSE LQACMVDSWE VWTDFHALAL RPFGWREVWI GYDPAKGTQN
GDSAGCVVVA PPAVPGGKFR ILERHQWRGM DFRAQADAIK KLTEQYNVTY IGIDSTGVGH
GVYENVKAFF PAVREFVYNP NVKNALVLKA YDIISHRRLE FDAGHTDIAQ SFMAIRRATT
ASGNRPTYEA SRSEEASHAD LAWATMHALF NEPLQGESAN TSNIVEIF