Gene SeD_A2286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2286 
Symbol 
ID6873285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2176860 
End bp2178002 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content50% 
IMG OID642785384 
Productputative antirepressor 
Protein accessionYP_002216046 
Protein GI198245401 
COG category[S] Function unknown 
COG ID[COG3646] Uncharacterized phage-encoded protein 
TIGRFAM ID[TIGR02681] phage regulatory protein, rha family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.36993 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.73451e-22 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAGTT TAACGGTAAA TAACCGTTTG TCGCAACAAC CGGGGATGTA TGAGTACCGG 
CCGTTGCGTC ATGAATGCAG ATTACCAAAT AGCCTGGTCG TGCGTAACCA CAGGGAACAC
AGCCTGACCG TGGGGGATGA ATCGTGCAGG AACTTAACCG CTGGTTTCGG GATGGAAGGG
GACTTTATGT CCATGTTATT CGCTGGGAAC CAGAAACTGA GCGCGTTATC TATCTGCGCA
AGGGCTACCC GCATGAGTGT TTTAGCCCTT TGTGGAAATT CAGGCGTGAT TTTGTTGAGT
GTGAAGCGCC AGGAACACAT TGATTCTGCA ATTCCGGGAC GTTACACTGT TCAGGCACCT
CATAAAGCGG GTGCCGGGAT TGGCGTCCTG GAATTGCATA CGGCGACAAT GGGCGCGTTA
GCGTCTTTTT TGTTGCTACA GCTCAGCTAT ACCCAAATTA TGGTGGGCTG GGTGGGGGCA
CCGAAAGGTG CGCCGGTTTC CGTATGCGCC GGTTACGCCA ACCCTGCTCA GTTCACCACC
AGCGAAATTG GCGTTTCCGG TGGTGGAAGT TATCCATTGC ATACGGAGGC TGCCATCATG
GCTACGATCC CTGCCTTAGT ACAACCTGAA CTTTGCATTA TTGCAGGCAA AGTTGTTACT
TCTTCTCTGG CTGTTGCTAG TTATTTCGGC AAACAACACA AAAATGTCAT TCAAAAAATT
GCGTCTCTTG AATGCTCTGC CGAATTTACT GAGCTGAATT TTCAGCTCAG TGAGTACATC
GACGCATCAG GCCGCAAACT ACCTTGCTAT CAAATAACCC GCGACGGCTT TGCTTTCCTT
GCTATGGGCT TTACGGGCAA ACGCGCCGCC CAATTCAAAG AGGCATACAT CAATGCCTTT
AACCAGATGG AGAAACAGCT TTCAAAGCCG TCGGTGCTGA GCGATGCAGC ACATAATGCC
AGCGTTCTCT ATTCCTACAT TTCATCCATT CATCAGGTCT GGTTACAGCA GCTTTATCCC
ATGCTGGAAA AAGTGGAATC TCCGCTGGCC GTAAGCCTGT ACGACCGCAT CAATGACGCT
GCGGCGCTTG CGAGCCTTAT CAATATGACA CTGAACCGTT CAGAGGTAAG GGGGCGCAAA
TGA
 
Protein sequence
MNSLTVNNRL SQQPGMYEYR PLRHECRLPN SLVVRNHREH SLTVGDESCR NLTAGFGMEG 
DFMSMLFAGN QKLSALSICA RATRMSVLAL CGNSGVILLS VKRQEHIDSA IPGRYTVQAP
HKAGAGIGVL ELHTATMGAL ASFLLLQLSY TQIMVGWVGA PKGAPVSVCA GYANPAQFTT
SEIGVSGGGS YPLHTEAAIM ATIPALVQPE LCIIAGKVVT SSLAVASYFG KQHKNVIQKI
ASLECSAEFT ELNFQLSEYI DASGRKLPCY QITRDGFAFL AMGFTGKRAA QFKEAYINAF
NQMEKQLSKP SVLSDAAHNA SVLYSYISSI HQVWLQQLYP MLEKVESPLA VSLYDRINDA
AALASLINMT LNRSEVRGRK