Gene SeD_A2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2331 
Symbol 
ID6874213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2210817 
End bp2212088 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content36% 
IMG OID642785426 
Producthypothetical protein 
Protein accessionYP_002216086 
Protein GI198241973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.122664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000213226 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACCG CAACTGACTT TAAGACACTC CTCGACAATA TAAAAATAGA TAATGCAGGC 
CAGATTAGTA AAAGGTATGG TCGTATAACT AAGGCTTTGA ACCAATACTT TTATAACTTA
GATTCTAAGA CAGCCAATTC ACTACAGGTT GGTTCCTATG GGCGCTTCAC AGGGATTCGA
GGGATCTCTG ATCTTGATAT GCTTTACTTT CTACCTGCAA CTGCATGGCC AAGATTCCGA
GATCGACAAT CGTATTTATT ACAGGTTGTG AAAACAGAAA TCAAGAAAAC TTTCAAAAAT
ACAGATATTC GCGGTGATGG GCAAGTTGTT GTTGTTAAAT TTAAGAATCA AGAGGTTGAG
GTAGTTCCTG TATTCAGTAA TGAAGATGGC ACTTTTACAT ACCCGGATAC ACATGATGGT
GGATCGTGGA AGGTATGTAA CCCTAGGGCC GAAATGTCGT CTTTTAGGGC ACTGAATGAT
GATAGGAAGG GACATCTGAG ACGTCTATCT AAAATGATTC GAGCATGGAA AGCTCGTCAT
GAAGTTGAGA TAAGTGGATT CTTAATTGAT ACACTGTGTT ATAATTTTTT CTCTAATCTA
ACTGAATATG ATGATAAGAG TTTCAAAAGT TATGATCAAC TTTCGCTTGA TTTTTTCACT
TTCTTAGAGA ATGAAGGTGA CCGAGTATTT TATTATGCTC CCGGTAGTCG CTCAAAAGTG
AGCGTAAAAA AATCATTTAA TAAAGTAGCA AAATTAACAA AAGAATATTG TGAAGAAGCT
TTATCTGCTA CAAGTGAAAA CTCAAGAAAC TTAGCTTGGA AAAAAGTTTT TGGCAGGCCT
TTTCCAAATT ATACGACAAA AGCACTCAGT AATGTCAATG TATCTGAACA GTTTATTGAA
GATCAATATG AGATGAATTT ATATGGTCAT GTTTCGATAG AATGTGAGAT TAGAAAGAAT
AATTTACTGG AAGCTCTTCT TTCAAATCTT CTTGGCGAAG GGCATGATAT TAGCACGAAT
CGCAAGTTAA GATTCTATGT TGATGAGATA AATAATATAT CTCACCCATA TAAGATTAAA
TGGAAAATAA AAAATGTAGG TGATGAAGCT GAGCGCCGAG GAAATGTTAG AGGTGAAATT
TTAGACGATG AAGGCGGTTC TGAGCGTTTC GAGACCGCAG ATTTCTCAGG ACCTCATTTT
GTTGAATGTT ATGTTATTTA TGGTAATCAA GTTGTAGCAA GAGATAGAAT CGACGTACCT
ATACATAATT AG
 
Protein sequence
MSTATDFKTL LDNIKIDNAG QISKRYGRIT KALNQYFYNL DSKTANSLQV GSYGRFTGIR 
GISDLDMLYF LPATAWPRFR DRQSYLLQVV KTEIKKTFKN TDIRGDGQVV VVKFKNQEVE
VVPVFSNEDG TFTYPDTHDG GSWKVCNPRA EMSSFRALND DRKGHLRRLS KMIRAWKARH
EVEISGFLID TLCYNFFSNL TEYDDKSFKS YDQLSLDFFT FLENEGDRVF YYAPGSRSKV
SVKKSFNKVA KLTKEYCEEA LSATSENSRN LAWKKVFGRP FPNYTTKALS NVNVSEQFIE
DQYEMNLYGH VSIECEIRKN NLLEALLSNL LGEGHDISTN RKLRFYVDEI NNISHPYKIK
WKIKNVGDEA ERRGNVRGEI LDDEGGSERF ETADFSGPHF VECYVIYGNQ VVARDRIDVP
IHN