Gene SeD_A1963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1963 
Symbol 
ID6872063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1894928 
End bp1896271 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content51% 
IMG OID642785082 
Productamino acid transporter 
Protein accessionYP_002215748 
Protein GI198243296 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.041678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000000765672 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAG TCAAAAAATT GTCTCTTACC GATTTAGTGC TTTATGCTCT GGTGTTTATG 
GTGCCTATCG CGCCAGTCTC TCTTTATGGC GTCGTTTATA ATCTTTCGCA CGGTATGGTT
GCGCTGGTTT ATATTATTGG CGCTATCGCG ATGTTTTTTA CCGCGTACAG CTACTCTACC
TTATCGCAGC ATATCTCCTC GTCGGGTTCG GCCTATGCGT ATGCTGGCGT ATGCATTAAT
CCCGCCGTAG GGTTTCTGAC CGGATGGATA TTACTGCTGG ATTATCTTTT GTTCCCTACG
CTCGTGGCAG TGCTGGGCGG CGTCGCGGTC CACGCTATTT TACCGCAAAT CCCGGTCTGG
GTCTGGCCGT TGATTTATGT GGCTATCGGC ACCGGAGTCA ACTATCTCGG GATACAGCAA
ACGGCGAAAT TCGATAAGTT ACTGGTCTTT ATTCAACTCG GTATTCTGGC GATTTTCGTG
CTACTCATTG TCCGTCTGAT GCTGCTGGAC AGTAGTCAGA TAACGCTTTC CTTCCGGCCT
TTTTTCGATT CACAGTGGTT CACCCCAGGG CTCATCGCGA CGGCGATTTC GGTGGCTGCG
CTGAACTTTC TCGGCTTTGA CGCTATCAGT ACATTAAGTG AAGAAAGCGA AGGCGGGGGA
CGCGCCGTCA GTAAAGCAAC GCTGTTGGCG TTAGTTTTGG CGACGGTACT GTTTATTATC
GTTGTGGCGT TTGCGGCGTT CGCCACCGGT AATGTCGACC GCTTCGCGGA AGGCAATGCG
ACGAATGAAG CATTTTTCAC CATTGCCGGC AACGTTGGCG GTATCTGGCT CAAAGTGGCA
TTTTCCATCA TTGTGGCGTT TGTGTGCGCC GTCGGCAACA TTATTACCGC GCAAACGGCG
GTGTCCAGAG TGCTATTTTC AATGGGGCGC GACCGAATGT TGCCCGCCTT CCTCGCTCAC
GTCCATACCA CGCGTAAAAC GCCGGATTAT GCCATTCTGT TTACCGGCGG CGTCACCCTG
CTGCTCAGCT ACCTGTTTTC CGGCAAGATT GAGTCTATCT CCACGCTGGT GAACTTTGGC
GCGCTTTTCG CTTTTTTTGT CGTGAGCCTG TGTGTGTTTA TCCTGTTCAA TTTCCGGATG
AAAGCCCAGC GGCGCATTTT CGCCCATGTC ATTTCGCCGA TCATGGGGAT GATCGTCATC
GGCTACGTCT GTTTAAACAT GAACATTCAC GCTCTGATAC TCGGAATTAG CTGGGCGGCG
ATCGGGATAG CGATTCTGTG CTATCGAAAA GCACATAACC AAAACATCGC CATCGACCTG
GAAGGCAAAA AGTTGCTCGA TTGA
 
Protein sequence
MKKVKKLSLT DLVLYALVFM VPIAPVSLYG VVYNLSHGMV ALVYIIGAIA MFFTAYSYST 
LSQHISSSGS AYAYAGVCIN PAVGFLTGWI LLLDYLLFPT LVAVLGGVAV HAILPQIPVW
VWPLIYVAIG TGVNYLGIQQ TAKFDKLLVF IQLGILAIFV LLIVRLMLLD SSQITLSFRP
FFDSQWFTPG LIATAISVAA LNFLGFDAIS TLSEESEGGG RAVSKATLLA LVLATVLFII
VVAFAAFATG NVDRFAEGNA TNEAFFTIAG NVGGIWLKVA FSIIVAFVCA VGNIITAQTA
VSRVLFSMGR DRMLPAFLAH VHTTRKTPDY AILFTGGVTL LLSYLFSGKI ESISTLVNFG
ALFAFFVVSL CVFILFNFRM KAQRRIFAHV ISPIMGMIVI GYVCLNMNIH ALILGISWAA
IGIAILCYRK AHNQNIAIDL EGKKLLD