Gene SeD_A1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1389 
Symbol 
ID6875060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1360455 
End bp1361453 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content53% 
IMG OID642784555 
Producthypothetical protein 
Protein accessionYP_002215225 
Protein GI198244936 
COG category[S] Function unknown 
COG ID[COG3756] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value1.16332e-16 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGCAC TTCCATACAT GCAGCTTTAC ATCGCTGATT ATCTGGCGGA CACCATGCAC 
CTTTCTGCCG AGGAGCATGG AGCCTATTTG TTGTTGATGT TCAATTACTG GCAGACCGGA
AGAGCTATCC CGAAAAACAG GCTGGCAAAA ATTGCTCGGA TTAGCAGTGA ACGATGGGGG
GCTGTGGAAG AGTCCCTGAG AGAATTTTTC ATTGATAACG GCACTGAATG GACTCATGAG
CGTATCGAAA ATGATCTCGC TGCGGTCAGG GATGTTCTGG CGAAAAAGTC GGCAGCAGGG
AAAGCATCTG TTCAGTCCAG AAGGAACAGG AAGAAAACGC AGGCCGCCAG TGGAAGTAAC
ACATGTTCAA CAGGTGTTGG TTCGGTGTTT AAACAGGAAG CCAACAAAAA GGGAACTAAT
AAAGATATAG ATCTAAAAGA ATTAAACCCC ACACATAACG CGTGCGCGCG CGCGAGTGCT
CCGGTTAGTC AGCCTGGAAT TATGCAACAG CCTGTCGTGA CTGAACCGGA ATACCTGAAC
GAGCCGATCG GGAAATTCTC AATGATGGAT GACTGGCATC CCTCGCTGGA TTTCCGACAA
CGGGCCGCCC ATTGGGGCGT TGCGTTACCA GAGCCGGAGT ATTTACCTAC GGAGCTTGTC
GCGTTCAGGG ATTACTGGAC GTCGGAGGGA AAGGTGTTCA CACAAATCCA GTGGGAACAA
AAATTCGCCC GTCACGTAAA CCACGTCAGG GCAAAGGCGA AACCAGCCAG CAGGGGAGAA
AGCCATGCAG AAATCCAGCC AGACAGCACC GCATCGCGGG CAGTACAGCA AATCAGGGCA
GCCCGCGTGC AGTGGGAACG CGAAAACGGG ATCGCCAGCG ACGGAGACGG CCTGGCGACT
CTGGGAAGTC ATGGGGGAAA TTTATTCGAA CCGATGGACG CAGAAGAACG GCGCGGCACC
TTCGAAGCTG TGGGTGGCCC AGATTGGGGC GATGACTGA
 
Protein sequence
MAALPYMQLY IADYLADTMH LSAEEHGAYL LLMFNYWQTG RAIPKNRLAK IARISSERWG 
AVEESLREFF IDNGTEWTHE RIENDLAAVR DVLAKKSAAG KASVQSRRNR KKTQAASGSN
TCSTGVGSVF KQEANKKGTN KDIDLKELNP THNACARASA PVSQPGIMQQ PVVTEPEYLN
EPIGKFSMMD DWHPSLDFRQ RAAHWGVALP EPEYLPTELV AFRDYWTSEG KVFTQIQWEQ
KFARHVNHVR AKAKPASRGE SHAEIQPDST ASRAVQQIRA ARVQWERENG IASDGDGLAT
LGSHGGNLFE PMDAEERRGT FEAVGGPDWG DD