Gene SeD_A3027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3027 
Symbol 
ID6873202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2924924 
End bp2927731 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content58% 
IMG OID642786058 
Productphage tail tape measure protein 
Protein accessionYP_002216704 
Protein GI198244813 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.000444183 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA GCTCACCCGC 
CCATTTCGAT CCGCGCAGGC CAGCTCAAAA GAGCTGGCTG CAGCCATTCA GCAAAGCCGC
GCCCGTTTAA AAGAATTAGA TGCTCAGGCG GGCCGCATTG ACGGTTTCCG CAAGGCCAGC
GCGCAGCTGG CAGTCACCGG TAACAGCCTG AAAGCCGCAC GCGAAGAAGC TGCGAAACTT
GCCACGCAAT TCTCTGCCAC CAACCGCCCG ACGGCGGCGC AGGCACGGCT GCTTGAGCAG
GCAAAAAACC GCGTTACGGA GTTACAGAGC AAATATAACG GTCTACGTCA GTCGGTGCAG
CGCCAGCGTC TTGCGCTCAA TGAAGCCGGA CTGGACACGA AAAAGCTCAG TAGTGCGCAG
CGGGAACTGC GGCAGAATGC CGACGAAACC CGGCAGGCCC TGGACCGGCA GCAGAAATCC
CTTAAAAGCT TGGGCGAACA GCAGGCGCGA ATGAACGCCG TCCGCGATCA GTATTCGCGC
CGTCTTGAGG TGCGGGATCG CATCGCCGGG GCAGGGGCTA CCACTACGGC TGCAGGGGGG
GCAATGGGCG CACCTGTTGT GGCGGCAGTT AAGAGCTACG CCAGCATGGA AGATGCCATG
AAAGGTGTGG CAAAGCAGGT AAACGGGCTG CGGGACGATA ATGGCAACCG CACAAAACAG
TTTTATGACA TGCAGGATGC CATCAAGGCC GCCAGCGAAC AGCTACCGAT GGAGAACGGC
GCTATAGACT ATGCCGCGCT GGTTGAAGGT GGTGCTCGCA TGGGGGTGAC CAATCAGGAC
GATCCTTACG AAGAGCAGAA ACGTGACCTG CTGGCTTTTG CATCCACGGC GGCAAAAGCG
GCAACGGCCT TTGAGCTGCC CGCAGATGAA CTGGCAGAAG GACTGGGGAA AATCGCGCAG
CTCTATAAAG TTCCGACGCG CAATATTGAA CAACTGGGCG ATGCGCTGAA CTACTTGGAC
GATAACGCCA TGTCAAAGGG TGGGGACATT ATCAACGTCC TGCAGCGTAT GGGGGGCGTG
GCTGACCGCC TTGACTTCCG AAAGGCCGCG GCGCTGGGTT CAACATTCCT TTCTCTTGGG
GCTGCCCCGG AAATCGCCGC CAGCGCCTCT AATGCCATGG TGCGTGAACT GTCCATTGCT
ACCATGCAAA GTAAACGATT TTTTGAAGGC ATGAATCTGT TGCAACTCAA TCCGGCGGAG
ATTGAAAAGC AGATGACCAC CGATGCCATG GGCACAATTC AGCGGGTTCT GGAGAAGGTC
AACAATCTGC CGCAGGATAA ACGCTTGTCA GCCATGACAA TGATTTTTGG CAAAGAGTTT
GGCGATGATG CGGCAAAGCT GGCTAACAAC CTGCCGGAGC TGCAGCGCCA GCTGAAACTC
ACATCAGGTA GTGGTGCTAA TGGCTCTATG CAGAAAGAAT CCGACATTAA CAAGGATTCA
TTGTCTGCGC AGTGGTTGCT GGTTAAGACG GGCGCGCAGA ACGCTTTCAG CAGCCTGGGG
GAAACGTTGC GCCAGCCGCT GATGGATATT ATGGGCATGG TTAAGCGCGT GACCGGGGCG
TTGCGTCGCT GGGTTGAACA GAATCCCGTG CTGGCTGGCA CGCTGATGAA AGTGGCGGCA
GCTACGGCAG CCATTACTGT TGGGTTGGGG GCGCTGGCAG TGGCGGTGGC TGCGGTACTA
GGGCCGCTGG CGGTTATCCG GTTTGGCCTG TCAATGCTGT CAGTTAAAGC GTTACCTTCT
GCAGCCGCCG CTGCCACACG TACAGGTAGC GTGCTGCGTT TGTTGATCTC TGGTCCGCTG
GCTTTGCTGC GCGTGGCATT ATTTGCTGTT GGTAGCCTGC TGGGTGCGCT GCTCAGTCCT
GTAGGGCTGG TTGTGGCTGC ACTGGCAGGC GTGGCGCTGG TTATCTGGAA ATACTGGCAG
CCCATTAGTG CATTTCTGGG GGGCGTGGTG GAAGGGTTCA GAGCCGCTGC TGCGCCCATC
AGCGCCGCCT TTGAGCCGCT CAGACCCGTG TTTCAGTGGA TTGGTGACAG GGTGCAGGCC
TTGTGGGGCT GGTTCAATGA TTTACTTACC CCGGTTAAAT CCACTTCCGA AGAACTGAAC
AGCGCAGCTG CAATGGGGCG TCGGTTTGGT GAGGCGCTGG CGGAAGGTCT GAATATGGTG
ATGCACCCAC TTGAGTCACT TAAATCCGGT GTGTCATGGC TGCTGGAAAA GCTCGGTATT
GTCAGTAAGG AGGCGGCAAA GGCGAAACTA CCTGCGCAGG TTACGCAGCC GCAGTCCGCC
ACAGTGAACA GTGACGGCAA AGTGGTGCTG CCGCCAGGCG GGTTCCCGGC TTACGCGGGG
ATGTATGACA CGGGCGGGAT CATTCCACGC GGGCAGTTTG GCATTGTCGG AGAAAATGGC
CCTGAAATTG TGAACGGACC GGCAAATGTT ACCAGCAGGC GGCGTACTGC TGCACTGGCC
TCTGTCGTTG CTGGCGTGAT GGGGGGAGCT GCGACACCTG CAGAAGCGGC TCCGCTTCAT
CCGTTCAGTT TGCCTGCGAG GGCATACCAG CCCCCGCTTG CTAAGGCAGA TAGCCCGCCG
CCGGTTATTC GTTATGAGAT AAATGCGCCC ATTCATATTG TCGCTCAGCC TGGGCAGAAC
GCGCAGGATA TTGCCCGTGA AGTGGCACGC CAGCTTGACG AACGGGAGCG CCGGGCCAGG
GCAAAAGCAC GCAGCAATTT CAGCGATCAG GGGGGGTATG AATCATGA
 
Protein sequence
MSDNNLRLQV ILNAVDKLTR PFRSAQASSK ELAAAIQQSR ARLKELDAQA GRIDGFRKAS 
AQLAVTGNSL KAAREEAAKL ATQFSATNRP TAAQARLLEQ AKNRVTELQS KYNGLRQSVQ
RQRLALNEAG LDTKKLSSAQ RELRQNADET RQALDRQQKS LKSLGEQQAR MNAVRDQYSR
RLEVRDRIAG AGATTTAAGG AMGAPVVAAV KSYASMEDAM KGVAKQVNGL RDDNGNRTKQ
FYDMQDAIKA ASEQLPMENG AIDYAALVEG GARMGVTNQD DPYEEQKRDL LAFASTAAKA
ATAFELPADE LAEGLGKIAQ LYKVPTRNIE QLGDALNYLD DNAMSKGGDI INVLQRMGGV
ADRLDFRKAA ALGSTFLSLG AAPEIAASAS NAMVRELSIA TMQSKRFFEG MNLLQLNPAE
IEKQMTTDAM GTIQRVLEKV NNLPQDKRLS AMTMIFGKEF GDDAAKLANN LPELQRQLKL
TSGSGANGSM QKESDINKDS LSAQWLLVKT GAQNAFSSLG ETLRQPLMDI MGMVKRVTGA
LRRWVEQNPV LAGTLMKVAA ATAAITVGLG ALAVAVAAVL GPLAVIRFGL SMLSVKALPS
AAAAATRTGS VLRLLISGPL ALLRVALFAV GSLLGALLSP VGLVVAALAG VALVIWKYWQ
PISAFLGGVV EGFRAAAAPI SAAFEPLRPV FQWIGDRVQA LWGWFNDLLT PVKSTSEELN
SAAAMGRRFG EALAEGLNMV MHPLESLKSG VSWLLEKLGI VSKEAAKAKL PAQVTQPQSA
TVNSDGKVVL PPGGFPAYAG MYDTGGIIPR GQFGIVGENG PEIVNGPANV TSRRRTAALA
SVVAGVMGGA ATPAEAAPLH PFSLPARAYQ PPLAKADSPP PVIRYEINAP IHIVAQPGQN
AQDIAREVAR QLDERERRAR AKARSNFSDQ GGYES