Gene SeD_A3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3033 
Symbol 
ID6872102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2930573 
End bp2932258 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content52% 
IMG OID642786063 
Producttail fiber domain-containing protein 
Protein accessionYP_002216709 
Protein GI198243132 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.000290271 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCGCAA AATTTTATAC CCTGCTGACG GAGATCGGCG CGGCGAAACT GGCAAGCGCC 
GCCGCGCTCG GTGTCCCGCT GAAAATTACC CATATGGCGG TGGGCGACGG TGGCGGTGTG
CTGCCCACAC CCAGCGCGCA ACAGACCGCG TTAGTTGCTG AGAAGCGTCG AGCAGCGCTG
AATATGCTGT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCTGAGCA AGTGATCCCG
GAAACTGAGG GGGGATGGTG GATTCGTGAG GTCGGCCTGT TTGATGAAAC CGGCGCACTG
ATCGCCGTGG GTAACAGCCC TGAGAGCTAC AAGCCGCAGC TGACAGAAGG GAGCGGACGT
ACGCAGACCG TGCGCATGGT ACTGATTACC AGCAGCACCG ATAACATCAC CCTGAAAATT
GATCCTGCAG TAGTACTGGC AACCCGTAAA TATGTAGATG ATAAGGCGCT GGAGCTGAAG
GTATATGTAG ACGACCTGAT GGCAAAGCAT CTTGCTGCAC CGGACCCGCA TTCACAGTAT
GCACCCAAAG AAAGTCCGAC GTTTACCGGG ACACCCAAAG CGCCAACGCC AGCGGCGCTG
GATACATTAA ATGAACTGGC TGCTGCGCTG GGGAATGATC CAAACTTTGC GACAACAGTG
ATGAATGCGC TGGCAGGAAA ACAACCTCTC GATACCACGC TGACGAATCT GAGTGGAAAA
GACAATGCAG GGATTCTCCA ATACCTCGGT TTAAAAGACG CACTTTTAAA AGGTGACGGG
CGATTCCTTG CGGGAACGTT TGTCAGTGAC GCAATTGACC GAACATCAAT TGGTGCCAGA
GCGGCTACAG GCTGTCAGTT TATGCGCGCA CATCAGGCAC CTGATGCGCC AGACCAGGTA
AGTTTCTGGC AAATTATTAC CCTTAGCGAG GTGGTAAGTC CGACCACTGT TGTGGATGTT
CTTGCAGTCA GTGGCAATAA CGTATTGTTT GGTCACGGAA CAGGAGCGGG TATTACCTCA
TGGCGTCAAG TGGCGATGCT GGAGGGGGGC GCCTTTACGG GGGGTATTTC TGCTCCAAAT
ATGCGTGGCG ATACCCTGGT TACGGTTGGG GATGGCACTG GTGGGATGGC TAAAGGTGAC
GTTGATGGTG CAGGTTTTAA TGGTAACAAT CTGAACATTA AGTCATGGAA TGGTATCGGA
TTTCAGAACT CAGAAGACCT GGCTATCCGG GCATATATCA GCACCCGACT CGGTGTTATC
GCAGCTGCTG AAAATTTGCA GGCCGGAAAT GCGATATTCA ACAAAAACGG CGATGTTTAC
GGCGATATAT GGGGCACAGG CAGCGGGCCT GGCTGGTTGA GTGCGTATAT TGCAGGCAGA
CCGTTACGAC AATACATCAC CATGGTCGGT GTGTACCAGA ACGACAAAAC AAAGCCATTT
ATGCTTCATG ATGATGGTTC TGGTGTATTC CTTGCTACAA CTGACATGCT AAGTGGGTAT
GTTCAGTCAA TTCGATTCGG TGCCGTTGAG CATGGAAACG TATATCGTTC GCCCGGATTT
GCAGACCAGT TAGGTTACGT CATTACAGGT GTTGAGAATG GAGACTCGAA CGAAACACCA
GACAGGATCC AACGACGCTT GTTACAGCTT AAAGTGAATG GTCAGTGGTA TACGGTGGGG
GCATAA
 
Protein sequence
MSAKFYTLLT EIGAAKLASA AALGVPLKIT HMAVGDGGGV LPTPSAQQTA LVAEKRRAAL 
NMLYIDPQNS SQIIAEQVIP ETEGGWWIRE VGLFDETGAL IAVGNSPESY KPQLTEGSGR
TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKALELK VYVDDLMAKH LAAPDPHSQY
APKESPTFTG TPKAPTPAAL DTLNELAAAL GNDPNFATTV MNALAGKQPL DTTLTNLSGK
DNAGILQYLG LKDALLKGDG RFLAGTFVSD AIDRTSIGAR AATGCQFMRA HQAPDAPDQV
SFWQIITLSE VVSPTTVVDV LAVSGNNVLF GHGTGAGITS WRQVAMLEGG AFTGGISAPN
MRGDTLVTVG DGTGGMAKGD VDGAGFNGNN LNIKSWNGIG FQNSEDLAIR AYISTRLGVI
AAAENLQAGN AIFNKNGDVY GDIWGTGSGP GWLSAYIAGR PLRQYITMVG VYQNDKTKPF
MLHDDGSGVF LATTDMLSGY VQSIRFGAVE HGNVYRSPGF ADQLGYVITG VENGDSNETP
DRIQRRLLQL KVNGQWYTVG A