Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3033 |
Symbol | |
ID | 6872102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2930573 |
End bp | 2932258 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642786063 |
Product | tail fiber domain-containing protein |
Protein accession | YP_002216709 |
Protein GI | 198243132 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.000290271 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCGCAA AATTTTATAC CCTGCTGACG GAGATCGGCG CGGCGAAACT GGCAAGCGCC GCCGCGCTCG GTGTCCCGCT GAAAATTACC CATATGGCGG TGGGCGACGG TGGCGGTGTG CTGCCCACAC CCAGCGCGCA ACAGACCGCG TTAGTTGCTG AGAAGCGTCG AGCAGCGCTG AATATGCTGT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCTGAGCA AGTGATCCCG GAAACTGAGG GGGGATGGTG GATTCGTGAG GTCGGCCTGT TTGATGAAAC CGGCGCACTG ATCGCCGTGG GTAACAGCCC TGAGAGCTAC AAGCCGCAGC TGACAGAAGG GAGCGGACGT ACGCAGACCG TGCGCATGGT ACTGATTACC AGCAGCACCG ATAACATCAC CCTGAAAATT GATCCTGCAG TAGTACTGGC AACCCGTAAA TATGTAGATG ATAAGGCGCT GGAGCTGAAG GTATATGTAG ACGACCTGAT GGCAAAGCAT CTTGCTGCAC CGGACCCGCA TTCACAGTAT GCACCCAAAG AAAGTCCGAC GTTTACCGGG ACACCCAAAG CGCCAACGCC AGCGGCGCTG GATACATTAA ATGAACTGGC TGCTGCGCTG GGGAATGATC CAAACTTTGC GACAACAGTG ATGAATGCGC TGGCAGGAAA ACAACCTCTC GATACCACGC TGACGAATCT GAGTGGAAAA GACAATGCAG GGATTCTCCA ATACCTCGGT TTAAAAGACG CACTTTTAAA AGGTGACGGG CGATTCCTTG CGGGAACGTT TGTCAGTGAC GCAATTGACC GAACATCAAT TGGTGCCAGA GCGGCTACAG GCTGTCAGTT TATGCGCGCA CATCAGGCAC CTGATGCGCC AGACCAGGTA AGTTTCTGGC AAATTATTAC CCTTAGCGAG GTGGTAAGTC CGACCACTGT TGTGGATGTT CTTGCAGTCA GTGGCAATAA CGTATTGTTT GGTCACGGAA CAGGAGCGGG TATTACCTCA TGGCGTCAAG TGGCGATGCT GGAGGGGGGC GCCTTTACGG GGGGTATTTC TGCTCCAAAT ATGCGTGGCG ATACCCTGGT TACGGTTGGG GATGGCACTG GTGGGATGGC TAAAGGTGAC GTTGATGGTG CAGGTTTTAA TGGTAACAAT CTGAACATTA AGTCATGGAA TGGTATCGGA TTTCAGAACT CAGAAGACCT GGCTATCCGG GCATATATCA GCACCCGACT CGGTGTTATC GCAGCTGCTG AAAATTTGCA GGCCGGAAAT GCGATATTCA ACAAAAACGG CGATGTTTAC GGCGATATAT GGGGCACAGG CAGCGGGCCT GGCTGGTTGA GTGCGTATAT TGCAGGCAGA CCGTTACGAC AATACATCAC CATGGTCGGT GTGTACCAGA ACGACAAAAC AAAGCCATTT ATGCTTCATG ATGATGGTTC TGGTGTATTC CTTGCTACAA CTGACATGCT AAGTGGGTAT GTTCAGTCAA TTCGATTCGG TGCCGTTGAG CATGGAAACG TATATCGTTC GCCCGGATTT GCAGACCAGT TAGGTTACGT CATTACAGGT GTTGAGAATG GAGACTCGAA CGAAACACCA GACAGGATCC AACGACGCTT GTTACAGCTT AAAGTGAATG GTCAGTGGTA TACGGTGGGG GCATAA
|
Protein sequence | MSAKFYTLLT EIGAAKLASA AALGVPLKIT HMAVGDGGGV LPTPSAQQTA LVAEKRRAAL NMLYIDPQNS SQIIAEQVIP ETEGGWWIRE VGLFDETGAL IAVGNSPESY KPQLTEGSGR TQTVRMVLIT SSTDNITLKI DPAVVLATRK YVDDKALELK VYVDDLMAKH LAAPDPHSQY APKESPTFTG TPKAPTPAAL DTLNELAAAL GNDPNFATTV MNALAGKQPL DTTLTNLSGK DNAGILQYLG LKDALLKGDG RFLAGTFVSD AIDRTSIGAR AATGCQFMRA HQAPDAPDQV SFWQIITLSE VVSPTTVVDV LAVSGNNVLF GHGTGAGITS WRQVAMLEGG AFTGGISAPN MRGDTLVTVG DGTGGMAKGD VDGAGFNGNN LNIKSWNGIG FQNSEDLAIR AYISTRLGVI AAAENLQAGN AIFNKNGDVY GDIWGTGSGP GWLSAYIAGR PLRQYITMVG VYQNDKTKPF MLHDDGSGVF LATTDMLSGY VQSIRFGAVE HGNVYRSPGF ADQLGYVITG VENGDSNETP DRIQRRLLQL KVNGQWYTVG A
|
| |