Gene Information |
Locus tag | SeHA_C1160 |
Symbol | |
ID | 6488655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1141380 |
End bp | 1143755 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642741402 |
Product | side tail fiber protein |
Protein accession | YP_002045054 |
Protein GI | 194450476 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG3064] Membrane protein involved in colicin uptake; [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
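The length fields in the Gene Information table can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported values) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1141380
end_bp = 1143755

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (2376 bp) and Protein Length (791 aa).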
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0463923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTAC TTATTTCCGG CGTACTGAAA GATGGTACGG GAACGCCGGT ACAGAACTGC ACCATTCAGC TGAAGGCCTG CCGGACCAGT ACGACGGTGG TCGTGAATAC GGTGGCATCG GAAAATCCGG ATGACGCCGG GCGCTACAGC ATGGATGTGG AGCAGGGGCA GTACACTGTC ACGCTCCTGG TGGAAGGGTA TCCCCCGTCA CATGCCGGAG TTATTACGGT TTACGATGAT TCAAAGCCGG GCACCCTGAA TGATTTTCTG GGGGCCATGA CGGAAGACGA CGTCCGCCCG GAGGCGCTGC GGCGTTTTGA GGCGATGGTG GAAGAAGTTG CCCGCCAGGC ATCGGAGGCA TCGCGGAATG CCACCGCCGC AGGGCAGGCA TCTGAACAGG CGCAGACATC AGCAGGTCAG GCATCGGAAA GCGCCACGGC AGCAGTGAAT GCAGCCGGAG CGGCAGAAGC ATCAGCCACA CAGGCAGCCT CATCCGCAGC ATCTGCGGAG AGCAGCGCAG GTACGGCGAC CACAAAAGCC GGGGAGGCAT CAGCCAGCGC GGCGTCGGCT GACACAGCCA GAACGGCAGC AGCCGCATCG GCAGCCGCAG CGAAAACATC TGAAGCGAAT GCAGATGCCT CCCGTACTGC CGCCGGAGAT TCAGCTGCTG CCGCAGCCGC CAGCGCGACG GCGGCGCAGA CATCAGCAGA GCGCGCCGGA GCATCCGAAA CCGCCGCGAA GACGTCAGAA ACGCAGGCGG CTTCCAGTGC CGGTGATGCA GGTGCGTCAG CCACTGCGGC GGCAGCGTCG GAAAAGGCGG CAGCCGCATC GGCAGCCGCA GCGAAAACAT CTGAGACAAA TGCAGCAACG TCAGCAAGTA CAGCAGCGGC CAGCGCAACA GCCGCCTCGT CATCAGCATC GGAGGCATCC ACTCACGCCG CTGCATCTGA TACCAGCGCA TCACTGGCGG CGCAAAGCAG TACTGCTGCC GGAGCAGCAG CCACCAGAGC AGAAGAGGCC GCAAAACGGG CAGAAGATAT CGCGGACGTG ATTTCCCTGG AAGATGCCAG CCTGACGAAA AAAGGTATCG TTAAGTTAAG CAGCGCCACG GACAGTGACA GCGAAGCGCT GGCAGCCACG CCAAAGGCGG TCAAAGCTGT CATGATTGAG GTACAGACCA AAGCGCCGCT GGACAGTCCG GTATTCACTG GAACACCGAC CACACCGACG CCGCCAGATG ACGCTAAGGG ACTTCAGACT GCAAACGCTG AGTTTGTTCG TAAACTGATT GCTGCACTGG TCGGTTCCGT ACCTGAGTCG CTGGATACGC TGCAGGAACT GGCGGACGCG CTGGGTAACG ATCCGAGCTT TGCCACCACT GTAATGAATA AACTGGCGGG CAAGCAGCCG CTGGACGATA CACTGACGGC GCTGTCAGGA AAAAGCATTG AAGGTCTTAT CGAATACGTT GGTTTACGGA GCACAATTGA TAAGGCTGCT GGTGCGTTGC CTGCTGGTGG TACGGCTGTC GCAGCGAACA GGCTTGCATC ACGCGGCGCG CTTCCGGCAC TGACTGGCAC GACAAGAGGC AGCGATGGCG GCCTGATAAT GGGCGAGGTT TACAATAACG GTTATCCAAC GCAATACGGG AATATTTTGC GTCTGACCGG AACCGGTGAT GGAGAGGTAT TAATCGGATG GAGTGGGGTT AATGGTGCTC CTGCGCCCGC ATATATTCGC AGCCATCGAG ATACCGCCGA CGCTGAGTGG TCAGAATGGG CGATGTTCTA CACCTCACTA 
AATCCGCCAC CGGATTCGTA TCCAGTAGGG GCGGCGATTG CATGGCCGTC TGATGTGCTC CCGGATGGTG GTTATGCTTT TATGTATGGG CAGTCCTTCG ATAAATCTGC TTACCCGTTA CTGGCTATAG CGTATCCGTC CAGCGTTATC CCTGACATGA GAGGCTGGAC AATAAAGGGT AAGCCCATCA GTGGACGTGC CGTATTGTCG CAAGAAATGG ACGGCAATAA ATCGCACTCG CACACCGCGC GGGCGCAGGA TACTGACTTA GGGACAAAAT CTACCTCATC CTTTGATTAC GGCACGAAAT CGACCAATAC CACGGGCAAT CATACTCACC AGTTCGGCGG TTATATCAAT TCATACTGGG GAGATTCCAA TCACACCTCA TTTCAGCCTG GAGGTGGTGC ATGGACACAG GCCGCTGGCG ACCATGCGCA TACAGTTTAT ATCGGAGGAC ACGAGCACAC CATGTATATC GGTCCACACG GACACGTCGT TATTGTGGAC GCAGACGGTA ATGCAGAAAC CACGGTTAAA AACATTGCAT TTAACTATAT TGTGAGGCTG GCGTGA
|
Protein sequence | MPVLISGVLK DGTGTPVQNC TIQLKACRTS TTVVVNTVAS ENPDDAGRYS MDVEQGQYTV TLLVEGYPPS HAGVITVYDD SKPGTLNDFL GAMTEDDVRP EALRRFEAMV EEVARQASEA SRNATAAGQA SEQAQTSAGQ ASESATAAVN AAGAAEASAT QAASSAASAE SSAGTATTKA GEASASAASA DTARTAAAAS AAAAKTSEAN ADASRTAAGD SAAAAAASAT AAQTSAERAG ASETAAKTSE TQAASSAGDA GASATAAAAS EKAAAASAAA AKTSETNAAT SASTAAASAT AASSSASEAS THAAASDTSA SLAAQSSTAA GAAATRAEEA AKRAEDIADV ISLEDASLTK KGIVKLSSAT DSDSEALAAT PKAVKAVMIE VQTKAPLDSP VFTGTPTTPT PPDDAKGLQT ANAEFVRKLI AALVGSVPES LDTLQELADA LGNDPSFATT VMNKLAGKQP LDDTLTALSG KSIEGLIEYV GLRSTIDKAA GALPAGGTAV AANRLASRGA LPALTGTTRG SDGGLIMGEV YNNGYPTQYG NILRLTGTGD GEVLIGWSGV NGAPAPAYIR SHRDTADAEW SEWAMFYTSL NPPPDSYPVG AAIAWPSDVL PDGGYAFMYG QSFDKSAYPL LAIAYPSSVI PDMRGWTIKG KPISGRAVLS QEMDGNKSHS HTARAQDTDL GTKSTSSFDY GTKSTNTTGN HTHQFGGYIN SYWGDSNHTS FQPGGGAWTQ AAGDHAHTVY IGGHEHTMYI GPHGHVVIVD ADGNAETTVK NIAFNYIVRL A
|
| |
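As a spot check that the protein sequence corresponds to the gene sequence, the sketch below translates the first 30 bases and the final 15 bases of the gene. It builds the codon table by hand rather than relying on an external library; translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons), so the standard assignments are used here. The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces are ignored, '*' marks a stop."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence, copied from the record.
head = "ATGCCAGTAC TTATTTCCGG CGTACTGAAA"
# Final 15 bases of the gene sequence (in frame, ending at the stop codon).
tail = "GTGAGGCTG GCGTGA"

print(translate(head))
print(translate(tail))
```

The first call yields the opening ten residues of the protein sequence (MPVLISGVLK), and the second yields the closing residues followed by the stop marker (VRLA*).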