Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B2071 |
Symbol | |
ID | 6795300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 2001320 |
End bp | 2002756 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642776293 |
Product | prepilin-type N- cleavage/methylation domain protein |
Protein accession | YP_002146921 |
Protein GI | 197249826 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02532] prepilin-type N-terminal cleavage/methylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00288729 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAGA AAAAAGGATT TACCTTACTG GAAGTCAGTA TTGTTTTAGG TATTGGAACA CTCATCGCTT TTATGAAGTT TCAGGATATG AGAAACAATC AGGAAGCGGT ACTGGCTGAT AATGTCGGAA CACAGATTAA ACAGCTTGGT GAAGCGGTTA ACCGCTATAT CAGTATTCGC TATGACAAGA TTTCCACACT GTCATCTTCT CACAATCAGA GCAGTGATCC GGGTCCAAGA ACCTGTACGG CTGCTGGTTG TGAAATCACT TACCAGACGT TGATTAATGA AGGTCTTTTA CCTGTCGGCT ATACTGGAAC CAACGCGCAA AAATCCACGT ATAAGATTTT GTTGAAGCGT TCGGGAACAG CGCCTGATTA TGTGATTAAT GGTCTGATAA CAACCAGCAG CCCTTGGAAA GAAGGTGGGC GCATTCGCTA TGACCTATTA GGCAAAGCAG TTCAGGCTGC GGGTGTTGAT AGTGGTATGA GCAGAACAAC CAAGATAGCT TCTGGCTATG GTGGTCAGTG GAGTGAGAAT TCAGGCAACT ACAGCAATAT TACTGATGGT GGATTGCTGG CTTACCGCGT GGGCTATAAC TCTTCTATGT ATTCTGTTTA CTTGCGTCGT GATGGAACAT TGCCTATGAC TGGCGATCTA AATCTTGGCG GTAAGAGCAT CAAAAACATC AAAGATATGA CGGCATCAGG AACAACAACG ACCGGAACAC TTAATACAAC GGGTAAAGCA TCTGTTGCAT CAGATCTTGC TGTTGGTGGA ACCTCAACCC TGAACGGGCA GGTAAATATC AATAATAACC TGAAAGTGAA ATCCGATACT TATCTGAACA CGCTATCTAC AACAGGGCTG GCTAAGTTTG GTAGTCGTAT CGCGACCAAT GGACTCAATC CCAACGATCT ACCATCTGGT TGGGCTGGAG GTGTTCGCAC CTATGACTTG TATGCATCGG GGACTGTTGG CGCTGGAACC GGGAAAACGG TTAATGCTTA TATGAATAGT GCAGGTAATA TTTTTGCGTC AGGCAATATT ACAGCCGGAA CGATAAAATC CAATGGTACA ATTGAATCAG TTGGTAGAGT TAAAGTGGGT GAATATCTCC ATCTGAATGG TCAGGCAACG CTGAATGCGA AATGTACGCC TAATGGTCTG GTGGGGCGAG ACAGTGCGGG TAAGGTTTTA TCCTGCGTCA GTGGCAAGTG GTCCGAACTA ACACCTGCAG CAGCGTCAGG CATCTATGTT CGCTACTACA ATAGCATTAA ATGGTCCTGT ACTGTTCCTA ACAGAGCGAC AGGTGGTTGT TCTTGTTCAT CAGGTCTAAA CAGTATTTTG ATAAATACGG ATTCACAACA GTCATGTAGT GGTGGTAAGA ACGATCACTG TAGGACATAT ACCAAATATT TTTATGCTTG TGTTTAA
|
Protein sequence | MIKKKGFTLL EVSIVLGIGT LIAFMKFQDM RNNQEAVLAD NVGTQIKQLG EAVNRYISIR YDKISTLSSS HNQSSDPGPR TCTAAGCEIT YQTLINEGLL PVGYTGTNAQ KSTYKILLKR SGTAPDYVIN GLITTSSPWK EGGRIRYDLL GKAVQAAGVD SGMSRTTKIA SGYGGQWSEN SGNYSNITDG GLLAYRVGYN SSMYSVYLRR DGTLPMTGDL NLGGKSIKNI KDMTASGTTT TGTLNTTGKA SVASDLAVGG TSTLNGQVNI NNNLKVKSDT YLNTLSTTGL AKFGSRIATN GLNPNDLPSG WAGGVRTYDL YASGTVGAGT GKTVNAYMNS AGNIFASGNI TAGTIKSNGT IESVGRVKVG EYLHLNGQAT LNAKCTPNGL VGRDSAGKVL SCVSGKWSEL TPAAASGIYV RYYNSIKWSC TVPNRATGGC SCSSGLNSIL INTDSQQSCS GGKNDHCRTY TKYFYACV
|
| |