Gene Information |
Locus tag | SNSL254_A3777 |
Symbol | |
ID | 6483757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3642408 |
End bp | 3644738 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642739044 |
Product | putative transcriptional accessory protein |
Protein accession | YP_002042755 |
Protein GI | 194442760 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
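The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3642408
end_bp = 3644738
reported_gene_length = 2331      # bp
reported_protein_length = 776    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which does not encode an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields, and the gene length is a whole number of codons.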
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 100 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG ATTCTTTCTG CCGCATTATT GCGGGTGAAA TCCAGGCAAA TGCCGGGCAG GTTGAAGCTG CCGTTCGCCT GCTTGACGAA GGGAACACCG TGCCGTTTAT CGCACGTTAT CGTAAAGAAA TCACCGGCGG TCTGGATGAC ACGCAGTTGC GTAACCTGGA AACGCGTCTG GGCTATCTGC GTGAGCTGGA AGACAGGCGT CAGGCCATCC TCAAGTCCAT TTCCGAACAA GGCAAACTGA CCGATGAGCT GGCTGGCGCC ATCAACGCTA CGTTAAGTAA GACCGAGCTC GAAGACCTCT ACCTGCCCTA TAAACCTAAA CGCCGCACCC GTGGACAAAT CGCCATTGAA GCCGGCCTTG AGCCACTGGC CGATCTGCTC TGGAACGAGC CGTCCCACGA TCCTGACGTG GAAGCGGCAA AGTACATTGA TGGCGACAAA GGCGTGGCGG ACACGAAGGC CGCGCTCGAC GGCGCACGCT ACATTCTGAT GGAGCGCTTT GCCGAAGACG CCGCATTGCT GGCGAAAGTG CGTGATTACC TGTGGAAGAA CGCCCATCTG GTCGCCACCG TCGTGAGCGG CAAAGAGGAA GAAGGGGCAA AATTCCGCGA CTATTTCGAC CATCATGAGC CCATTGCTAA CGTCCCGTCT CACCGTGCGC TGGCCATGTT CCGTGGTCGT AACGAAGGCA TTCTGCAACT TTCGCTCAAT GCCGACCCAC AGTTTGATGA GCCGCCAAAA GAAAGCTACT GCGAGCAAAT CATCATGGAC CATCTCGGCC TGCGGCTAAA TAACGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC TGGACGTGGC GTATCAAAGT CTTAATGCAC CTCGAAACCG AACTGATGGG CACCGTGCGC GAACGTGCGG AAGACGAAGC GATTAACGTG TTTGCGCGTA ACCTGCACGA CCTGCTGATG GCAGCCCCCG CAGGCCTGCG CGCCACGATG GGCCTTGATC CTGGCCTGCG TACCGGCGTA AAAGTCGCTG TCGTTGACGG CACCGGCAAG CTGGTGGCGA CGGATACCAT TTATCCGCAT ACCGGTCAGG CGGCCAAAGC GGCTACCGTG ATCGCCGCGC TGTGCGAAAA ATACCACGTC GAACTGGTCG CGATTGGCAA CGGTACGGCC TCGCGTGAAA CCGAACGCTT CTATCTCGAC GTACAGAAAC AGTTCCCGAA CGTGACGGCG CAGAAAGTGA TCGTCAGCGA AGCGGGGGCG TCCGTGTATT CCGCTTCTGA GCTGGCGGCG CAGGAGTTTC CGGATCTCGA CGTCTCCCTG CGCGGCGCAG TCTCTATCGC CCGTCGTCTG CAGGATCCGC TGGCGGAACT GGTGAAAATC GATCCGAAAT CGATCGGCGT CGGCCAATAT CAACACGATG TCAGCCAGAC GCAGCTGGCG CGTAAGCTGG ATGCGGTGGT CGAAGACTGC GTAAACGCCG TCGGCGTCGA TTTGAATACC GCCTCCGTGC CGCTGCTGAC CCGCGTCGCG GGCTTAACGC GCATGATGGC GCAAAACATC GTCGCCTGGC GCGATGAGAA CGGTCAGTTC CAGAATCGCC AGCAACTGTT GAAGGTGAGC CGTCTGGGGC CGAAAGCGTT TGAGCAGTGC GCGGGCTTCC TGCGTATTAA CCACGGCGAT AACCCACTGG ATGCCTCCAC CGTCCACCCG GAAGCCTATC CGGTTGTCGA ACGCATTCTG GCGGCGACGC AGCAAGCGCT AAAAGATCTG ATGGGCAACA GCAACGAATT GCGTCACCTC 
AAGGCCGCTG ATTTTACCGA CGATAAATTC GGCGTGCCGA CCGTGAGCGA TATCATCAAA GAGCTGGAAA AACCGGGCCG CGACCCGCGT CCTGAATTTA AAACCGCGCA ATTCGCCGAT GGCGTTGAAA CCATGAACGA CCTGCTACCG GGGATGATTC TGGAAGGGGC GGTCACTAAC GTCACCAATT TCGGCGCGTT TGTCGATATC GGCGTTCATC AGGATGGCCT GGTGCATATC TCCTCGCTCT CGAATAAGTT CGTCGACGAT CCACACACCG TGGTAAAAGC TGGCGACATC GTGAAGGTGA AAGTGCTGGA AGTGGATCTG CAACGTAAGC GTATTGCGCT GACGATGCGT CTGGACGAAC AGCCCGGCGA AACCGCCGCT CGCCGCGGCG GCGCCGCCGA TCGCGCGCAG GGCAACCGCC CGGCGTCAAA AGCGGCGAAA CCGCGCGGTC GTGACGCCCA GCCAGCCGGT AACAGCGCCA TGATGGACGC GCTGGCAGCG GCAATGGGGA AAAAACGCTA A
|
Protein sequence | MMNDSFCRII AGEIQANAGQ VEAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL GYLRELEDRR QAILKSISEQ GKLTDELAGA INATLSKTEL EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WNEPSHDPDV EAAKYIDGDK GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VATVVSGKEE EGAKFRDYFD HHEPIANVPS HRALAMFRGR NEGILQLSLN ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDGTGK LVATDTIYPH TGQAAKAATV IAALCEKYHV ELVAIGNGTA SRETERFYLD VQKQFPNVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSNELRHL KAADFTDDKF GVPTVSDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLLP GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVDD PHTVVKAGDI VKVKVLEVDL QRKRIALTMR LDEQPGETAA RRGGAADRAQ GNRPASKAAK PRGRDAQPAG NSAMMDALAA AMGKKR
|
| |
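The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch for normalizing such text and computing its length and GC percentage follows; the helper names `normalize` and `gc_percent` are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full gene.

```python
def normalize(seq: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGATGAATG ATTCTTTCTG CCGCATTATT"

print(len(normalize(first_blocks)))
print(round(gc_percent(first_blocks), 2))
```

Applied to the complete gene sequence, the same functions could be compared against the Gene Length and GC content fields in the record.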