Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3873 |
Symbol | |
ID | 6871420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3701067 |
End bp | 3703397 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642786835 |
Product | putative transcriptional accessory protein |
Protein accession | YP_002217463 |
Protein GI | 198242502 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG ATTCTTTCTG CCGCATTATT GCGGGTGAAA TCCAGGCAAA TGCCGGGCAG GTTGAAGCTG CCGTTCGCCT GCTTGACGAA GGGAACACCG TGCCGTTTAT CGCACGTTAT CGTAAAGAAA TCACCGGCGG TCTGGATGAC ACGCAGTTGC GTAACCTGGA AACGCGTCTG GGCTATCTGC GTGAGCTGGA AGACAGGCGT CAGGCTATCC TCAAGTCCAT TTCCGAACAA GGCAAACTGA CCGATGAGCT GGCTGGCGCC ATCAACGCTA CGTTAAGTAA GACCGAGCTC GAAGACCTCT ACCTGCCCTA TAAACCTAAA CGCCGCACCC GTGGACAAAT CGCCATTGAA GCCGGCCTTG AGCCGCTGGC CGATCTGCTC TGGAACGAGC CGTCCCACGA TCCTGACGTG GAAGCGGCAA AGTACATTGA TGGCGACAAA GGCGTGGCGG ACACGAAAGC CGCGCTCGAC GGCGCACGCT ACATTCTGAT GGAGCGCTTT GCCGAAGACG CCGCATTGCT GGCGAAAGTG CGTGATTACC TGTGGAAGAA CGCCCATCTG GTCGCCACAG TCGTGAGCGG CAAAGAGGAA GAAGGGGCAA AATTCCGCGA CTATTTCGAC CATCATGAGC CCATTGCTAA CGTCCCGTCT CACCGTGCGC TGGCCATGTT CCGTGGTCGT AACGAAGGCA TTCTGCAACT TTCGCTCAAT GCCGACCCAC AGTTTGATGA GCCGCCGAAA GAGAGCTACT GCGAGCAGAT CATCATGGAC CATCTCGGCC TGCGGCTGAA TAACGCCCCG GCGGATAGCT GGCGCAAAGG CGTAGTGAGC TGGACGTGGC GTATCAAAGT CTTAATGCAC CTCGAAACCG AACTGATGGG CACCGTGCGC GAACGTGCGG AAGACGAAGC GATTAACGTG TTTGCGCGTA ACCTGCACGA CCTGCTGATG GCAGCCCCCG CAGGCCTGCG CGCCACGATG GGCCTTGATC CTGGCCTGCG TACCGGCGTA AAAGTCGCTG TCGTTGACGG CACCGGCAAG CTGGTGGCGA CGGATACCAT TTATCCGCAT ACCGGTCAGG CGGCCAAAGC GGCTACCGTG ATCGCCGCGC TGTGCGAAAA ATACCACGTC GAACTGGTCG CGATTGGCAA CGGTACGGCC TCGCGTGAAA CCGAACGCTT CTATCTCGAC GTACAGAAAC AGTTCCCGAA CGTGACGGCG CAGAAAGTGA TCGTCAGCGA AGCGGGGGCG TCCGTGTATT CCGCTTCGGA GCTGGCGGCG CAGGAGTTTC CGGATCTCGA CGTCTCCCTG CGTGGCGCGG TCTCTATCGC GCGTCGTCTG CAGGATCCGC TGGCGGAACT GGTGAAAATC GATCCGAAAT CGATCGGCGT CGGCCAATAT CAACACGATG TCAGCCAGAC GCAGCTGGCG CGTAAGCTGG ATGCGGTGGT CGAAGACTGC GTAAACGCCG TCGGCGTCGA TTTGAATACC GCCTCCGTGC CGCTGCTGAC CCGCGTCGCG GGCTTAACGC GCATGATGGC GCAAAACATC GTCGCCTGGC GCGATGAGAA CGGTCAGTTC CAGAACCGCC AGCAACTGTT GAAGGTGAGC CGTCTGGGGC CGAAAGCGTT TGAGCAGTGC GCGGGCTTCC TGCGTATTAA CCACGGCGAT AACCCGCTGG ATGCCTCCAC CGTCCACCCG GAAGCCTATC CGGTTGTCGA ACGCATTCTG GCGGCGACGC AGCAAGCGCT AAAAGATCTG ATGGGCAACA GCAACGAATT GCGTCACCTC AAGGCCGCTG ATTTTACCGA CGATAAATTC GGCGTGCCGA CCGTGAGCGA TATCATCAAA GAGCTGGAAA AACCGGGCCG CGACCCGCGT CCTGAATTTA AAACCGCGCA ATTCGCCGAT GGCGTCGAAA CCATGAACGA CCTGCTGCCG GGGATGATTC TGGAAGGGGC GGTCACTAAC GTCACTAACT TCGGCGCGTT TGTCGATATC GGCGTTCATC AGGATGGCCT GGTGCATATC TCCTCGCTCT CGAATAAGTT CGTCGACGAT CCGCACACCG TGGTGAAAGC GGGCGACATC GTGAAGGTGA AAGTGCTGGA AGTGGATCTG CAACGTAAGC GTATTGCGCT GACGATGCGT CTGGACGAAC AGCCCGGCGA AACCGCCGCT CGCCGCGGCG GCGGCGCCGA TCGCGCTCAG GGCAACCGCC CTGCATCTAA AGCGGCGAAA CCGCGCGGTC GTGACGCCCA GCCAGCCGGT AACAGCGCCA TGATGGACGC GCTGGCAGCG GCAATGGGAA AAAAACGCTA A
|
Protein sequence | MMNDSFCRII AGEIQANAGQ VEAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL GYLRELEDRR QAILKSISEQ GKLTDELAGA INATLSKTEL EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WNEPSHDPDV EAAKYIDGDK GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VATVVSGKEE EGAKFRDYFD HHEPIANVPS HRALAMFRGR NEGILQLSLN ADPQFDEPPK ESYCEQIIMD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR ERAEDEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDGTGK LVATDTIYPH TGQAAKAATV IAALCEKYHV ELVAIGNGTA SRETERFYLD VQKQFPNVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSNELRHL KAADFTDDKF GVPTVSDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLLP GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVDD PHTVVKAGDI VKVKVLEVDL QRKRIALTMR LDEQPGETAA RRGGGADRAQ GNRPASKAAK PRGRDAQPAG NSAMMDALAA AMGKKR
|
| |