Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1165 |
Symbol | |
ID | 6271498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1063487 |
End bp | 1066804 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641725298 |
Product | putative sensor protein |
Protein accession | YP_001879812 |
Protein GI | 187730854 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3447] Predicted integral membrane sensor domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.807956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC AGTTTAGGTT TAGTCTCGTT TATCTTTACC CTTTTCTCGC TTGAGCTTTC GCAGTTTGGC ACCCAACTCG CCCCACTGTG GTTCCCGACG TCCATCATGA TGGTGGCGTT TTATCGCCAT GCCGGGCGCA TGTGGCCGGG AATTGCGCTG AGCTGTTCGC TGGGAAATAT CGCCGCATCC ATCCTGCTTT TTTCCACCAG CTCGCTGAAC ATGACCTGGA CGACCATCAA TATTGTTGAA GCCGTGGTCG GGGCAGTGCT GCTACGTAAA TTGCTGCCGT GGTATAACCC CTTGCAAAAT CTGGCTGACT GGCTGCGTCT GGCACTCGGC AGCGCCATTG TTCCGCCTCT GTTGGGGGGG GGTCTGGTTG TCCTGCTGAC GGCCGGAGAC GATCCTCTCA GGGCATTTTT GATATGGGTA CTGTCAGAAT CCATCGGCGC TCTGGCACTG GTGCCGCTGG GATTGTTATT TAAACCACAC TATCTGCTGC GCCATCGCAA CCCACGGTTG CTTTTTGAGT CGCTGCTCAC GTTAGCCATC ACGCTGACGT TAAGCTGGCT TTCTATGTTG TACCTGCCGT GGCCTTTTAC TTTCATTATT GTGCTGTTGA TGTGGAGCGC CGTGCGCCTG CCACGAATGG AAGCCTTTTT GATCTTCCTT ACCACGGTGA TGATGGTGTC GCTGATGATG GCCGCGGATC CCTCCCTGCT TGCTACGCCG CGTACATACC TGATGAGCCA TATGCCGTGG CTACCGTTTT TGCTGATCCT GCTGCCCGCC AACATCATGA CGATGGTGAT GTATGCCTTT CGTGCGGAAC GCAAACACAT TTCCGAAAGC GAAACCCGTT TTCGGAACGC CATGGAATAT TCCGCTATCG GTATGGCGTT AGTGGGCACC GAGGGACAAT GGCTGCAAAC CAACAAAGCG CTCTGCCAGT TTCTCGGGTA CAGTCAGGAA GAGCTGCGCG GACTCACCTT TCAGCAACTG ACCTGGCCGG AGGATCTCAA TAAAGATCTC CAACAGGTTG AAAAGCTGAT AAGCGGTGAA ATAAACACCT ATTCAATGGA AAAACGCTAC TACAACCGCA ATGGCGATGT TGTCTGGGCG TTGCTTGCCG TCTCACTGGT GCGCCACACG GATGGCACGC CGCTCTATTT TATCGCTCAG ATTGAAGACA TTAACGAGCT AAAACGCACC GAACAGGTGA ATCAGCAACT GATGGAGCGC ATCACGCTGG CTAACGAAGC GGGCGGGATT GGCATCTGGG AGTGGGAGCT GAAGCCGAAT ATTTTTAGCT GGGATAAGCG GATGTTCGAG CTGTATGAAA TTCCTCCGCA TATCAAACCG AACTGGCAGG TGTGGTACGA GTGCGTGCTG CCGGAAGATC GCCAGCATGC CGAAAAAGTG ATTCGTGATT CGTTGCAATC ACGCTCGCCC TTTAAGCTGG AATTTCGCAT TACCGTAAAA GACGGTATTC GCCATATCCG CGCCCTCGCC AACCGGGTAC TGAATAAAGA AGGCGAAGTC GAACGCCTGC TCGGCATTAA TATGGATATG ACCGAGGTTA AACAGCTTAA CGAGGCATTG TTTCAGGAAA AAGAGCGCCT GCACATTACG CTTGATTCCA TCGGTGAAGC CGTGGTCTGT ATTGATATGG CGATGAAAAT TACCTTTATG AATCCAGTGG CGGAGAAGAT GAGCGGCTGG ACGCAGGAAG AAGCGTTAGG TGTTCCGCTC CTGACGGTGT TGCATATTAC TTTTGGCGAC AACGGACCAT TAATGGAGAA CATTTACAGT GCCGACACCT CACGTTCCGC GATTGAACAA GATGTGGTGT TGCACTGTCG GAGCGGCGGC AGCTACGACG TGCATTACAG TATTACGCCG TTAAGTACTC TGGACGGCAG CAATATTGGT TCGGTTCTGG TGATTCAGGA CGTCACCGAA TCACGCAAAA TGCTGCGCCA GCTGAGCTAC AGCGCCTCCC ATGATGCACT GACGCATCTC GCCAACCGCG CCAGTTTTGA GAAACAACTG CGTATCCTGC TGCAAACGGT AAACAGTACA CATCAGCGAC ATGCCCTGGT GTTTATCGAT CTTGATCGCT TTAAAGCGGT GAATGACAGC GCCGGGCATG CGGCGGGTGA CGCTTTACTG CGCGAACTGG CGTCGTTGAT GCTGAGTATG CTGCGCTCCA GCGACGTGCT GGCGCGACTC GGTGGTGATG AATTTGGTCT ACTGTTGCCA GACTGTAATA TCGAAAGTGC GCGTTTTATC GCTACACGCA TTATCAGTGC CGTGAATGAT TATCACTTTA TATGGGAAGG ACGTGTACAT CGGGTAGGTG CCAGTGCCGG GATTACCTTG ATTGATGACA ACAATCATCA GGCGGCTGAA GTGATGTCGC AGGCTGATAT CGCCTGTTAT GCCTCCAAAA ATGGTGGCCG GGGCCGGGTG ACGATTTACG AACCGCAGCA AGCTGCCGCA CATAGCGAGC GGGCAGCGAT GTCGCTTGAT GAACAGTGGC GGATGATTAA AGAGAATCAG TTGATGATGA TCGCCCACGG TGTCGCTTCG CCACGGATCC CGGAAGCACG TAATTTGTGG CTGATTTCAC TTAAGCTCTG GAATTGCGAA GGCGAGATTA TTGATGAACA AACATTTCGT CGTAGCTTCA GCGATCCGGC ACTTAGCCAT GCTCTTGACC GACGGGTATT CCACGATTTT TTCCAGCAGG CCGCAAAAGC GGTTGCCAGT AAAGGCATAA GCATCGCCCT CCCCCTTTCC GTTGCCGGTT TGAGTAGCGC CACGCTGGTG AATGATCTGC TTGAGCAGCT GGAAAACAGC CCTCTACCAC CACGGTTATT ACATCTGATT ATTCCGGCTG AAGCGATTTT AGAGCACGCA GAAAGCGTGC AAAAACTGCG GCTGGCGGGA TGTCGGATAG TGCTCAACCA GGTGGGCCGC GATCTGCAAA TCTTCAACTC GCTGAAAGCG AATATGGCAG ATTACCTGCT ACTTGATGGT GAGTTATGCG CCAACGTGCA GGGTAATTTG ATGGATGAGA TGCTGATTAC GATTATTCAG GGGCACGCTC AGCGACTCGG GATGAAAACC ATCGCCGGGC CAGTCGTTTT ACCCTTAGTG ATGGATACGC TTTCTGGCAT CGGCGTCGAT CTGATTTATG GTGATGTGAT TGCCGATGCC CAACCGCTGG ATTTGCTGGT GAATAGTAGT TATTTCGCGA TTAACTGA
|
Protein sequence | MSKQSQHVLI ALPHPLLHLV SLGLVSFIFT LFSLELSQFG TQLAPLWFPT SIMMVAFYRH AGRMWPGIAL SCSLGNIAAS ILLFSTSSLN MTWTTINIVE AVVGAVLLRK LLPWYNPLQN LADWLRLALG SAIVPPLLGG GLVVLLTAGD DPLRAFLIWV LSESIGALAL VPLGLLFKPH YLLRHRNPRL LFESLLTLAI TLTLSWLSML YLPWPFTFII VLLMWSAVRL PRMEAFLIFL TTVMMVSLMM AADPSLLATP RTYLMSHMPW LPFLLILLPA NIMTMVMYAF RAERKHISES ETRFRNAMEY SAIGMALVGT EGQWLQTNKA LCQFLGYSQE ELRGLTFQQL TWPEDLNKDL QQVEKLISGE INTYSMEKRY YNRNGDVVWA LLAVSLVRHT DGTPLYFIAQ IEDINELKRT EQVNQQLMER ITLANEAGGI GIWEWELKPN IFSWDKRMFE LYEIPPHIKP NWQVWYECVL PEDRQHAEKV IRDSLQSRSP FKLEFRITVK DGIRHIRALA NRVLNKEGEV ERLLGINMDM TEVKQLNEAL FQEKERLHIT LDSIGEAVVC IDMAMKITFM NPVAEKMSGW TQEEALGVPL LTVLHITFGD NGPLMENIYS ADTSRSAIEQ DVVLHCRSGG SYDVHYSITP LSTLDGSNIG SVLVIQDVTE SRKMLRQLSY SASHDALTHL ANRASFEKQL RILLQTVNST HQRHALVFID LDRFKAVNDS AGHAAGDALL RELASLMLSM LRSSDVLARL GGDEFGLLLP DCNIESARFI ATRIISAVND YHFIWEGRVH RVGASAGITL IDDNNHQAAE VMSQADIACY ASKNGGRGRV TIYEPQQAAA HSERAAMSLD EQWRMIKENQ LMMIAHGVAS PRIPEARNLW LISLKLWNCE GEIIDEQTFR RSFSDPALSH ALDRRVFHDF FQQAAKAVAS KGISIALPLS VAGLSSATLV NDLLEQLENS PLPPRLLHLI IPAEAILEHA ESVQKLRLAG CRIVLNQVGR DLQIFNSLKA NMADYLLLDG ELCANVQGNL MDEMLITIIQ GHAQRLGMKT IAGPVVLPLV MDTLSGIGVD LIYGDVIADA QPLDLLVNSS YFAIN
|
| |