Gene SbBS512_E1077 details


Gene Information

Locus tag: SbBS512_E1077
Symbol:
ID: 6269852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 981793
End bp: 983469
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 51%
IMG OID: 641725217
Product: diguanylate cyclase (GGDEF) domain protein
Protein accession: YP_001879735
Protein GI: 187733517
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
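
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(981793, 983469)
print(length_bp)                  # 1677
print(protein_length(length_bp))  # 558
```

Under these assumptions the computed values agree with the listed gene length (1677 bp) and protein length (558 aa).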


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000734609
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAACC AGAGCTGGTT GAAAAAAATC GCACGCCGCC TGGGGCCTGG TCATGTCGTT 
AATCTCTGCT TTATCGTGGT ATTGCTTTTT TCCACCTTGC TGACCTGGCG TGAAGTGGTA
GTGCTGGAAG ATGCCTATAT CTCCAGCCAG CGTAATCATC TGGAAAACGT AGCCAACGCG
CTCGATAAGC ATTTGCAGTA TAACGTCGAC AAACTGATCT TTTTGCGTAA TGGCATGCGC
GAAGCTCTCG TAGCGCCACT GGATTTCACT TCACTGCGTA ATGCTGTAAC CGAGTTCGAA
CAGCATCGCG ACGAGCACGC CTGGCAAATT GAACTCAACC GACGACGCAC CCTGTCAGTC
AATGGCGTAT CGGATGCATT AGTCAGCGAG GGGAATCTCC TGTCTCGCGA AAATGAAAGC
CTCGACAATG AAATTACCGC TGCACTGGAA GTTGGTTACT TGCTGCGACT GGCGCACAAC
ACCTCGTCGA TGGTTGAACA GGCGATGTAT GTCTCGCGTG CCGGATTTTA CGTTTCGACG
CAGCCGACCT TGTTTACGCG CAATGTACCA ACGCGTTATT ACGGCTATGT CACCCAACCC
TGGTTTATCG GCCATTCGCA ACGAGAAAAT CGTCACCGCG CGGTACGCTG GTTTACTTCG
CAACCGGAAC ACGCCAGCAA TACTGAACCG CAGGTTACCG TCAGTGTTCC GGTAGACAGT
AATAACTACT GGTATGGCGT GCTGGGGATG AGTATTCCCG TGCGTACCAT GCAGCAATTT
TTAAGAAACG CCATCGATAA AAACCTCGAT GGTGAGTATC AGCTCTATGA CAGTAAGCTG
AGATTTTTGA CCTCTTCCAA TCCTGACCAT CCAACAGGGA ATATTTTTGA TCCTCGTGAA
CTGGCCTTGC TGGCGCAGGC GATGGAACAT GACACGCGGG GCGGCATTCG TATGGACAGT
CGCTATGTTA GCTGGGAACG TCTGGACCAT TTCGACGGTG TGCTGGTGCG CGTCCATACG
CTAAGCGAAG CCGTGCGCGG CGATTTCGGC AGTATCAGCA TTGCATTAAC CCTGCTGTGG
GCGCTCTTTA CCACCATGTT ACTCATCTCC TGGTATGTGA TTCGCCGGAT GGTTAGCAAC
ATGTATGTTC TGCAAAGCTC GTTGCAGTGG CAGGCGTGGC ACGACACCTT AACCCGTTTA
TATAATCGTG GCGCACTGTT TGAAAAAGCC CGTCCGCTCG CGAAATTGTG TCAGACGCAC
CAACATCCTT TTTCTGTCAT TCAGGTCGAC CTTGACCATT TTAAAGCGAT TAATGACCGC
TTTAGTCATC AGGCGGGCGA CCGTGTTCTT TCTCATGCTG CCGGATTAAT TAGCAGTTCC
TTGCGTGCGC AGGACGTTGC CGGGCGGGTC GGTGGTGAGG AGTTTTGTGT GATTCTGCCA
GGCGCGAGTC TGACGGAGGC TGCGGAAGTC GCAGAACGTA TTCGCCTGAA GTTAAATGAA
AAAGAGATGT TGATTGCTAA GAGTACGACG ATACGCATCA GTGCCTCGTT GGGGGTAAGT
AGCAGCGAGG AAACCGGTGA TTATGATTTT GAACAACTCC AGTCACTGGC TGACCGTCGG
CTTTATCTCG CTAAACAGGC CGGGCGTAAT CGGGTATGCG CGAGCGATAA CGCTTAA
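
The gene sequence is displayed in blocks of 10 bases, so it has to be joined into one string before it can be analysed. The sketch below shows one way to do that in Python and to compute a GC percentage that can be compared with the GC content reported for this gene. The helper names are hypothetical, and the short example string contains only the first two and last three blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Illustrative fragment only: first two and last three blocks of the gene.
example = "ATGGAAAACC AGAGCTGGTT\nCGGGTATGCG CGAGCGATAA CGCTTAA"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the complete sequence into `example` gives the GC percentage for the whole gene.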
 
Protein sequence
MENQSWLKKI ARRLGPGHVV NLCFIVVLLF STLLTWREVV VLEDAYISSQ RNHLENVANA 
LDKHLQYNVD KLIFLRNGMR EALVAPLDFT SLRNAVTEFE QHRDEHAWQI ELNRRRTLSV
NGVSDALVSE GNLLSRENES LDNEITAALE VGYLLRLAHN TSSMVEQAMY VSRAGFYVST
QPTLFTRNVP TRYYGYVTQP WFIGHSQREN RHRAVRWFTS QPEHASNTEP QVTVSVPVDS
NNYWYGVLGM SIPVRTMQQF LRNAIDKNLD GEYQLYDSKL RFLTSSNPDH PTGNIFDPRE
LALLAQAMEH DTRGGIRMDS RYVSWERLDH FDGVLVRVHT LSEAVRGDFG SISIALTLLW
ALFTTMLLIS WYVIRRMVSN MYVLQSSLQW QAWHDTLTRL YNRGALFEKA RPLAKLCQTH
QHPFSVIQVD LDHFKAINDR FSHQAGDRVL SHAAGLISSS LRAQDVAGRV GGEEFCVILP
GASLTEAAEV AERIRLKLNE KEMLIAKSTT IRISASLGVS SSEETGDYDF EQLQSLADRR
LYLAKQAGRN RVCASDNA
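
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch in Python using only the standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code, and it assumes an ATG start codon, as in this gene, so the alternative start codons of table 11 are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace is ignored; characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGGAAAACC AGAGCTGGTT G"))  # MENQSWL
```

The output matches the first seven residues of the protein sequence. Translating the full gene sequence the same way can be used to check the remaining residues.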