Gene ECD_01973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01973 
SymbolyegE 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2042273 
End bp2045590 
Gene Length3318 bp 
Protein Length1105 aa 
Translation table11 
GC content52% 
IMG OID 
Productpredicted diguanylate cyclase, GGDEF domain signalling protein 
Protein accessionACT43824 
Protein GI253978154 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAC AATCACAGCA TGTATTAATT GCCCTGCCCC ACCCGCTGCT TCACCTGGTC 
AGTTTAGGTT TAGTCTCGTT TATCTTTACC CTTTTCTCGC TTGAGCTTTC GCAGTTTGGC
ACCCAACTCG CCCCACTGTG GTTCCCGACG TCCATCATGA TGGTGGCGTT TTATCGCCAT
GCCGGGCGCA TGTGGCCGGG AATTGCGCTG AGCTGTTCGC TGGGAAATAT CGCCGCATCC
ATCCTGCTTT TTTCCACCAG CTCGCTGAAC ATGACCTGGA CGACCATCAA TATTGTTGAA
GCCGTGGTCG GGGCAGTGCT GCTACGTAAA TTGCTGCCGT GGTATAACCC CTTGCAAAAT
CTGGCTGACT GGCTGCGTCT GGCACTCGGC AGCGCCATTG TTCCACCTCT GTTGGGGGGT
GTTCTGGTTG TCCTGCTGAC GCCCGGAGAC GATCCTCTCA GGGCATTTTT GATATGGGTA
CTGTCAGAAT CCATCGGCGC TCTGGCACTG GTGCCGCTGG GATTGTTATT TAAACCACAC
TATCTGCTGC GCCATCGCAA CCCACGGTTG CTTTTTGAGT CGCTGCTCAC ATTAGCCATC
ACACTGACGT TAAGCTGGCT TTCGATGCTG TATCTGCCGT GGCCTTTTAC TTTCATTATT
GTGCTGTTGA TGTGGAGCGC CGTGCGCCTG CCACGAATGG AAGCCTTTTT GATCTTCCTT
ACCACGGTGA TGATGGTGTC GCTGATGATG GCCGCGGATC CCTCCCTGCT TGCTACGCCG
CGTACGTACC TGATGACCCA TATGCCGTGG CTACCGTTTT TGCTGATCCT GCTGCCCGCC
AACATCATGA CGATGGTGAT GTATGCCTTT CGTGCGGAAC GCAAACACAT TTCCGAAAGC
GAAACCCGTT TTCGGAACGC GATGGAATAT TCCGCCATCG GCATGGCATT AGTGGGCACC
GAGGGACAAT GGCTGCAATC CAACAAAGCA CTCTGCCAGT TTCTCGGTTA CAGTCAGGAA
GAGCTGCGCG GACTCACCTT TCAGCAACTG ACCTGGCCGG AGGATCTCAA TAAAGATCTC
CAACAGGTTG AAAAGCTGAT AAGCGGTGAA ATAAACACCT ATTCAATGGA AAAACGTTAC
TACAACCGCA ATGGCGATGT TGTCTGGGCG TTGCTTGCCG TCTCACTGGT GCGCCACACG
GATGGCACGC CGCTCTATTT TATCGCTCAG ATTGAAGACA TTAACGAGCT AAAACGCACC
GAACAGGTGA ATCAGCAACT GATGGAACGC ATCACGCTGG CTAACGAAGC GGGCGGGATT
GGCATCTGGG AGTGGGAGTT GAAGCCGAAT ATTTTTAGCT GGGATAAGCG GATGTTCGAG
CTGTATGAAA TTCCTCCGCA TATCAAACCG AACTGGCAGG TGTGGTACGA GTGCGTGCTG
CCGGAAGATC GCCAGCATGC CGAAAAAGTG ATTCGTGATT CGTTGCAATC ACGCTCGCCC
TTTAAGCTGG AATTTCGCAT TGCCGTGAAA GATGGCATCC GCCATATCCG CGCCCTCGCC
AACCGGGTAC TGAATAAAGA AGGCGAAGTC GAACGCCTGC TCGGCATTAA TATGGATATG
ACTGAGGTTA AACAGCTTAA CGAGGCATTG TTTCAGGAAA AAGAGCGCCT GCACATTACG
CTTGATTCCA TCGGTGAAGC CGTGGTCTGT ATTGATATGG CGATGAAAAT TACCTTTATG
AATCCAGTGG CGGAGAAGAT GAGCGGCTGG ACGCAGGAAG AAGCGTTAGG TGTTCCGCTC
CTGACGGTGT TGCATATTAC TTTTGGCGAC AACGGACCAT TAATGGAGAA CATTTACAGT
GCCGACACCT CACGTTCCGC GATCGAACAA GATGTGGTGT TGCACTGTCG GAGCGGCGGC
AGTTACGACG TGCATTACAG TATTACGCCG TTAAGTACTC TGGACGGCAG CAATATTGGT
TCGGTTCTGG TGATTCAGGA CGTTACCGAA TCACGCAAAA TGCTGCGCCA GCTGAGCTAC
AGCGCCTCCC ATGATGCACT GACGCATCTC GCCAACCGCG CCAGTTTTGA GAAACAACTG
CGTATCCTGC TGCAAACGGT AAACAGTACG CATCAGCGAC ATGCACTGGT GTTTATCGAT
CTTGATCGCT TTAAAGCGGT GAATGACAGC GCCGGGCATG CGGCGGGCGA CGCTTTGCTG
CGCGAACTGG CGTCGTTGAT GCTGAGTATG CTGCGCTCCA GTGACGTGCT GGCGCGGCTC
GGCGGTGATG AATTTGGTCT GCTGCTGCCA GACTGCAATG TTGAAAGCGC TCGTTTTATC
GCTACACGTA TTATCAGTGC TGTGAATGAC TATCACTTTA TATGGGAAGG CCGTGTGCAT
CAGGTAGGTG CCAGTGCCGG GATTACCTTG ATTGATGACA ACAATCATCA GGCGGCAGAA
GTGATGTCGC AGGCTGATAT CGCCTGTTAT GCCTCCAAAA ATGGTGGCCG GGGCCGGGTG
ACGGTTTACG AACCGCAGCA AGCTGCCGCA CATAGCGAAC GGGCGGCGAT GTCGCTTGAT
GAACAGTGGC GGATGATTAA AGAGAATCAG TTGATGATGC TCGCCCACGG TGTCGCTTCG
CCACGGATCC CGGAAGCGCG TAATTTGTGG CTGATTTCAC TTAAGCTCTG GAGTTGCGAA
GGCGAGATTA TTGATGAACA AACATTTCGT CGTAGCTTCA GCGATCCGGC GCTTAGCCAT
GCTCTTGACC GCCGGGTATT CCACGAATTT TTCCAGCAGG CCGCAAAAGC GGTTGCCAGT
AAAGGCATAA GCATCTCCCT CCCCCTTTCC GTTGCCGGTT TGAGTAGCGC CACGCTGGTG
AATGATCTGC TTGAGCAGCT GGAAAATAGC CCTCTACCAC CACGGTTATT ACATCTGATT
ATTCCGGCTG AAGCGATTTT AGATCACGCA GAAAGCGTGC AAAAACTGCG GCTGGCGGGA
TGTCGGATAG TGCTCAGCCA GGTGGGCCGC GATCTGCAAA TCTTCAACTC GCTGAAAGCG
AATATGGCAG ATTACCTGCT ACTTGATGGT GAGTTATGCG CCAACGTGCA GGGTAATTTG
ATGGATGAGA TGCTGATTAC GATTATTCAG GGGCACGCTC AGCGACTCGG GATGAAAACC
ATCGCCGGGC CAGTCGTTTT ACCCTTAGTG ATGGATACGC TTTCTGGCAT CGGCGTCGAT
CTGATTTATG GTGAGGTGAT TGCCGATGCC CAACCGCTGG ATTTGCTGGT GAATAGCAGT
TATTTCGCGA TTAACTGA
 
Protein sequence
MSKQSQHVLI ALPHPLLHLV SLGLVSFIFT LFSLELSQFG TQLAPLWFPT SIMMVAFYRH 
AGRMWPGIAL SCSLGNIAAS ILLFSTSSLN MTWTTINIVE AVVGAVLLRK LLPWYNPLQN
LADWLRLALG SAIVPPLLGG VLVVLLTPGD DPLRAFLIWV LSESIGALAL VPLGLLFKPH
YLLRHRNPRL LFESLLTLAI TLTLSWLSML YLPWPFTFII VLLMWSAVRL PRMEAFLIFL
TTVMMVSLMM AADPSLLATP RTYLMTHMPW LPFLLILLPA NIMTMVMYAF RAERKHISES
ETRFRNAMEY SAIGMALVGT EGQWLQSNKA LCQFLGYSQE ELRGLTFQQL TWPEDLNKDL
QQVEKLISGE INTYSMEKRY YNRNGDVVWA LLAVSLVRHT DGTPLYFIAQ IEDINELKRT
EQVNQQLMER ITLANEAGGI GIWEWELKPN IFSWDKRMFE LYEIPPHIKP NWQVWYECVL
PEDRQHAEKV IRDSLQSRSP FKLEFRIAVK DGIRHIRALA NRVLNKEGEV ERLLGINMDM
TEVKQLNEAL FQEKERLHIT LDSIGEAVVC IDMAMKITFM NPVAEKMSGW TQEEALGVPL
LTVLHITFGD NGPLMENIYS ADTSRSAIEQ DVVLHCRSGG SYDVHYSITP LSTLDGSNIG
SVLVIQDVTE SRKMLRQLSY SASHDALTHL ANRASFEKQL RILLQTVNST HQRHALVFID
LDRFKAVNDS AGHAAGDALL RELASLMLSM LRSSDVLARL GGDEFGLLLP DCNVESARFI
ATRIISAVND YHFIWEGRVH QVGASAGITL IDDNNHQAAE VMSQADIACY ASKNGGRGRV
TVYEPQQAAA HSERAAMSLD EQWRMIKENQ LMMLAHGVAS PRIPEARNLW LISLKLWSCE
GEIIDEQTFR RSFSDPALSH ALDRRVFHEF FQQAAKAVAS KGISISLPLS VAGLSSATLV
NDLLEQLENS PLPPRLLHLI IPAEAILDHA ESVQKLRLAG CRIVLSQVGR DLQIFNSLKA
NMADYLLLDG ELCANVQGNL MDEMLITIIQ GHAQRLGMKT IAGPVVLPLV MDTLSGIGVD
LIYGEVIADA QPLDLLVNSS YFAIN