Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2499 |
Symbol | |
ID | 4245268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3855819 |
End bp | 3857666 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638107580 |
Product | diguanylate cyclase/phosphodiesterase |
Protein accession | YP_722179 |
Protein GI | 113476118 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0730388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGTTA CACAAATACC AGGGGTTTGT TCATTGAGAC TGATATTAAA TTTAATAGCT ACTATTTCGA GAATTCATAC TTTTCATCTA TTGAAAGGGT TAACGTCCTT GCTATTAACC TGTGATTATT TATCAATTTG GAAGAGCTTT AAGCAGTTTA CTAATAAAAT TTATCAGCTT ATTCATTATA TTTATAAATT ACTTCAGCTC AACTCTGCCT TTTGGAAAAA GCTCAAGTCT AAGGTGGATG ATGTTTTATC TAGTGGTCGA AAAAAGTTGA TTAAACTAAA TGAACAAGCT ATAGATTCTT CTATGCAAAT TTATTATAAT GATAATAATG ATAATAATGA GGTAGCAAAG CAGGTAATTG TTAAAAAGCT CAAGAAACAC AGACAAAAAC AAGCTACACT TTTATATAAG GCATTTCACG ATGGGCTGAC GGGTTTACCA AACCGTAATT TGTTTTTTAA TGAACTAAAA CTAGTTTTAC ATCAGTCTAG AAAAGACCTA AGTTATAAGT ATGGGGTTCT GTTTTTAGAT CTAGATCGCT TTAAAGTAAT TAATGATAGT TTGGGGCATG TTATTGGAGA TCAAGTATTA GTCATCATCA GCAGGCGACT ACAGAATTAT ATCAAAGCTT CAGATACTGT GGCGAGGTTA GGAGGGGATG AGTTCACTAT CTTACTAAGG TCTCTACCCG ATGCTGACTA TGCTACTAAA GTAGCTATGA AAATTAATCA AGAACTTGCT CAACCAATTT ATGTTGAAGG ACATGAAATA TTTACTACAG TAAGTATTGG CATTGTTACT AATTGTAAAC AGTTTAGTGA TCAGCTCGGT GAAAATAGTT TTCCTATTTG TCCTATTTAC AACCACCCAG AAGATGTACT GCGAGCTGCA GATATTGCGA TGTATCGGGC TAAAGATTTA GGTAAGGCAA GGTATGAGGT GTTTGATTTG ACAATGCACA GAGAAGTTGT ATCTTTATTA GAATTGGAAA ATGATCTACG ACGAGCTGTA GAAATGATTA AACAAAATCC AGTAAATTCT CAGTTTTTTT TAAGTTACCA ACCAATTATT TGTTTAACTA CAAATAAAAT TACTGGTTTT GAATGTTTGG TACGCTGGGC TCATCCTACT AAAGGCTTAA TTCAACCAGG AAAGTTTATC CCATTGGCAG AAGAAACGGG TTTAATTATT CCTCTAGGAA TGTGGATATT GCGAACGGCT TGTCATCAGT TGGCTGTTTG GCAGAAAAGG TTGACAGGGA GACATGTTTG CTCTGGTTTG GCATATTCTG ACTGTAATTT TATTGCTCAT AATTTTACTA TGAGTGTCAA TATTTCTAGC AAGAATATCT CACAACCAAA TTTTTTGGAA CAAGTTAATG AAATTTTGGC GGCAACAAAT TGTCAACCTC ATTGTTTGAA TCTAGAAATT ACTGAAAGTT TAATTATGAC AAATGTTGAT TTAGCTACTG CAGTTTTTGA GAAACTGAAA AATCAAAATA TTAGATTATC AATAGATGAT TTTGGTACTG GTTATTCGTC GTTAAGTTAT TTACATCAGT TTCCTATTAA TACTATTAAA ATTGACCGTT CTTTTGTGAG TCAATTAGAT TCGGATACAA GCGGTCAAAC TTTAAAAATT GTGAGTGCTG TTATTGCATT GGCTAATAAT TTAGATTTAG AAATAATTGC AGAAGGAATA GAAACTAAGG CTCAAATGAA TCAGCTCAAG CAACTTAAGT GTCGCAAAGG ACAGGGGTAT TTTTTCTCAA AGCCATTGAC AACTTCCAGT GTAAATGAAC TACTTCAGTT TACCTCACTA ACTAGTCCTT TTCTTTAA
|
Protein sequence | MIVTQIPGVC SLRLILNLIA TISRIHTFHL LKGLTSLLLT CDYLSIWKSF KQFTNKIYQL IHYIYKLLQL NSAFWKKLKS KVDDVLSSGR KKLIKLNEQA IDSSMQIYYN DNNDNNEVAK QVIVKKLKKH RQKQATLLYK AFHDGLTGLP NRNLFFNELK LVLHQSRKDL SYKYGVLFLD LDRFKVINDS LGHVIGDQVL VIISRRLQNY IKASDTVARL GGDEFTILLR SLPDADYATK VAMKINQELA QPIYVEGHEI FTTVSIGIVT NCKQFSDQLG ENSFPICPIY NHPEDVLRAA DIAMYRAKDL GKARYEVFDL TMHREVVSLL ELENDLRRAV EMIKQNPVNS QFFLSYQPII CLTTNKITGF ECLVRWAHPT KGLIQPGKFI PLAEETGLII PLGMWILRTA CHQLAVWQKR LTGRHVCSGL AYSDCNFIAH NFTMSVNISS KNISQPNFLE QVNEILAATN CQPHCLNLEI TESLIMTNVD LATAVFEKLK NQNIRLSIDD FGTGYSSLSY LHQFPINTIK IDRSFVSQLD SDTSGQTLKI VSAVIALANN LDLEIIAEGI ETKAQMNQLK QLKCRKGQGY FFSKPLTTSS VNELLQFTSL TSPFL
|
| |