Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2828 |
Symbol | |
ID | 4245131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4394349 |
End bp | 4396478 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638107878 |
Product | protein kinase/helix-hairpin-helix DNA-binding domain-containing proteins |
Protein accession | YP_722475 |
Protein GI | 113476414 |
COG category | [R] General function prediction only |
COG ID | [COG4248] Uncharacterized protein with protein kinase and helix-hairpin-helix DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.647129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.367028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTTGC AACGTCAGTT CAGTGAACAA ATTATAACCT TGGATACCCA AAACCCCATT GGTGGGGGTG GTGAAGGTCG TATCTATCCT GTGCTCAAAA ACCCCTCCTT AGTGGCCAAA ATTTACCATA AACCGACAGA CGAGGACGCC GATAAACTGA CCGTAATGTT CTCTCATCCA CCAGATGTAC CCATAGGAAG CGGACATGCA GCGATCGCAT GGCCTATAGA TCTATTGCGC ACTACTAATA ACAAACATCA AATAGTTGGT TTTTTAATGC CTCGGGTTTC CAGAGCACAA CCCCTACATA TATTCTATAA CCCTCGCAAC CGTCGAGAAC AAAAACCTCT TTTCAACTAT CGTTACCTCC ATCGCACAGC TCGTAACTTC GTCGCAGCCG TGAACGCTTT ACATAACAGT GGATATACTA TTGGTGACAT CAATGAATCA AATATTCTCG TCACCGATAC TGCTCTTGTT ACCCTAGTTG ACACAGACTC CTTTCAAGTG CGCGACCCCT ACACCAAAGA AATTTACCGT TGCCCCGTCG GCAAACTAGA ATTTACTCCC CCAGAACTAC AAGGTCAGAG TTTCCGGAAA ATTGACCGCA AACCTGAACA TGACCTATTT GGCATGGCAG TATTAATATT TCAACTTTTG ATGGAGGGAA CTCACCCTTT TGCAGGAATT TACAAAGGTA GAGGAGAACC CCCCAACCTT GAAGCTCGGA TCCAGGCTGG TCATTTTCCC TATAGTCAAA AGCGAGTTCC CTATCGCACG ACTCCAACGG CACTACATTG GGAAATTTTA CACCCCACTA TACAAGAGCT ATTCATTAAC TGTTTTGAGT TAGGGCATGA ACGCCCAGAG TTGCGCCCAG ATGCTAAGGC TTGGTTACAG GGGTTGCGAG AGTCAGAAGA TAGTCTAGTG CACTGTCAAA AAAATCATCA GCATTGGTAT GGAAACTACC TAAAGTCTTG TCCTTGGTGC GCGCGGACTA AACTTTTAAA AGGACGAGAC CCTTTTCCAT CAAGGGAAGC AGTTAGAAAT AAGCAACATT TGGAACCTGC TCTCAAAAAA GGGAAACTTT TACCTTTAAA GTCACCAAAA GCTACTCCTT CTCAAAAACC AGCTCCACCA AAACCAATTG TGATCACTCG ACCCATGGGG CAATATCAAC CTTTACTGAC TCCTGTTGTT ACTAAATTTA AGCCCCCCTC TTTACCAATA ATTCCTTTGC CAGGGGGTGG TAAAGGTCTT CTTGGTATTG CTCAAGATGC TATTTTTGGT GGTTTTTGGG GAGTTTTGCT GGCGGCGGCA GTAGCAGGTC TGGTCTTAGG CATAGTTAAC AGTAGCATTG CTATTTTAGG TGGTGTTGTT ATGGGAATAT TTTGGGGTGC TTTTTTCGCG ATCGCTTGCC AATATGTTAT CCCTATGAAC TCATCTAGTG CTAGTTTAGG TTTAACAGGG GGGCTCTGGG GAGGGTTTCT GGCTGCTGCT ATTACTGGTG CTGTTTATGG AATAGGTAAC CGTCTTGGTG CATTTGGCCA AGCAGTCGTA TTAGGAGGAT TTTTAGGTAC TATTTGGGGT AGTGTTTGGA GTTATTTTAA ACCACCTTTA GCGTTTCCTG TTCCTGGGCG AGTTTTGGGT AGACGAGGTT TATTTTTAGG TGTAATTTGG GGTGCTTTTT TAGGTACAGT TATTGGAGCT ATTTTAGCTG GAATTTTTGT CTTTCAGGAA GCTATGAATG ATAATAGTAA TTCTTGGGAT GAATTGGTAT CTATATTAAT AAGTAGTATG ATTTCGGCAA TGGGTGTCGG TGCTATGGGT GGAGTTATAG GGGGTGTAAT TTTAGGACCA ATTTTAGGTG CTCCTACTTT GCCTTTGTCT AATAATTTAT CCGGGGCTAG AGGTGCTTTA TTAGGTGGAA TTTGGGGGTG TTTTTTAGGT ACAATTATTG GCATAATTTG CGGTGGTTTT TTACCACAAA TAGTGTCAAT AGACTTAGTA GAAATTATGC GAGTTAATGG AGACTGGACT TCAATTTTTT CAACAATTTT TGGAGCTGGT TTAGGTGCTC TTTGTGGCTG TTTTTCTGGG GCAACTTGGG GTTCTATGGG GAAATGGTAA
|
Protein sequence | MRLQRQFSEQ IITLDTQNPI GGGGEGRIYP VLKNPSLVAK IYHKPTDEDA DKLTVMFSHP PDVPIGSGHA AIAWPIDLLR TTNNKHQIVG FLMPRVSRAQ PLHIFYNPRN RREQKPLFNY RYLHRTARNF VAAVNALHNS GYTIGDINES NILVTDTALV TLVDTDSFQV RDPYTKEIYR CPVGKLEFTP PELQGQSFRK IDRKPEHDLF GMAVLIFQLL MEGTHPFAGI YKGRGEPPNL EARIQAGHFP YSQKRVPYRT TPTALHWEIL HPTIQELFIN CFELGHERPE LRPDAKAWLQ GLRESEDSLV HCQKNHQHWY GNYLKSCPWC ARTKLLKGRD PFPSREAVRN KQHLEPALKK GKLLPLKSPK ATPSQKPAPP KPIVITRPMG QYQPLLTPVV TKFKPPSLPI IPLPGGGKGL LGIAQDAIFG GFWGVLLAAA VAGLVLGIVN SSIAILGGVV MGIFWGAFFA IACQYVIPMN SSSASLGLTG GLWGGFLAAA ITGAVYGIGN RLGAFGQAVV LGGFLGTIWG SVWSYFKPPL AFPVPGRVLG RRGLFLGVIW GAFLGTVIGA ILAGIFVFQE AMNDNSNSWD ELVSILISSM ISAMGVGAMG GVIGGVILGP ILGAPTLPLS NNLSGARGAL LGGIWGCFLG TIIGIICGGF LPQIVSIDLV EIMRVNGDWT SIFSTIFGAG LGALCGCFSG ATWGSMGKW
|
| |