Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4844 |
Symbol | |
ID | 4246498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 7437613 |
End bp | 7439352 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638109680 |
Product | hypothetical protein |
Protein accession | YP_724256 |
Protein GI | 113478195 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.415691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTTG AAAGACCATT AGGCTCAGTA ATTCAAGGTT CCCTCAGTCA AGGATTAGAA GTACGACTCC ACCCGGACGT ATTGGTAGAA GATATGCGGG TTGGTAAATT TTTAGTTGTA CAAGGAGTAC GCGCCCATTT TTTCTGTATG CTAACCGATG TTTTATTAGG AACATCTAGC GAACGAATAA TGATTAATCC ACCCCTACCC ACAGATGATT TTTTGCAATC TGTTTTAGCT GGAGGTAGTA CCTACGGAAC TATCGAACTC GCTCCAATGT TAATGTTAGC TATTGCTCCA GAACAATTAC CAGACTCTTT CAATTTCAAC AACACAAATG ATAATCAAAA AAAATTAAAG TCAACCCAAA ATTTAGCATC CTTTGAAGCT CAAAGCAGTT CTCAAATTAA ATTAATGCCA GTCAAAACTA TTCCTAGTCA TTTTTCCCAA GTATTTGAAG CAAGTGTCAG AGATTTTAGT TTAGTTTTTG GCAGGGAAGA CGACCCGACT CGCCGGAATT TTGCTGTGGG TAAACCTATT GATATGGATG TACCTGTTTG TTTAGATTTA GATAGATTTG TAGAACGAAG TAATGGGATT TTTGGTAAGT CGGGAACTGG TAAATCTTTT CTGACTCGTT TACTTTTATC TGGCATTATT CGGAAAGGTG CTGCCGTAAA TTTAATTTTT GATATGCACT CCGAATATGG TTGGGAAGCA ATTGCAGAAG GAAAACAAGT TAATACTGTA AAAGGTTTAC GACAATTATT TCCCGATCGA GTCGAACTTT GGACGCTTGA CCCAGAATCT ACTAGACGTA GAGGGGTGCA TGATGCACGA GACCTGTATT TAAGTTATAA CCAAATTGAG GTTGAAGATA TTGGGTTAGT GCAACGGGAG TTAAATTTAT CTGAGGCAAG TATTGATAGT GCAAATATTC TCCGCAGTGA ATTTGGTAAA TCTTGGATTA CTAAATTATT GGCAATGACT AATGAAGATA TTCAAATATT TTGTGATGAA AAAAGAGGTC ATAAAGGTTC GATTATGTCG TTGCAAAGAA AGTTGTTACG ACTGGATAAT CTGAAATATA TGCAGACAAA AAATACCAAC AATTATATAG AAGAAATCTT AGAATCTTTA GATGCAGGTA AGCACGTTAT TATTGAATTT GGTTCCCAGT CAAATATGCT TTCCTATATG TTGGCAGCTA ATATGATTAC TCGCCGAATT CATAATAGTT ATGTACGGAA AGCAGATAAA TTTTTGAGTT CTAAAAATCC GAGCGATCGC CCTCAACCAT TAGTTATAAC TATTGAGGAA GCTCATCGTT TTCTTGATCC TGCGATAGTA CGTTCAACTA TTTTTGGTAC GATAGCTAGG GAGATGCGGA AATATTTTGT GACTCTATTG GTTGTAGACC AGCGACCTTC GGGAATAGAT GCGGAAGTTA TGTCTCAAAT TGGTACGAGA ATTACAGCTT TGTTGAATGA TGATAAGGAT ATTGATTCTA TTTTTACAGG AGTTTCTGGA GGTCATAGTT TGAGGTCTGT TTTGGCAAAG TTGGATTCTA AACAGCAAGC TTTGGTATTA GGTCATGCGG TACCAATGCC TGTGGTAATT CAAACTCGTG CTTATGATCA GACTTTTTAT CAGGAAATTG GAGATACGGA TTGGCGGTAT GTTTCTGATA ATGAGTTATT TGAAGCGGCT CAAGTGGCTT TGGATGATAT TGGGTTTTAG
|
Protein sequence | MDFERPLGSV IQGSLSQGLE VRLHPDVLVE DMRVGKFLVV QGVRAHFFCM LTDVLLGTSS ERIMINPPLP TDDFLQSVLA GGSTYGTIEL APMLMLAIAP EQLPDSFNFN NTNDNQKKLK STQNLASFEA QSSSQIKLMP VKTIPSHFSQ VFEASVRDFS LVFGREDDPT RRNFAVGKPI DMDVPVCLDL DRFVERSNGI FGKSGTGKSF LTRLLLSGII RKGAAVNLIF DMHSEYGWEA IAEGKQVNTV KGLRQLFPDR VELWTLDPES TRRRGVHDAR DLYLSYNQIE VEDIGLVQRE LNLSEASIDS ANILRSEFGK SWITKLLAMT NEDIQIFCDE KRGHKGSIMS LQRKLLRLDN LKYMQTKNTN NYIEEILESL DAGKHVIIEF GSQSNMLSYM LAANMITRRI HNSYVRKADK FLSSKNPSDR PQPLVITIEE AHRFLDPAIV RSTIFGTIAR EMRKYFVTLL VVDQRPSGID AEVMSQIGTR ITALLNDDKD IDSIFTGVSG GHSLRSVLAK LDSKQQALVL GHAVPMPVVI QTRAYDQTFY QEIGDTDWRY VSDNELFEAA QVALDDIGF
|
| |