Gene Information |
Locus tag | Tery_0152 |
Symbol | |
ID | 4241745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 222270 |
End bp | 224264 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638105501 |
Product | hypothetical protein |
Protein accession | YP_720120 |
Protein GI | 113474059 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
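The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 222270
END_BP = 224264


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1995 664
```

Under these assumptions the result agrees with the listed Gene Length (1995 bp) and Protein Length (664 aa).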
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.038472 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCTA TAGATCAAAT TATTCAAAGA GAACTTAATC CTTTCGATCC TGTAACTCTT TATACTATTA ATTTTTGGCA AGAGCAACAA AATTCTAACC TGAATGTTGA TTCTATCCAT CAAAATATTA TCACTGATAT AGAAACTGTA CTAGAGCAAG TGGCTCAGGA ACATCACCCC CGTACTTTGA TACTTACAGG TGACTCTGGT TCTGGCAAAA GTTACCTTTT GGGGAGAATT AAAAAATTAT TCAATACTAA AGCTTTTTTT GTCTATGTCG ATCCTTGGCC TCATAGTGAT TATATTTGGA GACATATTTT ACGTCAAACT GTTGACTGCC TGATGAAAAC TCCAGATGGT AAAACAGATT CTCAGCTATT ACTATGCCTC AAAAGTCTAT TAGCTTTCCC AAATAGTAAT TTGGTCCAAA AAATTCTTGG TGAAAGAAGA GTATTCATTA GAAACCTTAA AGCAATTTAC CCTTCTGGGA TTTACAATCC TAATGATTTT TTCAGTGTCC TCTATCATCT ACTAGATCCC CAGTATAACG CTGTTGCCTG TGAATGGTTA CGAGGGGATG ATTTGGATGA AGACGATTTA AAATCTTTGG GCATTAAAAA TAGTATAGAT TCTGAATATT CTGCGCAAAA AATTATTACT AACTTGGGAA TTATTTCCGC TGCATCTCAA CCTATTGTTT TATGTTTTGA TAATCTTGAT AATATTCCTC GCTTGTTAAA TGGAACCTTA GATTTACAAG CATTATTTAA TGTTAATTCT AGTATTCATA CTCACTATCC ACAAAATTTT CTGATAATTA TTAGTATTAT TACGAGTACC TGGAAACAAA ATTCTAAATT CATTCAACCA GCAGATAAAG CCAGAGTTAA TGCAGGATAT TTCCATTTAA AACCAATTAC TTTAGAACAG GGTGAAGCTA TATTAGCAAC TCGTTTAAAT ACTTTACATT CCCAGGCAAC AACTAGGCCT AAATCAGCTA CATTTCCTCT GTCAAAAGAA ATATTAGAAC AGCAGTTTCC CAGAGGAATA ACTCTACCTC GGAATATACT AGAATTGGGA CGTAAAGAAT ATGAGCAGTA CAAATTAAAA TTATTGAATC AGACACAGAA TAGTCAAGAT TTAACACCAG AAACACAATT GGCGACTTTT AAGTTGATTT GGCAAGACCA CTACAATAAG ACTCAGAAAA AAATTAGCAA AATTACTGAT TTAGCAGCAC CTGAATTGAT TCGGATGCTA CAAGAAGTAT TAAATTCTTT ACAATGTCAG AAGGTAAAAA CTAAATTATT AAGTGGGAAA TATGCCAGCC ATTCTTTGAG TTATCAAAAA CAAAACCAAG AGCAAATTAT CGGTATAGTT TGGACAGAAG ATCCAAATAT GAATAGTTTT TACAATACAA TGAAGGCTTG TCAGAAAGTA GTGAATAAAA AATTGTGTCA AACTTTATAT TTACTGAGAA CTTCTGAGGT AGGAAATTCT AAAAATCTTA GCAATAAAAT ATATAGAAAA ATTTTCCAAG GTAAGTCAAA AAATATTCAT ATTAAACCTA ATTTAGAATC GGTGCATTTT TTAGCTACAC ATCATAGTTT GGTGAATGCT GTCCTCGCTA ATGAATTAGT TATTGAAGGT AAGATTATTA ATTTGAAAAA GTTTGAAGAA ATAATTGGTG AATCACAAAT TTTCAATAAC TGTAGCTTAT TAAAATATTT TTCTGTAGTT GTTTCTGTAG GTACTCAAGA ACAGGAAAAT GATCTAGATT TAGATGAGGT CAAGGATTTT 
GTATTAAATT TAATTAGAAA ACAAAGTTTG ATGGAACGCA ACAGCATAAT CGAAAATACA CTAAATCAAT TTGTCAAAAT AGAAGGTTCA AAAATTGATA TAATAATAAC AGAGCTTGAG GGAGAAAATA AAATAAAAAT TATTACTCCT ACATTAGGAT TAGATGAACA ATTAGTTTGT TTTGTTCCTG GTTAA
|
Protein sequence | MASIDQIIQR ELNPFDPVTL YTINFWQEQQ NSNLNVDSIH QNIITDIETV LEQVAQEHHP RTLILTGDSG SGKSYLLGRI KKLFNTKAFF VYVDPWPHSD YIWRHILRQT VDCLMKTPDG KTDSQLLLCL KSLLAFPNSN LVQKILGERR VFIRNLKAIY PSGIYNPNDF FSVLYHLLDP QYNAVACEWL RGDDLDEDDL KSLGIKNSID SEYSAQKIIT NLGIISAASQ PIVLCFDNLD NIPRLLNGTL DLQALFNVNS SIHTHYPQNF LIIISIITST WKQNSKFIQP ADKARVNAGY FHLKPITLEQ GEAILATRLN TLHSQATTRP KSATFPLSKE ILEQQFPRGI TLPRNILELG RKEYEQYKLK LLNQTQNSQD LTPETQLATF KLIWQDHYNK TQKKISKITD LAAPELIRML QEVLNSLQCQ KVKTKLLSGK YASHSLSYQK QNQEQIIGIV WTEDPNMNSF YNTMKACQKV VNKKLCQTLY LLRTSEVGNS KNLSNKIYRK IFQGKSKNIH IKPNLESVHF LATHHSLVNA VLANELVIEG KIINLKKFEE IIGESQIFNN CSLLKYFSVV VSVGTQEQEN DLDLDEVKDF VLNLIRKQSL MERNSIIENT LNQFVKIEGS KIDIIITELE GENKIKIITP TLGLDEQLVC FVPG
|
| |