Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0047 |
Symbol | |
ID | 4242584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 71367 |
End bp | 72836 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638105417 |
Product | hypothetical protein |
Protein accession | YP_720036 |
Protein GI | 113473975 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATTC TACACCTGAA TCTTAAGCTA AAAGGAGATG ATTCTGTTGA ATTCCGTTTC TATTGGGATA ACCTCCTCGA GTTTCAACCT GTTTCGCGCA GTCTAGGAGA AATCTCAGAC TTAATTAAAA AGTCAGAAGC TGAGTATTAT ACTTACCTTC CAGTAGACTA TACCCAAACA GGTCAAAATC TCTACAAATG GCTAAATGGA GATAAACGGG ATCTAGACAG AGTAATTCAT GATGGTTCAT CAGGCGAAGG AATTGTTCTA GCAATTGCAA CATTCGAAGG GCTATCACAT CTACCTTGGG AACTGCTGCA TAATGATGAA GATTTCTTAG TTAATTCTAC ACCTGCAATT GTTCCCGTTC GTTGGGTGCC TAATTCAGGA AGCATGAAAG TACTTACTAT GGATGATAAA CCCAAGAATG AAGCATTGAG TGTGGCATTT ATGCCAGCTT CTCCTCACAA TAAACCAAAA TTAGACCTTG TAGAAGAGGA AATAAGTATT TTAAAAGCAA GCCAAGAGTC CCCTTTCTCT CTAGAAATAG AATATAGTGG TTGCTTGAAT AAGCTGAAGG GTTTGATAAA TAAACATGAT AAAGGCTTTT TTGATGTGCT TCATCTAGCA GGTCATGCAG AAAACTCTTA TTTGATTACA GAAACAGAAT CTGGAGAGGC TAAATATAGT AGTGCTGAAG ATATTGCTGA TGCACTACTA TTTAAAAATC CTCAACTGAT TTTCTTGTCG GCTTGTCGGA CTGCTTATTC ACGGGAGTCA TCTATTCCTT CGATGGCTGA AAGTTTAGTA AAAAATGGTT TTAAAGCAGT TTTGGGCTGG GGCGATCGGG TATTAGATAC TAGCGCAATA GCGGCTGCAG AAATATTTTA TCAGAATTTG TCATCAGGTT CTACAGTAAC CCAAGCTTTG GCTCAAACTT ATCGGAAACT AATTGAAACA GAGAAAGGTT GGCACTTGTT GCGATTATAT GTCAAAGAAA GTTTACCGGG AGCATTAGTA ACAGCATCGG CAACACTTGG ACGAATACCA CCGCATCGGA AAATAACGAC TCTACCTGTA AAAAATTGTT ACTATCAGCT GAAACAATGT GTATCTATCC TAAAACAAGA TAGTAAAAAA GGTGTCTTGA TTTTAGGAAA AAGTAATATT ACTTCCTTAT TTACTGATGG ACTATGTGAA CGTTTACCCA AGTCTAAGAT TGTTAAATTG GATCAACAAA TTGATGAATA CGAATTCGTT AATAAACTAG CGAGTGAATT GGAAAATTCA GAGCAGCGTA ATGCACTTCG TAATGCATTT CAACGCGGTG ATAAACTTAA ATATCCACTA AGAGATTTAT TTAAAAAACG GAATGAAACT GGACTAGAAC CTTTAGACAT CTCCAAAAAT ACTCAAACAT TTATCCTAAC TATCTTTAGT GGTGCGATCG CTACTTATGA AGATAGTTAA
|
Protein sequence | MQILHLNLKL KGDDSVEFRF YWDNLLEFQP VSRSLGEISD LIKKSEAEYY TYLPVDYTQT GQNLYKWLNG DKRDLDRVIH DGSSGEGIVL AIATFEGLSH LPWELLHNDE DFLVNSTPAI VPVRWVPNSG SMKVLTMDDK PKNEALSVAF MPASPHNKPK LDLVEEEISI LKASQESPFS LEIEYSGCLN KLKGLINKHD KGFFDVLHLA GHAENSYLIT ETESGEAKYS SAEDIADALL FKNPQLIFLS ACRTAYSRES SIPSMAESLV KNGFKAVLGW GDRVLDTSAI AAAEIFYQNL SSGSTVTQAL AQTYRKLIET EKGWHLLRLY VKESLPGALV TASATLGRIP PHRKITTLPV KNCYYQLKQC VSILKQDSKK GVLILGKSNI TSLFTDGLCE RLPKSKIVKL DQQIDEYEFV NKLASELENS EQRNALRNAF QRGDKLKYPL RDLFKKRNET GLEPLDISKN TQTFILTIFS GAIATYEDS
|
| |