Gene Information |
Locus tag | Tery_0590 |
Symbol | |
ID | 4242747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 947925 |
End bp | 949313 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638105894 |
Product | hypothetical protein |
Protein accession | YP_720507 |
Protein GI | 113474446 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
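The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


# Values taken from the Tery_0590 record.
length = gene_length(947925, 949313)
print(length, protein_length(length))  # 1389 462
```

Under these assumptions the result agrees with the listed gene length of 1389 bp and protein length of 462 aa.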
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.401741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTCC AAGAATTATT AACTTGGGCA GAGCAGCAGG TTTTGGCCCA CACTGGAAAA CCTTTAGACG ACCTCCAAAA AGCTATTTTG TGTGGCACCT GGGAACGTGA AAAATACTCC CATATCGCTA AGTCCTATCA TTGTACTGAG GTTCATGTAA AAAAGGTTGC TTCTCTCTTG TGGAAAACAC TCTCACAAAG TTTAGGAGAT GGAAAACAAA TAAATAAATC AAATTTGCGT TCGACATTCC AAAGGTGGCG AGTTTCAAAT ATCTCACATT TCAGAGATAT CCCACATATC AGTCATGTCA GCGTCTGCGG AAACACCTCA TATCGACCAG CAGTTATCAA GCAACAACAG AGCGATCGCT TTCAACCCCC AAATAACAGC AAACCCCATA TAGATCGCTC CGAAGCACCA GACCTAGAAT ATGATTGCGA TCGCCCCGCA GAACTGGCTA CCCTAAAAAA CCTGCTCCTC AAAAAACGGA GTCGTCTAGT AGCCATTTCC GGAATTAGCG GCATAGGGAA AACAGCGATC GCCCTTCAAT TACTATCCCA AATAGAAGAT GAATTCGACC ATATAATATG GCGGAGTCTT CGCGGTGCCC CGACTCCAGA AACTACCCTC AAATCTTTAA TTCAATTTTT TGATCAAACT TTCCAAAGTC GCAAAAATCA AAAAGAAGAA GAGCAACTAT CTTTAATAAT AGAATACTTA CGAAATAACC GCTGTTTAAT CATATTAGAT GACCTCCATC AAGTTTTAGA AAAAGATAAA TTAGTTGGTC ATTATCAACC GGGATACGAA AGTTATAAAA ACCTGTTTGA AACCATTGCT GAACTATCCC ATCAAAGCTG TATAATTTTC AATACCTGGG AATTACCCCT AGAAATTCTC AATTTAAAAA ATAAAAATGC TCCTGTTTCC TACTTACAAC TAGAAGGTTT GGGAAAAGCC GCCAATAAAA TTTTACAACA AGCAGATTTA TTAGATCAAG AAAAATGGTC CGAATTAATC AATATTTATG GAGGCAACCC CCACTGGTTA AAAATTACAG CAACAGGTAT CAAAGATTTA TTTGGTGGCA GGGTAGGAGA ATATTTGCAG TACAAACCTC TATTTTTAGC AGAAGAATTG ACAGTTATCT TAAAACAACA TTTAAGTCGG CTATCGGAAT TAGAGAGTCA AATTTTACTA CAAATTAGCC ATCAAAAAGA ACCAGTTTCC ATATCTTGGT TGCAGGAGGG AAGCGATCGC TCCTATTCCA ATATTTTGAA TGCCATTATG TCTTTAGGAA AGCGATCGCT ACTAGAAAAA ATTGAAGTGC AAAAATCGAC TCTATTCGTC GTCAAACCGA TTTTTCAAGA ATATTTGCTA CAGCAATAA
|
Protein sequence | MEFQELLTWA EQQVLAHTGK PLDDLQKAIL CGTWEREKYS HIAKSYHCTE VHVKKVASLL WKTLSQSLGD GKQINKSNLR STFQRWRVSN ISHFRDIPHI SHVSVCGNTS YRPAVIKQQQ SDRFQPPNNS KPHIDRSEAP DLEYDCDRPA ELATLKNLLL KKRSRLVAIS GISGIGKTAI ALQLLSQIED EFDHIIWRSL RGAPTPETTL KSLIQFFDQT FQSRKNQKEE EQLSLIIEYL RNNRCLIILD DLHQVLEKDK LVGHYQPGYE SYKNLFETIA ELSHQSCIIF NTWELPLEIL NLKNKNAPVS YLQLEGLGKA ANKILQQADL LDQEKWSELI NIYGGNPHWL KITATGIKDL FGGRVGEYLQ YKPLFLAEEL TVILKQHLSR LSELESQILL QISHQKEPVS ISWLQEGSDR SYSNILNAIM SLGKRSLLEK IEVQKSTLFV VKPIFQEYLL QQ
|
| |
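The gene and protein sequences above can be related to each other in a few lines of code. The sketch below assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code. The helpers `clean`, `gc_content`, and `translate` are hypothetical names introduced here for illustration only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGGAATTCC AAGAATTATT AACTTGGGCA"))  # MEFQELLTWA
```

The printed residues match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the complete protein sequence and the reported GC content to be checked in the same way.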