Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1762 |
Symbol | |
ID | 4242605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2682530 |
End bp | 2684572 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638106886 |
Product | hypothetical protein |
Protein accession | YP_721495 |
Protein GI | 113475434 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAA TCCTCTACCT AGAAGTACCA ACACCAGATA CAAAAACCGT TTGTCAATGG CTACAATCTA AATTTGAAGC TGGTTTCGGC GAAAAAAATA TAACTCCAGA TGGCTTTATT ATCAAATTTC CAGAAGCTCC AACTCAGGAA ATTTCTGTAT TTGTCTGGTC ATTACAAAGA ACTACCTATT TAAAAGTATT TGCCCAAGGA AATATCTCTC CAATACAAAA AAAATTTACC TCAAACTTAA CTACCAATTT GCGAAAAGAA TTTCCTTTAA ATTATCCAGA ACCTCCCACA GTCGATCTAT CAAATCAGTC AATTTTTACA GCTTTAGCAT CAGATTATCC CCTCACAGTC AAATACTTTC AAAAAATGCC TAGAGGAGAA TATGACCTCA ACCGTTTATA TTGGTGGGAA AAACGTTGGC GTGAAGGAGT TAAAAATCCT CAAAAACCAA AACAAGTACT TTGGAAAAAA GAGTCTGGAA CAAATAATAA TTCAATTGAA TATGACTTAA TTTATGTTGG TGGTGCTCTG GGAACTATTC ATGCAGCAGT CATGGCAAGA CTAGGATATA AAGTATTATT ATTAGAACGG TTACCTTTCG GTAAAATGAA TCGAGAATGG AATATTTCTC GCTCAGAACT TCAGAGTTTA ATTGACTTAG GTTTATTTAC TAACGCAGAA ATAGAAAATT TAATTGCCAG AGAATATAAA GATGGATTTA ATAAATTTTT TGATGCTAAT AACCCTGATT TTTGTAAGGC AGCAATTCTC CATACTCCCA CTGTTTTAAA TGTGGCATTA GATTCAGAAA AATTACTGAA TATATGCGGA ATAAAACTAC AAGAAGCTGG AGGAAAAATC GTCGATGAAA CAGAATTTAT TAAAGCTGAA ATTGAACCAG AAAAAGTTAT AATTCACACT CAAAATTTAT CTAGCAAAAC TGAAGAAATT TTCTGTGGGC GTTTATTAGT AGATGCCATG GGTACTGCTT CTCCTATCGC CTGGCAATTA AATGGTAAAC GTGCTTTTGA TAGTGTTTGT CCGACAGTCG GAGCAATAAT TGATGGTGGA TTTGAAGCAG GTGTTTGGGA TTCAAATTAT GGAGATGTTC TAAATAGTCA TGGGGATATT TCTCGTGGTC GACAATTGAT TTGGGAGTTG TTTCCCGGTG CTGGAAATGA ATTAACAATT TATCTATTTC ATTATCATCA AGTTCATCCT GAAAATCCTG GTTCTCTGTT AGAAATGTAC GAAGATTTTT TCACTATTTT ACCAGAATAT CGACGCTGTG ACATGGAGAA ATTAGTATGG AAAAAGCCAA CTTTTGGTTA TATTCCTGGG CATTTTAGTA TTAAAGCCAG CGATCGTCAA ATTGCTTTTG ACCGTTTAGT TTGTATTGGT GATGCTGCTT CTCTTCAGTC TCCTTTAGTA TTTACTGGTT TTGGGTCTTT GGTCAGAAAT TTACCTCGTT TAACCGAGCT TTTAGATACT GCTTTAAAAC ACGATTTACT TAGTGCTAAT CACTTAAATA AAATTCGTGC TTATCAAAGC AATATTGCTA TTACTTGGTT ATTTTCTAAA GGTATGATGG TGCCCACAGG TAAATATTTA CCCCCCCAAA GAATTAATTC TATTTTGAAT ACTTTTTTTG GGTTATTAGC AGACTCGCCC CCGGAAGTAG CAGATACTTT TATTAAAGAC CGAGCTAATT GGTGGATGTT TACAAAATTA GCCCTTAAAG CAGCCAGAAA AAATTCTTAT CTTTTATTAT GGATTCTAGA TTTTATTAAT TTTCAGGAAA TATTACTTTG GACAATAAGT TATATAAATT TTACTTTGTT TAGTTTGGTA AGTTGGTTAT TAGGATGGTT GCCTAACTTT GCCCGTTGGA TTCAACCTTG GTTAGAACCT CGTTATCCTG GTTTATGGTT TTGGATATTA GCAAATAGTT ATGCCTTAAC CTATAATGTC GGTCAACCAC AGATAAACTT TAAATTACAA AAGCCATTTC AGTCCGAAAG CTTGTCTGGG TAA
|
Protein sequence | MKQILYLEVP TPDTKTVCQW LQSKFEAGFG EKNITPDGFI IKFPEAPTQE ISVFVWSLQR TTYLKVFAQG NISPIQKKFT SNLTTNLRKE FPLNYPEPPT VDLSNQSIFT ALASDYPLTV KYFQKMPRGE YDLNRLYWWE KRWREGVKNP QKPKQVLWKK ESGTNNNSIE YDLIYVGGAL GTIHAAVMAR LGYKVLLLER LPFGKMNREW NISRSELQSL IDLGLFTNAE IENLIAREYK DGFNKFFDAN NPDFCKAAIL HTPTVLNVAL DSEKLLNICG IKLQEAGGKI VDETEFIKAE IEPEKVIIHT QNLSSKTEEI FCGRLLVDAM GTASPIAWQL NGKRAFDSVC PTVGAIIDGG FEAGVWDSNY GDVLNSHGDI SRGRQLIWEL FPGAGNELTI YLFHYHQVHP ENPGSLLEMY EDFFTILPEY RRCDMEKLVW KKPTFGYIPG HFSIKASDRQ IAFDRLVCIG DAASLQSPLV FTGFGSLVRN LPRLTELLDT ALKHDLLSAN HLNKIRAYQS NIAITWLFSK GMMVPTGKYL PPQRINSILN TFFGLLADSP PEVADTFIKD RANWWMFTKL ALKAARKNSY LLLWILDFIN FQEILLWTIS YINFTLFSLV SWLLGWLPNF ARWIQPWLEP RYPGLWFWIL ANSYALTYNV GQPQINFKLQ KPFQSESLSG
|
| |