Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4044 |
Symbol | |
ID | 4242072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6249775 |
End bp | 6250944 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638108950 |
Product | aminotransferase, class V |
Protein accession | YP_723531 |
Protein GI | 113477470 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.159236 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAC GTCCTATATA TCTCGACTGT AATGCAACTA CTCCCCTTGA TGAACGAGTA TTAAAAACAA TGTTGCCCTA CTTCACAGAA CATTTTGGCA ACCCCGCTAG CATTACCCAT CAATATGGTT GGGAAGCAGA AGCAGCAGTG AAAAAAGCTA GAGAAATTTT GGCTACAGGT ATTAATGCTA GTCCTGAAGA AATTATCTTT ACCAGTGGTG CAACAGAATC AAATAATTTA GCCATCAAAG GAATAGCTGA AGCTTACTTT AATAAAGGCA AACATATTAT TACGATCACT ACTGAACATA ATGCAGTTCT CGACCCCTGT GCCTATTTAC AAAATTTGGG ATTTGAAGTA ACTTATTTAC CTGTAAATAG AGATGGAATT ATCGATATAA CTCGTCTTGA AACAGCTTTG CGTGATGATA CAATTCTCGT ATCTATTATG GCAGCAAATA ATGAAATTGG AGTCTTACAA CCCTTAGCAA AAATAGGAGA AATATGCAAA GAAAATTCGA TTATTTTCCA TACTGATGCT GCACAAGCTA TTGGTAAAAT TTCTCTTGAC GTACAGGCAA TGAATATTGA TTTAATGTCA TTAACTGCCC ATAAAATTTA CGGACCAAAA GGTATTGGTG CTATCTATGT GCGTCGTCGC CATCCGAGAG TCAAAATAGC GCCTCAAATA CATGGAGGTG GACACGAACG AGGAATACGT TCTGGTACTT TGTGTACGCC TCAAATAGTT GGTTTTAGTA AAGCGGTGGC ATTGGCGTTA GCAGAAATAA AGTCGGAGGC AAAACGGTTA ACTAGTTTAC GACAACAGTT ATGGGAGAAG TTACAAACAT TAGAAAATAT TTTTCTCAAC GGACATCCGA CTCAGCGTTT ACCAGGAAAT TTAAACATTA GTGTTGAGGG TGTAGATGGC CAAGCTTTAT TGTTGGGCTT ACAAAGTGTG ATGGCGGTTT CTTCTGGTTC TGCTTGCACT TCTGCCAAAA TCTCACCTTC CCATGTTTTG CAAGCTTTAG GGCGTTCAGA AAAGTTAGCT TATGCTTCTG TGCGCTTTGG TATTGGGCGG TTTAATACTG CCGAAGAAAT AGATCTAGTA GCAGAACAGG CGATCGCCAC AATTAAATCT TTACGTCAAG CAACTACGAG TATTAAATAA
|
Protein sequence | MFKRPIYLDC NATTPLDERV LKTMLPYFTE HFGNPASITH QYGWEAEAAV KKAREILATG INASPEEIIF TSGATESNNL AIKGIAEAYF NKGKHIITIT TEHNAVLDPC AYLQNLGFEV TYLPVNRDGI IDITRLETAL RDDTILVSIM AANNEIGVLQ PLAKIGEICK ENSIIFHTDA AQAIGKISLD VQAMNIDLMS LTAHKIYGPK GIGAIYVRRR HPRVKIAPQI HGGGHERGIR SGTLCTPQIV GFSKAVALAL AEIKSEAKRL TSLRQQLWEK LQTLENIFLN GHPTQRLPGN LNISVEGVDG QALLLGLQSV MAVSSGSACT SAKISPSHVL QALGRSEKLA YASVRFGIGR FNTAEEIDLV AEQAIATIKS LRQATTSIK
|
| |