Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3472 |
Symbol | |
ID | 4244472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5346447 |
End bp | 5347757 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638108446 |
Product | recombinase |
Protein accession | YP_723035 |
Protein GI | 113476974 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1961] Site-specific recombinases, DNA invertase Pin homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAATTA TTGCATATTC ATACACCGAT CCTCTGCAGG GACCAGCACC AGACCAAACT ATTTGGGGAT GGGAAATTGA CCAAATATAT GAAGACCTAG GCGATCGCCA ACAACTTGAA AAACTGCTAC AGGACTCCGA AGCCGAACCA CCTAACTATC TCCTAGTTCA GGAACTAGAG GAACTAGGAA ACTCAGTAGA AGAAGTGTGT AACCGCCTGG CTCAACTAGA AAGTCTGAAA ATAAAAACGA TCCCCCTAAA ATCAGAAATC AAAATAAACC AACCCTGGCC CAGCACATCG AACATAACGA AAGCCGAGCT ACTGAAACTC CTGAACGAAA TTCGCCAAAA GCAACACAGC CGAAAAATTC GTGAAGCCCA TGCCCGTAAC CGAGTCAAAG GAACACCTCC CCCAGGTAAA GCACCCTATG GTTACCGCCG AGGGAAAGAC CGCTATACCC TTGATAAAAA CACCGCCCCC ATCATTAAAG AATTTTTTGA AAGCTTTCTG TTATTCGGGT CTCTCCGTGG AGCCGTCCGT CATATTGAGC AAAAATACGG AAAAAAAATC TCAGTCACCA CAGGGCGACG TTGGTTAACT AACCCTGTTT ATCGGGGAGA CCTACAATAT CAAAACGGTG AAATTATCTC CAATGCCCAT GTTCCCATTA TTTCCAGAGA AGAAGCCTCT CAAGTAGAAC GCTTGCTCCA AAGGAACAGC AAAATGCCAC CACGAACCGC AAGCACTCCC CATTCTTTAG GAGGCTTAGT TGTGTGCAAA GAATGTCAGT CTCAAATGAT AACTGCCAAA GTTACCACCT TTCGCAGAAA CAAAGAGTAC CTTTACCTAC GGCCAAAAAG TTGCCCCCGT ACGCAAAAAT GTAAAGCCTT AGCCTATGAA GAAATATTAG AGCAAACCAT AAAAACAATT TGTCAAGAGT TACCCCTTGC TGTTGCCTCT TTTGATGCGC CACAGATAGA AGAAGCTAAA GCGAACATCA AAAATAACAT TTGTGAAAAA ACAGAAATGC TTTCAAGACT ACCAAATTTA ATAACTGAGG GAGTCTTTGA CGAAGAAACA GCCAAACTCC GCGCTTACAA ACTCAAAACA GAAATTTCTC AATTAGAAAA CAAGCTTTAT AAATTACCTC CAGTCAAGTT ATTAGAAACA GCAAAAACAG TATCCATACC TCAGTTTTGG TGGGACTTAT CTGAGTCAGA ACGCAGATTT TACCTGCGGG AATTTATCAG TAGAATAGAA ATTATTCGTC AAGGTGTAAA TTGGAATTTA CAAGTAATTT TTGTATTTTA G
|
Protein sequence | MKIIAYSYTD PLQGPAPDQT IWGWEIDQIY EDLGDRQQLE KLLQDSEAEP PNYLLVQELE ELGNSVEEVC NRLAQLESLK IKTIPLKSEI KINQPWPSTS NITKAELLKL LNEIRQKQHS RKIREAHARN RVKGTPPPGK APYGYRRGKD RYTLDKNTAP IIKEFFESFL LFGSLRGAVR HIEQKYGKKI SVTTGRRWLT NPVYRGDLQY QNGEIISNAH VPIISREEAS QVERLLQRNS KMPPRTASTP HSLGGLVVCK ECQSQMITAK VTTFRRNKEY LYLRPKSCPR TQKCKALAYE EILEQTIKTI CQELPLAVAS FDAPQIEEAK ANIKNNICEK TEMLSRLPNL ITEGVFDEET AKLRAYKLKT EISQLENKLY KLPPVKLLET AKTVSIPQFW WDLSESERRF YLREFISRIE IIRQGVNWNL QVIFVF
|
| |