Gene Tery_3472 details


Gene Information

Locus tag: Tery_3472
Symbol:
ID: 4244472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5346447
End bp: 5347757
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 42%
IMG OID: 638108446
Product: recombinase
Protein accession: YP_723035
Protein GI: 113476974
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
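
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5346447
END_BP = 5347757
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1311
    print(coding_codons(GENE_LENGTH_BP))   # 436
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields listed in the record.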


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGAAAATTA TTGCATATTC ATACACCGAT CCTCTGCAGG GACCAGCACC AGACCAAACT 
ATTTGGGGAT GGGAAATTGA CCAAATATAT GAAGACCTAG GCGATCGCCA ACAACTTGAA
AAACTGCTAC AGGACTCCGA AGCCGAACCA CCTAACTATC TCCTAGTTCA GGAACTAGAG
GAACTAGGAA ACTCAGTAGA AGAAGTGTGT AACCGCCTGG CTCAACTAGA AAGTCTGAAA
ATAAAAACGA TCCCCCTAAA ATCAGAAATC AAAATAAACC AACCCTGGCC CAGCACATCG
AACATAACGA AAGCCGAGCT ACTGAAACTC CTGAACGAAA TTCGCCAAAA GCAACACAGC
CGAAAAATTC GTGAAGCCCA TGCCCGTAAC CGAGTCAAAG GAACACCTCC CCCAGGTAAA
GCACCCTATG GTTACCGCCG AGGGAAAGAC CGCTATACCC TTGATAAAAA CACCGCCCCC
ATCATTAAAG AATTTTTTGA AAGCTTTCTG TTATTCGGGT CTCTCCGTGG AGCCGTCCGT
CATATTGAGC AAAAATACGG AAAAAAAATC TCAGTCACCA CAGGGCGACG TTGGTTAACT
AACCCTGTTT ATCGGGGAGA CCTACAATAT CAAAACGGTG AAATTATCTC CAATGCCCAT
GTTCCCATTA TTTCCAGAGA AGAAGCCTCT CAAGTAGAAC GCTTGCTCCA AAGGAACAGC
AAAATGCCAC CACGAACCGC AAGCACTCCC CATTCTTTAG GAGGCTTAGT TGTGTGCAAA
GAATGTCAGT CTCAAATGAT AACTGCCAAA GTTACCACCT TTCGCAGAAA CAAAGAGTAC
CTTTACCTAC GGCCAAAAAG TTGCCCCCGT ACGCAAAAAT GTAAAGCCTT AGCCTATGAA
GAAATATTAG AGCAAACCAT AAAAACAATT TGTCAAGAGT TACCCCTTGC TGTTGCCTCT
TTTGATGCGC CACAGATAGA AGAAGCTAAA GCGAACATCA AAAATAACAT TTGTGAAAAA
ACAGAAATGC TTTCAAGACT ACCAAATTTA ATAACTGAGG GAGTCTTTGA CGAAGAAACA
GCCAAACTCC GCGCTTACAA ACTCAAAACA GAAATTTCTC AATTAGAAAA CAAGCTTTAT
AAATTACCTC CAGTCAAGTT ATTAGAAACA GCAAAAACAG TATCCATACC TCAGTTTTGG
TGGGACTTAT CTGAGTCAGA ACGCAGATTT TACCTGCGGG AATTTATCAG TAGAATAGAA
ATTATTCGTC AAGGTGTAAA TTGGAATTTA CAAGTAATTT TTGTATTTTA G
 
Protein sequence
MKIIAYSYTD PLQGPAPDQT IWGWEIDQIY EDLGDRQQLE KLLQDSEAEP PNYLLVQELE 
ELGNSVEEVC NRLAQLESLK IKTIPLKSEI KINQPWPSTS NITKAELLKL LNEIRQKQHS
RKIREAHARN RVKGTPPPGK APYGYRRGKD RYTLDKNTAP IIKEFFESFL LFGSLRGAVR
HIEQKYGKKI SVTTGRRWLT NPVYRGDLQY QNGEIISNAH VPIISREEAS QVERLLQRNS
KMPPRTASTP HSLGGLVVCK ECQSQMITAK VTTFRRNKEY LYLRPKSCPR TQKCKALAYE
EILEQTIKTI CQELPLAVAS FDAPQIEEAK ANIKNNICEK TEMLSRLPNL ITEGVFDEET
AKLRAYKLKT EISQLENKLY KLPPVKLLET AKTVSIPQFW WDLSESERRF YLREFISRIE
IIRQGVNWNL QVIFVF
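
The protein sequence follows from the gene sequence under translation table 11, where the initial GTG codon is read as methionine. A minimal sketch of that translation is given below. It assumes the standard NCBI base ordering (TCAG) for the codon table and the table 11 set of alternative start codons. The helper name `translate` is hypothetical, and the code is not the database's own pipeline.

```python
from itertools import product

# Codon table for NCBI translation table 11 (same amino acids as the
# standard code), listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; at the first position they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("GTGAAAATTA TTGCATATTC ATACACCGAT"))  # MKIIAYSYTD
```

Applied to the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the listed protein sequence.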