Gene Tery_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2421 
Symbol 
ID4244837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3735023 
End bp3736231 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content38% 
IMG OID638107511 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_722111 
Protein GI113476050 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.318068 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGGC AGCGTGTTTT TGTTGAAGAT GTAGCTAAAA TTGTAACTAA GGGAACTACT 
CCTACTTCTA TAGGTTTTAG CTTTTCTAAA GAAGGTATCC CTTTTCTACG AGTCAATAAT
ATCCAAGATG GTAAAATCAA TCTTGGTGAT GTTTTATTTA TTGACTCAAA AACGGATCAA
GCTCTTGCGC GTTCTCGAAT TTTAAAAAAA GATGTAATAA TTTCAATTGC TGGTACAATT
GGAAAAACCG CAGTTATTCC TACTAATGCT CCAGCAATGA ACTGCAACCA GGCACTTGCA
ATAATAAGGC TTCACAATAA TGTAGACCCC TACTATTTTA ACCATTGGCT GAATACAGGA
GATGCGTTTC GACAAATTAC AGGTTCAAAA GTGACAGCAA CTATTTCTAA CCTAAGTCTT
GGTTGTATCA AAAAGCTCAA AATCCCCCTC CCCCCAATAG AAGAACAGCG CCGAATAGCT
GCAATACTCG ACCAAGCTGA TGCTATCAGA CGAAAGAGAC AACAAGCGAT CGCTCTAACA
GATGAATTAT TGCGTTCTAC ATTCCTGGAG ATGTTCGGCG ACCCTGTTAT TAATCCGAAG
GGGTGGGAGG TAAAAAAATT AGAGGAAGTT GCATTAAAAC GCAAGGGGGC TATAAAATGC
GGACCTTTTG GTAGCCAACT ACTTATAAGC GAGTTTGTCA AAGATGGTAT TCCAGTATAC
GGAATAGACA ATGTTCAAAA AAATGAGTTT GTTTGGGCCA AACCCAAGTA TATTACTACT
GAAAAGTACG AGCAATTAAA AAGCTTTTCT ATCCAGGATG AGGACGTTCT GATTTCAAGA
ACTGGAACAG TTGGAAGAAC TTGTGTCGCA CCACCTGATA TCCCTAGAAG TATCCTTGGA
CCTAACTTGC TGAAAGTTTC CTTAAATACT AACAAAATGC TTCCTAAATA TTTGTCCTAT
GCTTTAAATC ACTCTAATCC CCTGATTGAA GAGATAAAAA GAATGTCACC AGGTGCTACA
GTTGCAGTTT TTAACACAAC AAACCTTAAA GCTTTGAGGT TAACAATTCC CCATATAAAC
CTGCAATCCC AGTTTGTCAA CTTTACTGAA AATGTTGAAT TGACAAAGCA AAAAGAGTCT
AACTACCTCA CAGAATCCAA CAACCTATTT AACTCCCTGT TACAACGCGC ATTCAAAGGC
CAACTATAA
 
Protein sequence
MKWQRVFVED VAKIVTKGTT PTSIGFSFSK EGIPFLRVNN IQDGKINLGD VLFIDSKTDQ 
ALARSRILKK DVIISIAGTI GKTAVIPTNA PAMNCNQALA IIRLHNNVDP YYFNHWLNTG
DAFRQITGSK VTATISNLSL GCIKKLKIPL PPIEEQRRIA AILDQADAIR RKRQQAIALT
DELLRSTFLE MFGDPVINPK GWEVKKLEEV ALKRKGAIKC GPFGSQLLIS EFVKDGIPVY
GIDNVQKNEF VWAKPKYITT EKYEQLKSFS IQDEDVLISR TGTVGRTCVA PPDIPRSILG
PNLLKVSLNT NKMLPKYLSY ALNHSNPLIE EIKRMSPGAT VAVFNTTNLK ALRLTIPHIN
LQSQFVNFTE NVELTKQKES NYLTESNNLF NSLLQRAFKG QL