Gene Tery_4472 details


Gene Information

Locus tag: Tery_4472
Symbol:
ID: 4246125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6902230
End bp: 6903477
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 32%
IMG OID: 638109355
Product: restriction modification system DNA specificity subunit
Protein accession: YP_723932
Protein GI: 113477871
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.270866
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0902542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAATT TTAATTGGCA AAAATATCCG GTTTATAAAA GTTCTGGTGT TGAATGGTTA 
GGAGAAATAC CGGAACATTG GGAGATGAAA AGACTAAAGT TTATATCTCA TCTTGTTTAT
GGTGACTCTC TAGGTTCAGA AAATAGAGAA GACGGTAATA TTAATGTATA TGGTTCTAAT
GGAATGATAG GTCTGCATTC AAAAGCGAAT ACTCTTTCAC CAGTAATTAT TGTTGGTAGA
AAAGGTTCTT TTGGTAAAAT TCAATATTCT TTGTTTCCTT GTTTTTGTAT TGATACAGCT
TATTTGATTG ACCAAAGAAA AACCAAACAA AATCTCAAAT GGTTATGCTA TGCTCTACAA
ATATTAGAGC TTGATAAAAT TTCCCAAGAT ACAGGTGTAC CAGGGTTATC TCGCGAAAAA
GCTTATCAAA AATTAGTTCC AGTTTCTCCC CTTTCAGAAC AACAAGCGAT CGCTAATTTC
CTAGACGAAA AACTCGCCCA AATAGATGAA TATATTGCCA AAAAACAACG GATAATAGAA
CTACTAAAAG AACAGAAAAC TGTTATTATT AATCAGGCAG TTACTAAAGG TATTAACCCA
GATGTTTCGA TGAAATATTC TGGTATTGAA TGGTTAGGAG AAGTACCGGA ACATTGGGAG
GTTTTACCAG CATTTGCAGT TTTCAAAGAG CAATGTGTAA TCAACAGAGA CTTAGTTGAA
AAAAATCTTT TGTCTCTTAG CTACGGTAAA ATAATAAGAA AAAGTTTTAC TAACAATTTT
GGTTTACTTC CAGAGTCTTT TGAAACTTAC CAAATAGTCA CACCTGGTAA TATAATACTA
AGACTAACAG ACTTGCAAAA CGATAAAAGA AGTTTGAGAG TAGGGTTGGT TAAAGAAAAA
GGAATAATAA CATCTGCTTA CCTTTGTTTA AACCCTCAAA ACGTAATTCC AGAATATGTG
TATACCCTAC TTCACATCTA TGATATTTTG AAAATTTTTT ATTCTATGGG AAGCGGTGTA
AGACAGAATA TGAAATTTAA AGACCTCAAA AGACTTCCTA TTACTTTTCC TCCAGTTTCT
GAACAAAAAG AAATAGTGTC ATTCATTGAA AAAAAATTAG AAAAAATTGA GCGATCGCTC
ACCGTAATAG AAAAGGAAAT AAAATTAATC CAAGAATATC GAACAACTCT AATTTCTGAA
ACAGTAACAG GCAAAATAGA CGTTAGAAAA TATCATCCTC CCCAATAA
 
Protein sequence
MVNFNWQKYP VYKSSGVEWL GEIPEHWEMK RLKFISHLVY GDSLGSENRE DGNINVYGSN 
GMIGLHSKAN TLSPVIIVGR KGSFGKIQYS LFPCFCIDTA YLIDQRKTKQ NLKWLCYALQ
ILELDKISQD TGVPGLSREK AYQKLVPVSP LSEQQAIANF LDEKLAQIDE YIAKKQRIIE
LLKEQKTVII NQAVTKGINP DVSMKYSGIE WLGEVPEHWE VLPAFAVFKE QCVINRDLVE
KNLLSLSYGK IIRKSFTNNF GLLPESFETY QIVTPGNIIL RLTDLQNDKR SLRVGLVKEK
GIITSAYLCL NPQNVIPEYV YTLLHIYDIL KIFYSMGSGV RQNMKFKDLK RLPITFPPVS
EQKEIVSFIE KKLEKIERSL TVIEKEIKLI QEYRTTLISE TVTGKIDVRK YHPPQ
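
The length fields in this record can be cross-checked against each other with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the gene is a stop codon (the gene sequence above ends in TAA). The GC helper simply counts G and C among the A/C/G/T characters it is given and ignores spaces and line breaks.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon yields one residue.
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Gene Information fields of this record.
gene_len = span_length(6902230, 6903477)  # 1248
prot_len = protein_length(gene_len)       # 415
```

Under these assumptions, the coordinates give 1248 bp, and 1248 / 3 = 416 codons, one of which is the stop codon, leaving 415 residues, matching the Gene Length and Protein Length fields.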