Gene Tery_4275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4275 
Symbol 
ID4245927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6591534 
End bp6594539 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content44% 
IMG OID638109167 
Producthypothetical protein 
Protein accessionYP_723745 
Protein GI113477684 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.196704 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATT ATCCCAAACG CTTAATTGAA GTAGACCTCC CAATTAAACA AATCTCTGCC 
CATGCTAGAC GAGAAAAATC AATTCGTCAC GGTCATATTT CGACCTTACA TATTTGGTGG
GCAAGACGAC CTTTGGCTGC TTGTCGGGCT GTAATTTGTG CTGCTCTTTG GTTAGACCCA
GTTGATGATA ATTGCCCGGA ATTGTTTCGG GTTGAGGCTG CAAAAGTTAT TAATGGTTTT
GCCAAAATAG TTGATAAAAA TTTAGATAAT CATGGCTCCA CAGCAAACCT AAAAAAATGG
CAAGTTTTAG CTAAACCTAA AAATCAGTTA AATCCAGAAA AAATAGAACA TTTAAACATA
CTCCGTTATC GCTTATTAGA CTTTATTGCT GACTTTGCTA ATTGGGATAA TTCTACACAT
CCAGATTATT TACAAACCAG TCGTAAACTA ACAGAAATTA GTCATTATTC TTTAGGAGGA
ATTACTGATA CAAAACCCGT AGTTTTTGAC TCTTTTGCGG GTGGAGGTGC GATTCCTTTG
GAGGCTTTGC GGGTAGGAGC AGATGCTTTT GCCTCTGATT TGAATCCGGT GGCGGTTTTG
TTAAATAAGG TTGTGTTGGA ATATATTCCT AAGTATGGGC AAACTTTGGC GGATGAGGTG
CGTAAGTGGG GAGAATGGAT AAAGCAGGAG GCAGAAAAGG AATTGGGGGA GTTTTATCCT
AAATCTGCTG CAACACTTAA GAATGGGGAT GTGGAAACAC CTATTGCTTA TCTTTGGGCG
CGGACGATTG TTTGTGAGGG GCCTGGATGT GGGGCGGAGG TTCCTTTGAT GCGGAGTTTG
TGGTTGGCAA AGAAAAAGAA TCGGTCGGTG GCGTTACGGT TGATTCCTCG GCAGGAGGAA
AAGCGAGTTG ATTTTGAGAT TGTGGAGCAG GTGAAAGGGA AAGATGTTGG GGATGGAACG
GTGAAACGAG GTTCAGCGAC TTGTCCTTGT TGTGGGTTTA CGACTCCGGT GGCTTCGGTG
CGAAAGCAGT TGCAAGCAAG GGGTGGGGGT GCGGATGATG CCCGGTTGTT TTGTGTGGTG
ACGACTAGGG AGAAGGTGAA GGGTAGGTTT TATCGGTTGC CCAATGAGAG AGATTTGGAA
GCGGTGAGAA TGGCAGGTGA GGAGTTGGCA AGGCGGAAGT TGGAGTATGG TGGGGAGTTG
AGTTTGGTGC CGGATGAACC GGTGCCAGTT ATGAGCGGTG TTTTTAATGC GCCCATTTAC
GGACATAACA CCTGGGGAAG TTTATTTTCT TCTCGTCAAG CTTTAGCTTT AACTACTTTG
GTGAGGTTAG TTAAGGAAGT GGGAAAAAAA TTGGCTAGTA ATGAGGATGA AAGGTTGGCG
ATCGCTATTC AGACTTGTTT GGCTTTGGCA GTTGATCGCT GCACTAATCA GTTTTGCTCT
CTATCAAAGT GGAATAACAG TCGAGAGTTA ATAGACGGTG TTTTCGCTAG GCAAGCTTTA
CCCATGCTTT GGGATTTTGG AGAAACAAAT CTAGTGGGAG GTTCTGACGG ATACTGGCAA
GGTGCTGTCA GTTGGGTTTT ATCCGTCATC AAAGCATCAT TATTACCCGA CCCCGGTCAA
ACCCAACAAG CAAATGCAGC GAGTCACCCC CTACCAGACG ACTTTACTCA ATGCTTCTTT
TCCGATCCTC CTTATTACAA CGCTGTACCC TACGCTGACT TATCCGACTT TTTCTATGTC
TGGCTAAAAA GAACTCTAAA CAAAACCTAC CCCAACTTAT TTTCTAGTGA AACTACTCCG
AAAGATGATG AAATTTGTGA AATGGCAGGA TGGGATTCCA AAAGATACCC ACACAAAAAC
GGCAAATGGT TTGAAACCCA GATGGCAAAA GCAATGGCAG AAGGATGTCG CATCATATCC
TCAGACGGAA TAGGTATTAT TGTCTTTGCC CACAAATCAA CTGCTGGATG GGAAGCTCAA
CTACAAGCAA TGATAGATGC TGGTTGGAAA ATAACCGCAT CCTGGGCAAT AGACACAGAA
AGGGGCAACA GACCCCGCGC CCAAAACTCT GCTGCCCTCG CCTCATCCAT CCATCTTGTT
TGCCGACCCC GCAACAATAA CAATATCGGC GACTGGCGCG ACATCATCCA AGAGCTACCC
CAACGCATCC AAGACTGGAT GCCACGTCTG ACATCAGAAG GTATAGTCGG TGCTGATGCT
ATCTTTGCTT GTATCGGACC CGCCCTAGAA ATATTCTCTC GTTACTCCAG CGTGGAAAAA
GTCAATGGGC AACCAGTCAC TTTGGGCGAA TACCTAGAAT ATCTCTGGGG AACAGTAGCA
AACGAAGCCT TAAAACAGAT ATTCTCGGAA GCTGACACTA ATAACCTGGA AGCAGACGCT
CGTCTCACAG TCATTTGGTT ATGGACGCTT TCTACTGGCA GCACCGAATC AGCAACAGAA
ACAGAGTCAG ACCAAGATGA GGAAGACACC CCCGCCAGCA AAAATCAAAA ATCTACAGGT
TTTGCCTTAG AGTTTGACAC CGCCCGCAAA ATCGCCCAAG GTTTAGGAGC GCACCTAGAA
AACCTCACCC ATTTAGTAGA AATAAAAGGT AATAAAGCCA GACTTTTACC AGTACAAGAA
AGGAGTCATT ATTTAATCGG AAAAGTTGCC CCTACTACTA ATACTAAAAA ACGCAAAAGT
AAGACTCCAA AACAACTAAG TTTAGAAGGT ATTTTGCCAG AAAAAGAAAC TGAAATAGAA
GTAGAAGCAG ATATAGCGAT CGCTGAGATA GGCAAAACCG TATTAGACCG GATACATCAA
ACCATGTTGT TACACAAAAC AGGACGTAGC GAAGCAGTGA AACGACTATT AGTAGAGGAG
GGAGTAGGCA AAGACCCCAG ATTTTGGAAT TTAGCAGATG CCCTATTGCG GGTATATCCT
CAGAATGTGG AAGAGAAACG TTGGTTAGAG GGAATATTGG CAAGGAAAAA AGGATTGGGT
TTTTAG
 
Protein sequence
MTNYPKRLIE VDLPIKQISA HARREKSIRH GHISTLHIWW ARRPLAACRA VICAALWLDP 
VDDNCPELFR VEAAKVINGF AKIVDKNLDN HGSTANLKKW QVLAKPKNQL NPEKIEHLNI
LRYRLLDFIA DFANWDNSTH PDYLQTSRKL TEISHYSLGG ITDTKPVVFD SFAGGGAIPL
EALRVGADAF ASDLNPVAVL LNKVVLEYIP KYGQTLADEV RKWGEWIKQE AEKELGEFYP
KSAATLKNGD VETPIAYLWA RTIVCEGPGC GAEVPLMRSL WLAKKKNRSV ALRLIPRQEE
KRVDFEIVEQ VKGKDVGDGT VKRGSATCPC CGFTTPVASV RKQLQARGGG ADDARLFCVV
TTREKVKGRF YRLPNERDLE AVRMAGEELA RRKLEYGGEL SLVPDEPVPV MSGVFNAPIY
GHNTWGSLFS SRQALALTTL VRLVKEVGKK LASNEDERLA IAIQTCLALA VDRCTNQFCS
LSKWNNSREL IDGVFARQAL PMLWDFGETN LVGGSDGYWQ GAVSWVLSVI KASLLPDPGQ
TQQANAASHP LPDDFTQCFF SDPPYYNAVP YADLSDFFYV WLKRTLNKTY PNLFSSETTP
KDDEICEMAG WDSKRYPHKN GKWFETQMAK AMAEGCRIIS SDGIGIIVFA HKSTAGWEAQ
LQAMIDAGWK ITASWAIDTE RGNRPRAQNS AALASSIHLV CRPRNNNNIG DWRDIIQELP
QRIQDWMPRL TSEGIVGADA IFACIGPALE IFSRYSSVEK VNGQPVTLGE YLEYLWGTVA
NEALKQIFSE ADTNNLEADA RLTVIWLWTL STGSTESATE TESDQDEEDT PASKNQKSTG
FALEFDTARK IAQGLGAHLE NLTHLVEIKG NKARLLPVQE RSHYLIGKVA PTTNTKKRKS
KTPKQLSLEG ILPEKETEIE VEADIAIAEI GKTVLDRIHQ TMLLHKTGRS EAVKRLLVEE
GVGKDPRFWN LADALLRVYP QNVEEKRWLE GILARKKGLG F