Gene Tery_4303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4303 
Symbol 
ID4245955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6626753 
End bp6630622 
Gene Length3870 bp 
Protein Length1289 aa 
Translation table11 
GC content37% 
IMG OID638109193 
Productpeptidase-like 
Protein accessionYP_723771 
Protein GI113477710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0594037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0791445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACA GATATAACAA ACCAGATCAA GTAGACAATA GTCGGAAAAA AGCTTATCAA 
GTTGGAGCTT TGCAGAATGA TGAGAAGTTT GAGGATTTTG TAGGTAAGTC TGATAAGAGG
GATTTTTATC AGTTTAAGGT TGAAGAAAAG ACGGATGTAG ATATTAAGTT GGATGACCTG
AGTGGGAATG CAAATTTGTA TTTGCTGAAT AATAAGGGGA AGCTTATTGA GAAGTCTACT
AAAGGTGGTA AGAACGCAGA AGATATTGAG CGGACTTTGA ACCCAGGGGC TTATTATGTA
AAGGTGCAGT CGAATGAGGA AGATGCTAAT TATACTTTGA GTTTGGATGT AGGTTCAGGT
TCTCAGCAAG AAGATAAGGT AGGTAATAAC CGGAAAAAAG CCTATAAATT CGGAGCTCTG
AAGAATGATG AGAAGTTTGA GGATTTTGTA GGTAAGTCTG ATCAGAGGGA TTTTTATCAG
TTTAAGGTTG AAGAAAAGAC GGATGTAGAT ATTAAGTTGG GTGGCCTCAG TGGAAATGCG
GATTTGTATT TGCTGAATAA TAAGGGGAAG CTTATTGAGA AGTCTACTAA AGGTGGTGAT
AGCTTAGAAA ATATTGAGCG GACTTTGAAC CCAGGGGCTT ATTATGTAAA GGTGCAGTCG
AATGAGGAAG ATGCTGATTA TACTTTGAGT TTGGGTGTAG GTTCAGGTTC TCAGCAAGAA
GATAAGGTAG GTAATAACCG GAAAAAAGCC TATAAATTTG GAGCTCTGAA GAATAATGAG
CAGTTTGAGG AATTTGTAGG TAAGTCTGAT AAGAGGGATT TTTATCGGTT TAAGGTTGAA
GAAAAGACGG ATGTAGATAT TGAGTTGGGT GGCCTCGGTG GGAATGCAGA TTTGTATTTG
CTGAATAATA AGGGGAGGCT TATTGAGAAG TCTACTAAAG GCCTTAATAA GGTAGAAGAT
ATTGAGCGGA CTTTGAACCC AGGGACTTAT TATGTAAAGG TGCAGTCGAA TGATGAAGAT
GCTAAATATA CTTTGAGTTT GGATGTAGGT TCAGGTTCTA AGAATCTAGA AAAGTACGAG
TTTACTTACT ACTATAGTGG TAGTCAGGAT AATGAGTTTC AGGACTACTA TAGTGGCTAT
TTTTACGCTC CAAAAGGAAC TTATGAAGTA GATAGTTACT ATGACTTTAA TGAAGAAAAA
AATGAAGCAA GTGCTAACGG TAAATACTAT ATTTCTAGTT CTAGTAAAGC GGGGGCTAAA
GCAAAAGATG GAGAGGTTTA CCTGGAAAGC TATTATGATT TAGAAACTAA AAAAGAGTAT
GTTCCTTATT ATGCTAGTGA AGATTTGCCT TCTGGTGCTA GTGGCATGGG GAGTGAAGAA
GACTTTATTG ATTTTGAGAA AGATGAGGAT GTAATTGAGC TAAAGCAGGA AGGCTTTTTT
GGTGCTGACT ATTCTCTGGC AAATATAGTT AAAGAGGCAC AAACGTCTAA ATCTGGAGAT
TATCAGATTG ATGCGTTATT AAGCCCTTAT AAGTGGGGAA CTAACACAAT AACTTATAGC
TTCTATGATA ATGATTCAGG TCCATATTAT GGTGCGGAAA GAGATGTTGG GGAGGTTAGT
GATAAGGTTC AGGAAGATAT TCGACATATT CTTGAAAATA TTTATGAGCC TCTTTCGGAT
CTTGATTTTG TAGAGGTAGC GGATACTTCT AATAGCTATG GTCTAATTCG AATTATGAAA
TCAAGTAACC CAAATTATGC TTATGCATAT TATCCGTATG CAGATGATTA TAATGCGGGA
AATCTTCTTG ATTTGGCTGG TGATGTCCAT TTTAATGTTT ATTATGATGG TAATGGTGAT
CATAACAATT TTCAGGGGGG TCCGGGAACT CATGGTTATA TGACTCTGAT CCATGAGTTA
GGTCATGCTG TTGGTTTAAA GCATCCTCAC GAGGATGGTG ATACGTTGCC AGAAAATGAG
GATAATACGA GCAATACTGT GATGTCATAT AACTTTACTT ACACCGCAGA TAATCTGAGG
TCTTATGACG TGAATGCATT ACAGTATATC TACGGTTTTG AAACTCCAGA AGATTCAGTG
ACTGTGAAGT CCCCTAATGG AGGTGACACT CTGAAGTTAG GAAAAAGCTA TACTATTACT
TGGGATGATA ATTTTAGTGA AAATGTGAAG CTGGAGTTGT ACAAGGGGGG TTCGTTTGAT
AGCATAATTA CTAATTCAAC TGCTAGTGAT GGCAGTTATG GTTGGACTTT GCCTACGTCT
ATGGCTATTG GTAGTGACTA TAAGGTGAAA ATTACTAGTG TCAGTGATGC TGGGGTTTCT
GACTTGAGTG ATAGTAATTT TACTATTGAA CGAGATGGTG TTATTACTGT GAAGTCGCCT
AATGGAGGGG ATGTTATCAA TACAGGGGAT ACTTACGATG TTACTTGGGA TGACAATATC
AGCGAGAATG TTCGTATTTA TCTGTATCAA GGTAACAGGT ATAAGCAGAT GATAAGTAAT
TCAACTCCTA GTGATGGTAG TTACAGTTGG ACGGTGCCGA CTAGTTTGAC TGCTGGTGAT
AATTATGAAG TTGCTATTCA AAGTGTTGAG AAAGGTAGTC TTTATGACTA TAGCGACAGT
AGTTTTACTA TTGAACCAGA TGGTGTTATT ACTGTGAAGT CGCCTAATGG GGGGGATGTT
ATCAATACAG GAGATACTTA CAATATTACT TGGGATGACA ATATCAGCGA GAATGTTCGT
ATTGATCTGT ATCAAGATAA CAGGTATAAG CAGATGATAA GTAATTCAAC TTCTAGTGAT
GGTAGTTTCC GTTGGAGGGT GCCGACAAAT TTGGCTACTG GTGATAATTA TCAAGTTGCT
ATTCAAAGTG TTGAGAAAGA TCGTATTTTT GATTTTAGTG ATAGTAATTT TACTATTGAA
CCGGATGGTG TTATTACTGT GAAGTCGCCT AATGGGGGGG ATGTTATCAA TACAGGAGAT
ACTTACAATA TTACTTGGGA TGACAATATC AGCGAGAATG TTCGTATTTA TTTGTTTCAG
GGTAACAGGT ATAAGCAGAT GATAAGTAAT TCAACTCCTA GTGATGGTAG TTACAGTTGG
ACGGTGCCGA CTAGTTTGAC TGCTGGTGAT AATTATGAAG TTGCTATTCA AAGTGTTGAG
AAAGGTAGTA TTTATGACTA TAGCGACAGT AGTTTTACTA TTGAACCTGA TGATTTTATT
ACGGTGATGT CGCCTAATGG AGGGGATATT CTGAATACAG GAGATACTTA TAATATTACT
TGGGATGATA ATATTGGGGA GAATGTTCGT ATTTATTTGT TTCAGGGTAA CAGGTATAAG
CAGATGATAA GTAATTCAAC TCCTAGTGAT GGTAGTTACA GTTGGAAGGT ACCGACAAAT
TTGACTACTG GTGGTAATTA TCAAGTTGCT ATTCAAAGTG TTGAGAAGGG TAGTCTTTAT
GATTTTAGTA ATAGCAATTT TAGTATTAAA ACAGAAGATG TGAGTTCTGA TAAATACTAT
TTTACTTATT ATTATAATGT AGGTGATTCT TATAGTGGTT TTTTGTATGA AAAATCGGGT
ACATATTCTT CGGATGATAT TTTAAGTGTT ACTAGTGGTT TTTACCAGAT AGATGATATT
GAGAGTGGTG TAGGTAATAA AGATGATATT GGTGATGTTT ACATTTCTTC TTACTATGAT
AGTAATTATA CTGGGAATAC TTACCAACCA TTATTATCGC GTTATGGATG GCGGTCAGGT
GGAAATGGTT TGGGTAGCGA GCGTGACTAT TTAGGTTTCC CTAGTTATCA ATCTTTTGAC
TCTAATAATG AGTTTAATGG TTTAATCTAG
 
Protein sequence
MNNRYNKPDQ VDNSRKKAYQ VGALQNDEKF EDFVGKSDKR DFYQFKVEEK TDVDIKLDDL 
SGNANLYLLN NKGKLIEKST KGGKNAEDIE RTLNPGAYYV KVQSNEEDAN YTLSLDVGSG
SQQEDKVGNN RKKAYKFGAL KNDEKFEDFV GKSDQRDFYQ FKVEEKTDVD IKLGGLSGNA
DLYLLNNKGK LIEKSTKGGD SLENIERTLN PGAYYVKVQS NEEDADYTLS LGVGSGSQQE
DKVGNNRKKA YKFGALKNNE QFEEFVGKSD KRDFYRFKVE EKTDVDIELG GLGGNADLYL
LNNKGRLIEK STKGLNKVED IERTLNPGTY YVKVQSNDED AKYTLSLDVG SGSKNLEKYE
FTYYYSGSQD NEFQDYYSGY FYAPKGTYEV DSYYDFNEEK NEASANGKYY ISSSSKAGAK
AKDGEVYLES YYDLETKKEY VPYYASEDLP SGASGMGSEE DFIDFEKDED VIELKQEGFF
GADYSLANIV KEAQTSKSGD YQIDALLSPY KWGTNTITYS FYDNDSGPYY GAERDVGEVS
DKVQEDIRHI LENIYEPLSD LDFVEVADTS NSYGLIRIMK SSNPNYAYAY YPYADDYNAG
NLLDLAGDVH FNVYYDGNGD HNNFQGGPGT HGYMTLIHEL GHAVGLKHPH EDGDTLPENE
DNTSNTVMSY NFTYTADNLR SYDVNALQYI YGFETPEDSV TVKSPNGGDT LKLGKSYTIT
WDDNFSENVK LELYKGGSFD SIITNSTASD GSYGWTLPTS MAIGSDYKVK ITSVSDAGVS
DLSDSNFTIE RDGVITVKSP NGGDVINTGD TYDVTWDDNI SENVRIYLYQ GNRYKQMISN
STPSDGSYSW TVPTSLTAGD NYEVAIQSVE KGSLYDYSDS SFTIEPDGVI TVKSPNGGDV
INTGDTYNIT WDDNISENVR IDLYQDNRYK QMISNSTSSD GSFRWRVPTN LATGDNYQVA
IQSVEKDRIF DFSDSNFTIE PDGVITVKSP NGGDVINTGD TYNITWDDNI SENVRIYLFQ
GNRYKQMISN STPSDGSYSW TVPTSLTAGD NYEVAIQSVE KGSIYDYSDS SFTIEPDDFI
TVMSPNGGDI LNTGDTYNIT WDDNIGENVR IYLFQGNRYK QMISNSTPSD GSYSWKVPTN
LTTGGNYQVA IQSVEKGSLY DFSNSNFSIK TEDVSSDKYY FTYYYNVGDS YSGFLYEKSG
TYSSDDILSV TSGFYQIDDI ESGVGNKDDI GDVYISSYYD SNYTGNTYQP LLSRYGWRSG
GNGLGSERDY LGFPSYQSFD SNNEFNGLI