Gene Tery_4157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4157 
Symbol 
ID4245808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6412451 
End bp6415675 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content35% 
IMG OID638109058 
Producthypothetical protein 
Protein accessionYP_723637 
Protein GI113477576 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGGA ATAAAACAAT ATTAACTGAA AATAGCTCCC CCCTGGGAAT TTGGGCTAAG 
GAAGCTTTAG GACTTCCAGA AGTACAAGTA CAGGTAAGAT TGCGAGGTAA TCATCTACAT
ATTCTCTGTG AAGCAGAAAA ATGTCCTGAA ATGAGCTTTG CACTTGCCCA GTTTTCTCAG
GCACTGAGTC AGATCAATAT AGAATCTTTA TTACCTCCTA ATCAACCTCG GATTTACCAG
ACGTTTCTGT GTGGTCGTAC TTTGGGACGT AGGCGACCTG ACTGGACAGT AAAACTTGAT
AGTAATAAGG TAAGGGCTCA AAGTGGGAGA CTAAATTCTC CTATCTTATC TAATGATTCA
GAAACTAATT TCGATCCAAC ATCTACTTCT ACTGCTTCTA CTCCTCAAGA ATACTCACAA
AATTATCATG GCACTGAAGG TTTACCTGAA GAAAAATTCA CAAAAATTGG TGTTGGACAA
TCTTCTGAAC TGGATATCCA TCGCGATGTT GCCTCTGAAT TGCTGTTAGC TCCTGATCAG
GATTTTGTCG GGTCTAATGA CACATTAGAT AATCTCAACT TTTTATCTCC AAAATTCGAC
TCAGAGACTT TTTCTAGTAC TGAACCAGAA AAGTCTAAGT TAAAAACGCA GCTAGATAAA
CTCAAGGAAT CAGATATTAA TCTGATTAAT TCTTCTTTAA CAGTTTCTTC TAAAAGGTTA
GCTAAGTATG GTCACCCAGA TGCAATTGCT AGTTATCTGA GTGAAATTTT AGGTGAGTTA
GGGGTCTGTG TAAATGTTAG TGTTAGAGAA AAACAATTTA AGGAAAAATT AACAGAAACA
AGTTCACAAT TAGAGGATGT TACTAAAAAT TTTAAGTTAA AAACTCAAAA AATCTTATGG
GTATCTTGTG AAGCAACTTA TAGTCCCGAT CCATCTCTGT TAGCTGAACC AATTACTCAA
AAACTAAGAG ACCTCAAACT TAAAGATTTC CATGAAGCTT TAATTTCTCT ACTTGTTCGA
GGTGAAACGG CTCCTGACTG GATGTTGCGA GTGGATCTAA CTCCACCCGA TCTGATGCTA
CAAGAATGGG CGAGTTGGGG GGATGTAACT GCAATTGAAC GTTTGCTTAA ACAAAAGCTG
GCTACCCTGG GAGTTGATAT TCGAGGGATC CTCAAAGAAT CAACTTTACA TTTATTTTGT
ACTAGTACTA ATAATTCCAG CCAAGAGTAT CCAGATCAAC AGAAAACAAA AGCAGCGATC
GCTTCTATAT TAGCTACAAT TATACCAAGA GGTATTCAAG CTGCTACCAT ATACGGTTGT
ACGGTTAATG AAAGTAACTA CAAAAGGAAA GAATTTCCTC GCTGGATAGA CTGGTTGAAT
TTACCGGCAT CAGAAAATCA AGATCTATCT GCTAGTGCTG AGGTTTTGGC AAGTCAAGGA
AACTATGAAG CAATTAGTTT TTTGTTAAAT AGATTAGTTA ATTCTAACCT TGATCAAAGA
CTTAAAACAG GTGGTATCCG GGTATTAATA TTATCGAAGC AAGAATTATT GCATGTTATG
AGCGAAGCTC CTACAAGTCC ATCTCAATCT CAAGTAGGAC CATGCATTGC TAATTTTTTG
CGTCAACTTC AAATTCCTGG AGTTAGTGGG GTAAGGGTTT ACGGTCGTCG TGCTGGTCAA
AAATTACCTT TATGGCGCTA CGGTATTAAT TTTACTACGA AAAATCGACG ATATTCTGAA
AAACCTCCAG AGTTTGCTGC AACTGCCGAA ATGGATTTTT TGTTGGGTAA AAGAGCAGAT
CGTGCTTTCC TCAAGTTAGC ACCAAAAATG ACTGAAAAAA GTGACTCAAG TCTTTTATAT
TTTCATGGGT ACACATTATT TAAAGGACGT CTGTTGACAG GTATTCAGCA GTTACTTATT
GGCTCGCGTT TATTTATTCC TAATGAAGAA GATTTGACAA AAATAAGTAA TTCATTAACA
TACAATAGTC GCAGTTATGG TAAATGGTTT GCTGCTATTT CTTATGCTGC TTTGGGAATT
TTATTGACGG TTCAAACTGA TTTAAAAGTA GATGAAATAT TAAAGAAAGT TCCAGATGTT
TCTCTGTATG GAAATATTTG TCAAGGTCGA AATTGCCAAA CTCAATCAGA AGAAATTGTA
GATGAAAATC AAGCTTCAGG ATTAATGAAT TATCCTAGTT TTAATAGTCC GCAATTAGAT
GAACAGTTGC TTCGTTATCA ACAATATTTA GAAGCAGAAA AAAAAATACC AGATATTTTA
ATAGTTGGTA GTTCTAGAGC TTTGAGAGGG GTCGATCCCA CGGTGCTTGC TGAAAGTTTG
GCAGCTTTAG GATATCCAAA TTTCAGAATT TATAATTTTG GGGTAAATGG TGCAACTGCT
CAAGTAGTAG ATTTGATTGT ACGTCAGGTT TTACCACCAG AACAATTGCC AAAATTAATT
ATTTTTGCTG ATGGAGTTCG GGCTCTAAAT AGTGGTAGAG TTGATAGAAC TTTTGATATT
ATTGCTGGTT CTGAAGGTTA TGATCAAGTA GATGCTGGTT CGTTTATTAT TGGGAATAAT
AGATCAGACT CATCAGTTAA TTATGATTTA AATAAATATA AGCAAATATT GAAAAAATTA
GAGTCAAAAG TTAAGAAAAT ATCAGTGACT TATCAGCAGC GCGATCGCCT AAAAAGATTG
ATAGTATCAA TTATTAAGAA TCCGACAAAT TTTGATTTTT TGTCTGGGGA AGGTAACCAG
GATTTAGTAA AAGATCATAA CCTAGAGTTA GAGCAATTTC AAGCAAATGG TTTTCTGCCA
ATTTCTATTA AATTTAAACC TGAAAGTTAC TATCAAAATT ATACTAAAGT TTCTGGTGAT
TACGATGCTG ATTATGCTGA GTTTCAATTA CGAGGAAAAC AAACCACTGC ACTCAAAAAA
TTACTACAAT ATACTCAGTC AATAGGGGTT AATTTTGTAT TTGTAAATAT GCCTTTAACT
ATAGTATATT TGGATGAAGT ACGAACTGTT TATGAACAAG AATTTCAGGA ATATATGCAA
CAGTTATCTG GGGAATATAC TAATTTTATA TTTCGAGATT TAGGCAGTGC ATGGCCAGAA
ACTTATGACA ACTTTTCTGA CCCTAGTCAT CTAAATCTTT ACGGAGCGAT CGCTGTTTCT
CAAACATTAG CTGATGATCA TATTATTCCT TGGCGAAAGC GTTAA
 
Protein sequence
MKGNKTILTE NSSPLGIWAK EALGLPEVQV QVRLRGNHLH ILCEAEKCPE MSFALAQFSQ 
ALSQINIESL LPPNQPRIYQ TFLCGRTLGR RRPDWTVKLD SNKVRAQSGR LNSPILSNDS
ETNFDPTSTS TASTPQEYSQ NYHGTEGLPE EKFTKIGVGQ SSELDIHRDV ASELLLAPDQ
DFVGSNDTLD NLNFLSPKFD SETFSSTEPE KSKLKTQLDK LKESDINLIN SSLTVSSKRL
AKYGHPDAIA SYLSEILGEL GVCVNVSVRE KQFKEKLTET SSQLEDVTKN FKLKTQKILW
VSCEATYSPD PSLLAEPITQ KLRDLKLKDF HEALISLLVR GETAPDWMLR VDLTPPDLML
QEWASWGDVT AIERLLKQKL ATLGVDIRGI LKESTLHLFC TSTNNSSQEY PDQQKTKAAI
ASILATIIPR GIQAATIYGC TVNESNYKRK EFPRWIDWLN LPASENQDLS ASAEVLASQG
NYEAISFLLN RLVNSNLDQR LKTGGIRVLI LSKQELLHVM SEAPTSPSQS QVGPCIANFL
RQLQIPGVSG VRVYGRRAGQ KLPLWRYGIN FTTKNRRYSE KPPEFAATAE MDFLLGKRAD
RAFLKLAPKM TEKSDSSLLY FHGYTLFKGR LLTGIQQLLI GSRLFIPNEE DLTKISNSLT
YNSRSYGKWF AAISYAALGI LLTVQTDLKV DEILKKVPDV SLYGNICQGR NCQTQSEEIV
DENQASGLMN YPSFNSPQLD EQLLRYQQYL EAEKKIPDIL IVGSSRALRG VDPTVLAESL
AALGYPNFRI YNFGVNGATA QVVDLIVRQV LPPEQLPKLI IFADGVRALN SGRVDRTFDI
IAGSEGYDQV DAGSFIIGNN RSDSSVNYDL NKYKQILKKL ESKVKKISVT YQQRDRLKRL
IVSIIKNPTN FDFLSGEGNQ DLVKDHNLEL EQFQANGFLP ISIKFKPESY YQNYTKVSGD
YDADYAEFQL RGKQTTALKK LLQYTQSIGV NFVFVNMPLT IVYLDEVRTV YEQEFQEYMQ
QLSGEYTNFI FRDLGSAWPE TYDNFSDPSH LNLYGAIAVS QTLADDHIIP WRKR