Gene Tery_4072 details

Gene Information

Locus tag: Tery_4072
Symbol:
ID: 4242100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6286709
End bp: 6288205
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 32%
IMG OID: 638108975
Product: isochorismate synthases
Protein accession: YP_723556
Protein GI: 113477495
COG category: [H] Coenzyme transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1169] Isochorismate synthase
TIGRFAM ID: [TIGR00543] isochorismate synthases
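
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6286709
end_bp = 6288205

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1497 bp) and Protein Length (498 aa).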


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.108156
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000687043
Fosmid Hitchhiker: No
Fosmid clonability: unclonable

Sequence

Gene sequence
ATGCCTGTAA CACCAACTTC TGTTAATCTA TTTCAGACTT ACCAAGATCT GTATCAATTT 
CTGTTTAATT GTCAACAGAC ATTAACAGAC AATATGCAAA CAAAAATAAT TAGTATTTCT
CGAGAAATAT TATCAGTAGA TCCCCTAGCA GTATTGCAAA AAATTTGTCA ACCACATCAA
CTACATTTTT ATTTAGAAAA ACAAGCCATT GGAGAAAAAA ACATCCACAA AAATAGGTTG
GCGATCGCTG CTGTAGATAC TGCTACCCAT TTTACTGTTA AAAGTGGGAG TCGTTTTGCG
CAAGCTGAGT CTTTTATTCA ATCATGTTTA AACAATACAA TTTCTCTAGG TGCAACACAC
TTACCATATT CAGGTCCCCA CTTTTTTTGT AGCTTTACTT TCTTTGAAAA AGATACTCAT
GCCTATCTAC AAAACCTTTA TCAAAATGGT AATTACCAAC AAAAATTATC TCTAAACCTT
CACTTTCCTT TAGCAACAAT ATTTCTACCA TGCTGGCAAA TAACTCAGAC AAATAAACAT
AATATACTTG TAATTAATAC CGTTATTAAT AACTCTATTA ACATTAAAAA TCTTTCCCAT
AAAATTTGGC ATAAATTCCA GGAAATAACC CAGATAAAAC ATAATCATTT ATCAACTTTA
ACCAAGCCCA ATCAAAAACT TATAAAAATC AACGTCAATC ATTTACAAAA ATTTAAAAAA
TCAGTAGCTT CAGCTCTAGA ATTAATCAAT TCAAATTATT TAAGAAAAAT TGTTTTAGCC
CATGCTATAG ATATATATTC TCAAAATAAT TTCAACTTAA TCAAATCCTT AAACAACTTG
CGATTCATTT ATCCAGACTG CTATGTATTT TCTATTAGTA ATGGCAAGGG CCAAAACTTC
ATAGGTGCAA GTCCAGAACG CTTAATTAGT ATTAACAATA ATCAATTAGT TACAGATGCT
TTAGCAGGTT CTGCACCTAG AGGCAAAACC CCTAGTCAAG ATGCTAAATT AGCCAATAGT
TTATTATGTA GTGAAAAAGA TTTACGAGAA CATCAATTTG TCATAGATTT CATTATTAAA
CGTCTTCAAT ATTTAGGATT AAAACCAAAT TATTTACCCC AACCAAATCT ACTACAATTG
TCAAATATTC AGCATTTATG GACACCAATA AATGCAGAAG TTTCTCAAAA TATTCATTTA
TTAGAAATAT TAGCACAACT CCATCCCACA CCAGCAGTGG CAGGAGTTCC TAGAGATATT
GCTCAAGAAC AAATACAGAA TTTTGAAACT TTCGATCGCT CACTTTATGC AGCACCTATT
GGTTGGATAG ATCACCAAGG AAATGGAGAA TTTACTGTAG GTATTAGGTC AGCTTTAATT
GATGGAGAAC GCGCTAGACT TTATGCTGGT GCAGGTATAG TTACTGGTTC AAAACCAGAT
CAAGAGTTAG CAGAAGTTCA ACTCAAACTT CAGACATTAT TAAAAGCTTT AGTTTAA
 
Protein sequence
MPVTPTSVNL FQTYQDLYQF LFNCQQTLTD NMQTKIISIS REILSVDPLA VLQKICQPHQ 
LHFYLEKQAI GEKNIHKNRL AIAAVDTATH FTVKSGSRFA QAESFIQSCL NNTISLGATH
LPYSGPHFFC SFTFFEKDTH AYLQNLYQNG NYQQKLSLNL HFPLATIFLP CWQITQTNKH
NILVINTVIN NSINIKNLSH KIWHKFQEIT QIKHNHLSTL TKPNQKLIKI NVNHLQKFKK
SVASALELIN SNYLRKIVLA HAIDIYSQNN FNLIKSLNNL RFIYPDCYVF SISNGKGQNF
IGASPERLIS INNNQLVTDA LAGSAPRGKT PSQDAKLANS LLCSEKDLRE HQFVIDFIIK
RLQYLGLKPN YLPQPNLLQL SNIQHLWTPI NAEVSQNIHL LEILAQLHPT PAVAGVPRDI
AQEQIQNFET FDRSLYAAPI GWIDHQGNGE FTVGIRSALI DGERARLYAG AGIVTGSKPD
QELAEVQLKL QTLLKALV
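
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the two tables differ only in permitted start codons). Only the first line of the gene sequence is used, and the names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence, copied from the record above.
first_line = "ATGCCTGTAA CACCAACTTC TGTTAATCTA TTTCAGACTT ACCAAGATCT GTATCAATTT"
dna = clean(first_line)
peptide = translate(dna)
gc_fraction = sum(base in "GC" for base in dna) / len(dna)

print(peptide)
print(round(gc_fraction, 3))
```

The translated fragment corresponds to the first 20 residues of the protein sequence (MPVTPTSVNL FQTYQDLYQF). Passing the full gene sequence through `clean` and `translate` would be the natural way to compare against the complete listed protein.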