Gene Tery_4077 details

Gene Information

Locus tag: Tery_4077
Symbol:
ID: 4242105
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6291504
End bp: 6293252
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 33%
IMG OID: 638108978
Product: 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase
Protein accession: YP_723559
Protein GI: 113477498
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1165] 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase
TIGRFAM ID: [TIGR00173] 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylic-acid synthase
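
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(6291504, 6293252)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1749 bp) and Protein Length (582 aa).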


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000513285
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCAATTG ATTTTAGAAG CCTCAATACA GTATGGTCAT CTATTTTAGT AGAAACACTA 
CATCATTTAG GATTAACCAC CGCAATTATC AGTCCTGGTT ATCGTTCCAC ACCCCTAACA
TTCGCCTTTG CCACCCATCC CAAAATTGAA ACTATTCCCA TTCTTGATGA ACGTTCAGCA
GCATTTTTTG CCCTAGGAAT AGCAAAAAAA AATTATCAGC CAATAGTTAT AGTCTGTACA
TCTGGTACGG CCGCAGCTAA TTTTTATCCA GCCATAATAG AAGCAAAAGA AAGTCGCATC
CCACTATTAG TTCTAACTGC AGATAGACCC CCAGAATTAC GAAATTGTCA TGCTGGTCAA
ACCATAGATC AAGTCAAATT ATATGGTAAT TATCCTAACT GGCAAATTGA ACTAACTTTA
CCTTCAGTAG AACTAAAAAG ATTGGAATAT TTAAGACAAA CAGTTATTCA TGGTTGGGAA
AAAACTATGT TTCCTACACC TGGACCTGTG CATTTTAATA TTCCTTTTCG AGACCCTTTA
GCGCCCATAA ATCAACCAGA AGCGATCGCT TTAGAATCAA AATTTTCCCA AAACTTTTTT
GCTAGTTTAC GACCAATTAT TAGAACAGAA TTAATCCCAA ATTCTGACTT AATAGAATTA
TTAAAAAATC AATTTAAATC TCACTCAGGA ATTATAATTG CAGGTTTAGC ACAACCAGAA
AAACCTGAAG TATATTGTCA GGCGATCGCT AAAATTTCCC AAACTTTAAA CTTCCCTGTT
TTAGCCGAAG GTTTATCACC TCTAAGAAAC TACTCCCAGT TAAATCCCTA TTTAATTAGC
ACTTATGATT TAATTTTAAG AAATCAAAAA TTAGCCAACA AACTTATTCC CAAAATAGTC
TTACAAATTG GAGAACTACC AACAAGTAAA CAACTAAGAA CTTGGCTAGA AGCAGCTAAT
TCTCACAGAT TAATTATTGA CCAAAGTGAC CATAATTTTG ATCCGCTTCA TGGAAAGACA
ACTCATTTAC GAATCTCAGT AGAACAACTA GCCAAAATTT TAATTACTCA ATATTTCAAC
AATAACCATG ATATAAATTA CTTAAATTTG TGGTGTCAAG CCGAAGAAAA AGTCAGAGAA
AATATTGACA CAAAAATGGC AAAAATAAAT CATATATTGG AACCAAAAAT TTCTTGGTTA
ATATCTCAGA CATTACCAAA AAATACATCA ATATTTGTTG CTAATAGTAT GCCAGTTAGA
GATGTAGAGT TTTTCTGGGT TCCTAATAAT TCCCAAATTC AGCCATTTTT TAATCGAGGT
GTAAATGGAA TAGACGGTAC TTTATCAACA GCTTTAGGTA TTGCCCACCG GTATCAAAAA
ACTGTTATGT TAACAGGAGA TTTGGCGCTT TTACATGACA CCAATGGTTT TTTACTAAGA
AATAAATTAG TCGGTCATTT AACTATTATT TTGATTAATA ATCAGGGAGG TGGAATTTTT
GAAATGCTCC CAATAGCTAA CTTTGAACCA CCTTTTACAG AATTTTTTGC TACTCCCCAA
GAGATTGATT TTGCTGATTT ATGTAAAACT TATGGTTTAG AACACCAAAA AATATCTTCC
TGGAACCAAT TACAACAACT TTTAAATCCT TTACCAAGTA GTGGAATAAG GATTTTAGAA
TTACAAACAG ACCGCCAACT TGACGCTAGA TGGCGGTTAG ATAATTTAGA TACATTTATA
GATTTATAG
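
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how a GC percentage like the one in the Gene Information table could be computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the listing is used as sample input, so the result is not the whole-gene figure.

```python
def clean_sequence(listing: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(listing.split()).upper()


def gc_percent(listing: str) -> float:
    """Percentage of G and C bases in a sequence listing."""
    seq = clean_sequence(listing)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied as displayed.
first_line = "ATGTCAATTG ATTTTAGAAG CCTCAATACA GTATGGTCAT CTATTTTAGT AGAAACACTA"
print(len(clean_sequence(first_line)), gc_percent(first_line))
```

Passing the full listing to `gc_percent` in the same way would give the value for the whole gene.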
 
Protein sequence
MSIDFRSLNT VWSSILVETL HHLGLTTAII SPGYRSTPLT FAFATHPKIE TIPILDERSA 
AFFALGIAKK NYQPIVIVCT SGTAAANFYP AIIEAKESRI PLLVLTADRP PELRNCHAGQ
TIDQVKLYGN YPNWQIELTL PSVELKRLEY LRQTVIHGWE KTMFPTPGPV HFNIPFRDPL
APINQPEAIA LESKFSQNFF ASLRPIIRTE LIPNSDLIEL LKNQFKSHSG IIIAGLAQPE
KPEVYCQAIA KISQTLNFPV LAEGLSPLRN YSQLNPYLIS TYDLILRNQK LANKLIPKIV
LQIGELPTSK QLRTWLEAAN SHRLIIDQSD HNFDPLHGKT THLRISVEQL AKILITQYFN
NNHDINYLNL WCQAEEKVRE NIDTKMAKIN HILEPKISWL ISQTLPKNTS IFVANSMPVR
DVEFFWVPNN SQIQPFFNRG VNGIDGTLST ALGIAHRYQK TVMLTGDLAL LHDTNGFLLR
NKLVGHLTII LINNQGGGIF EMLPIANFEP PFTEFFATPQ EIDFADLCKT YGLEHQKISS
WNQLQQLLNP LPSSGIRILE LQTDRQLDAR WRLDNLDTFI DL
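
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which does not matter here because this gene begins with ATG. The `translate` function is an illustrative helper, and only the first 30 and last 9 bases of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(listing: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(listing.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGTCAATTG ATTTTAGAAG CCTCAATACA"))  # first 30 bases of the gene
print(translate("GATTTATAG"))                         # last 9 bases of the gene
```

The two fragments should correspond to the first ten and last two residues of the protein sequence shown above.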