Gene Tery_0171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0171 
Symbol 
ID4242921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp254069 
End bp255130 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content42% 
IMG OID638105517 
Product3-deoxy-7-phosphoheptulonate synthase 
Protein accessionYP_720136 
Protein GI113474075 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.623369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.950344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAATAG TAACAAAAAT CGGTTCTCCA GAAGTAGAGA TAGATCGTCT TTGTCAGGAA 
CTCAAAACTA ATTGGGGTCT AACTCCAGAA AAAATTGTTG GGCGCTATAA AGTTGTTATT
GGTTTAGTAG GAGATACTGC AGACCTGGAT CCATTACAAT TACAGGAAAT GAGTCCTTGG
ATCGAACAGG TAATGCGGGT TGAACGTCCT TACAAACGAG CAAGTCTAGA GTTTCATAAC
GGAGAAGCTA GTACAGTAGT AGTATCTACT CCAGATGGAC CTATACCTTT TGGCCTTCAT
CAAGATCTAG TTGTAGTTGC TGGACCCTGT TCGGTAGAAA ATGAGCAAAT GATTATAGAA
ACAGCTCAAC GGGTGAAAGC TGCTGGTGCT AAGTTTTTGC GTGGAGGTGC TTATAAACCT
CGTACTTCTC CTTATTCTTT TCAGGGACAT GGTGAGAGTG CTTTAAATTT GTTGGCTGCG
GCTAAAGAAA AAACTGGTCT GGGAATTATT ACAGAAGTAA TGGATGCTGC TGACTTACCA
AGGTTGACTG AAGTTGCGGA TGTGGTGCAA GTAGGTGCCC GTAATATGCA AAACTTTTCT
CTATTAAAAA AGGTCGGTGC TCAAGATAAA CCAGTGCTTT TGAAGCGGGG AATGTCAGCT
ACTATTGAAG AGTGGCTAAT GGCTGCAGAA TATATTTTAG CAGCCGGAAA TTCAAATGTA
ATTCTTTGTG AGCGCGGTGT TAGGACTTTT GATCGTCAGT ATGCTCGTAA TACTTTGGAC
TTGTCTGCAA TACCAGTTTT GCGATCGCTG ACTCATTTAC CAATTATGAT TGACCCTAGT
CATGGTACTG GTCGAGCTGA ATATGTACCT GCAATGGCAA TGGGAGCTAT TGCTGCAGGA
ACAGATTCTC TAATGATTGA AGTCCATCCA AATCCAGCAA AAGCTCTGTC AGATGGCCCA
CAATCTCTAA CACCAGATAA GTTTGATACC TTAATGGAGG AAATGGCAGT TATTGGTAAA
AGTGTTAAAC GTTGGCCACA ACCGGAACCA GCCATTGTAT AG
 
Protein sequence
MIIVTKIGSP EVEIDRLCQE LKTNWGLTPE KIVGRYKVVI GLVGDTADLD PLQLQEMSPW 
IEQVMRVERP YKRASLEFHN GEASTVVVST PDGPIPFGLH QDLVVVAGPC SVENEQMIIE
TAQRVKAAGA KFLRGGAYKP RTSPYSFQGH GESALNLLAA AKEKTGLGII TEVMDAADLP
RLTEVADVVQ VGARNMQNFS LLKKVGAQDK PVLLKRGMSA TIEEWLMAAE YILAAGNSNV
ILCERGVRTF DRQYARNTLD LSAIPVLRSL THLPIMIDPS HGTGRAEYVP AMAMGAIAAG
TDSLMIEVHP NPAKALSDGP QSLTPDKFDT LMEEMAVIGK SVKRWPQPEP AIV