Gene Tery_4567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4567 
Symbol 
ID4246221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7025685 
End bp7026989 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content38% 
IMG OID638109440 
Producthomoserine dehydrogenase 
Protein accessionYP_724016 
Protein GI113477955 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTATTCA AAATCGGTTT GTTGGGATTA GGAACAGTCG GTATAGGGAC AGCAAAAATT 
TTGCTATCCC CACAAGGTCG TCATCCTTTA ATTAATGAAT TGGAAATATA CCGAGTTGGA
GTGCGTGATA TATCCAAAAC TAGGGATATT AATATATCTC CAAATATTTT AACAACAGAT
TTAGAGGCGA TAGCTACTGA CCCAGATGTA GACATAATAA TAGAACTCAT CGGTGGGTTA
GAACCAGCAA GAAGTCTAAT TCTCAAAGCT ATCGAACATG GTAAACATAT AGTCACTGCC
AACAAAGCTG TCATCTCTCG TTACGGTCCA GAAATTTTTG ATGCAGCTAA TAAAAAAAGT
GTCTATGTTA TGTTAGAAGC ATCAGTAGGA GGTGGAATTC CAGTTATTCA ACCCCTCAAA
GAATCTCTAA CAGTAAATCG CATTCATTCT ATTACTGGTA TCATCAACGG GACAACTAAC
TACATCCTTA CTAAAATGCA AAAAGAGGGA GCAGACTTTG CAGATGTTCT TGCTGATGCT
CAAAAGCTGG GTTACGCAGA GGCAAATCCC ACTGCTGATG TGGAAGGGTT AGATGCCAAA
GATAAAATCG CCATTCTTGG TTCCCTTGCT TTTAGTGGTT TCATAAAATT AGAAGACATT
TACTGCGAAG GTATTCGTCA AATAACAGCA GCAGATATTA CCTACGCTGC TCAACTAAAT
TTCGTAATTA AATTATTAGC GATCGCCAAG CAAGCTCCTA AAAATACTAA CTTCACCTCT
GATAAACTTC AACTCAGAGT ACATCCTACC CTCATTCCCA AAACTCATCC CCTTGCCAGT
ATCAACGATG TTGACAACGC TATCTTCATT GAAGGAGAAC CCATCGGACA AGTTATGTTT
TTTGGTCCTG GTGCCGGTGC TGGTCCAACT GCCAGTGCAG TAGTTTCTGA TATTATGAAT
ATTGCGGCTA TTCTGCAAAC AGAAACCGAA CCCATTCCCC ATCCATTATT AAGTTGTACT
CATCAAAAAT ATTGTGAAAT AGTACCCATA AAAGAACTGA TCACTAGATT TTATGCTCGT
TTCTTAACTC AAGATAAACC AGGAGTAATA GGTAAACTTG GTAGTTGTTT TGGTAAATAT
AGTGTTAGTT TAGAATCTGT GGTACAAACA GGAATACATA ATCAACTTGC AGAAATAGTT
GTTGTTACCC GTGATGTACG AGAAGGCAAC TTCCGGCAAG CATTAGAGGA AATTCAATCA
TTAGATGTTA TTAATAGCAT TCCCAGTATA TTAAGGGTTT TGTAG
 
Protein sequence
MVFKIGLLGL GTVGIGTAKI LLSPQGRHPL INELEIYRVG VRDISKTRDI NISPNILTTD 
LEAIATDPDV DIIIELIGGL EPARSLILKA IEHGKHIVTA NKAVISRYGP EIFDAANKKS
VYVMLEASVG GGIPVIQPLK ESLTVNRIHS ITGIINGTTN YILTKMQKEG ADFADVLADA
QKLGYAEANP TADVEGLDAK DKIAILGSLA FSGFIKLEDI YCEGIRQITA ADITYAAQLN
FVIKLLAIAK QAPKNTNFTS DKLQLRVHPT LIPKTHPLAS INDVDNAIFI EGEPIGQVMF
FGPGAGAGPT ASAVVSDIMN IAAILQTETE PIPHPLLSCT HQKYCEIVPI KELITRFYAR
FLTQDKPGVI GKLGSCFGKY SVSLESVVQT GIHNQLAEIV VVTRDVREGN FRQALEEIQS
LDVINSIPSI LRVL