Gene Tery_4182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4182 
Symbol 
ID4245834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6444890 
End bp6445951 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content35% 
IMG OID638109082 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_723660 
Protein GI113477599 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.909062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTCG ATCACATTCA TTTCTACGTT GAAAATGCTA TAGAGTCCAG AGACTGGTTT 
AGAGAAAAAT TAGGTTTTAA AGCCATTGCT TCTAAAACCA GTCAACATAC CCAGAAGGAA
ATTATTAACA GGGGTCAAGT ATATTTTGCC TTATCTTCTG CCATCACACC AGCAAGTCCT
GTTACCAATT TTCTGAGCTT ACATCCTCCC GGAGTCGCTG ATGTAGCTTT TCGAGTTCGA
GATATTACCT CAGTTGTGGC AAATGCGGCA GCTAATGGAG CAGAAATTTT GCAACCTATT
CAGGAAAACT TAAACGGTTT AAAATGGGCG AAAATTTCTG GATGGGGAGA CTTAACTCAT
ACTTTGATAG AAAAAATTGA TGATGCAAAA ATTTTGAATT CTACTTCTCA ACTTTCTACG
GATCTTATGG TGATCGATCA TGTAGTTTTA AATGTAGCTA AAGACAATTT AGAACCTGCT
TTCAATTGGT ATCATCAAAT TTTCAATTTC CAACCCCATC AAAATTTTGA CATTCAGACA
AATAAATCAG GTTTGCGTAG TCTAGTGATG ATACACCCAG AGGGAGAAGT CAAATTTCCT
ATTAATGAGC CTACATCTGA TAGTTCTCAA ATTCAGGAAT TTTTGGATGC TAATTCTGGT
GCGGGAATAC AACACATTGC TTTACATACA GAAAATATTT TGGGGGTGGT CGGAGAGTTG
CGATCGCTTG GTTTACCTTT TTTACAAGTT CCAAAAACAT ATTACTATAG TCTACAAACA
GAAGCATTAA GTCATCTATC AGAAACTGAC TGGCAGAAAG TTCAAAATTG TCAAATTTTG
GTAGACTGGC AAGAAAAAAT ACCAGGAGCA ATGTTACTAC AAATTTTTAC ACAACCAATA
TTTAACCAAC CAACAGTATT TTTTGAGTTT ATTGAACGTA AAGTTGTTTG GGTAAATGGT
AAACAAATTC AGACACCAGG TTTTGGTCAA GGTAATTTTC AAGCTTTATT TGAAGCTATT
GAAAGGGAAC AAATGAAACG AGGTAGTTTA AGAAAAAATT AA
 
Protein sequence
MQFDHIHFYV ENAIESRDWF REKLGFKAIA SKTSQHTQKE IINRGQVYFA LSSAITPASP 
VTNFLSLHPP GVADVAFRVR DITSVVANAA ANGAEILQPI QENLNGLKWA KISGWGDLTH
TLIEKIDDAK ILNSTSQLST DLMVIDHVVL NVAKDNLEPA FNWYHQIFNF QPHQNFDIQT
NKSGLRSLVM IHPEGEVKFP INEPTSDSSQ IQEFLDANSG AGIQHIALHT ENILGVVGEL
RSLGLPFLQV PKTYYYSLQT EALSHLSETD WQKVQNCQIL VDWQEKIPGA MLLQIFTQPI
FNQPTVFFEF IERKVVWVNG KQIQTPGFGQ GNFQALFEAI EREQMKRGSL RKN