Gene Tery_4113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4113 
Symbol 
ID4245627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6345000 
End bp6345869 
Gene Length870 bp 
Protein Length289 aa 
Translation table11 
GC content32% 
IMG OID638109014 
Producthistidine triad (HIT) protein 
Protein accessionYP_723594 
Protein GI113477533 
COG category[F] Nucleotide transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.489073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACCA ATAAACAACC AAACCAATAT ACCCACCTTA CAGCTATAGA CCGTAAATTT 
ATCTCATTTC CAACTCAAGT AATACTAAAA CAAAACTTGC TAGTAGGTAA AATTCTTGAT
TTTGGCTGTG GTCTCGGTAA AGACGTTAAA CTACTACAAA AAAAAGGCTT TGATATTATT
GGTTACGACC CTTATTATTT TCCAGAATAT CCCCAGGGAA AATTTGATAC TATCCTTTGT
TTTTATGTCT TAAACGTCTT ATTTGAACAA CCACAAATCG AAGTGTTGAT GCAAATTTCC
CAGTTATTAA ACTCTGAAGG AAAAGCTTAT TATGCTGTGA GAAGAGGTTT GAAAGGAGAA
GGTTTTCGAG AACATTACAT ACACAAAAAA CCAACTTATC AGTGTTTAGT TAATTTACCT
TTTAAATCTA TTTATACTGA TGAACTTCGT GAAATTTATG AATATACTCA TTATAATCAG
CAAAAAAATT CAGAGAACAA ATGCTTATTT TGTAATCCTC GTAAAAATCT ACAATTAATT
ACAGAATCAG CAAAAGCTTA TGCAATACTT GACGGTTACC CAGTCACAAA AGGCCATACA
TTAATTATTC CCAAACTTCA CCAGGAAAAT TATTTTGAAT TGCCTATAAA TGACCAGTTA
GAATGTTGGT TAATGGTAAA TCAAGTACAG AAGATTTTAC AGGAAAAATA TCAACCTGAC
GGATTTAATA TAGGAATTAA CGTTAATCAT GCTGGAGGAC AAAAAATGAT GCATACTAAT
ATTCACGTTA TTCCTAGATA TCAAAAGAAC GAGTTAGGAA CTAAAGGGGG AATGAGGTCT
GTTGTTCCAA AGAGGAGAGG CAAAGTATAA
 
Protein sequence
MFTNKQPNQY THLTAIDRKF ISFPTQVILK QNLLVGKILD FGCGLGKDVK LLQKKGFDII 
GYDPYYFPEY PQGKFDTILC FYVLNVLFEQ PQIEVLMQIS QLLNSEGKAY YAVRRGLKGE
GFREHYIHKK PTYQCLVNLP FKSIYTDELR EIYEYTHYNQ QKNSENKCLF CNPRKNLQLI
TESAKAYAIL DGYPVTKGHT LIIPKLHQEN YFELPINDQL ECWLMVNQVQ KILQEKYQPD
GFNIGINVNH AGGQKMMHTN IHVIPRYQKN ELGTKGGMRS VVPKRRGKV