Gene Tery_4574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4574 
Symbol 
ID4246228 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7037993 
End bp7039072 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content41% 
IMG OID638109447 
ProductHI0933-like protein 
Protein accessionYP_724023 
Protein GI113477962 
COG category[R] General function prediction only 
COG ID[COG3380] Predicted NAD/FAD-dependent oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAT CACAAAAATT TGATATTGCT GTTATTGGTG CAGGTATAGC CGGTTTAGTC 
TGCGCCCAAG AACTACAGCA GGCTGGTTAT TCAGTGCTAG TCCTAGAAAA ATCTAGGGGA
GTCGGGGGTC GCATGGCAAC TCGCCGAGTT TTAGGAAGTC GCGCTGATCA TGGAGTACGT
TACCTCGAGC CTACAAATAA ATTTTTGCAA CAATTAATTA ATAATCTCGA AATTCAAAAA
AATAGCCCTG ATGCCGAACC TATTCTCCGA CTTTGGACAG ATAAAATTTA TCAGTTAACT
CAACCCCAGA AACCACTATT TCCCATTGCC AAAAATTGCT ATGTTGCACC CCAGGGAATG
AACTCTGTCG GCAAATTTCT CGCCCAAAAT TTAGATATTT GGTTTAGTCG GCGAGTACAA
ACTATACAAC CAATAGCAGA AAAAAATTGG TGCCTGAAGT TAGAAATTAC TAATGATGCT
GCTACAGAAA AACCTACAGA AGTAATCGCA AAAGCCATAG TTTTAGCTAT TCCTGCACCT
CAAGCTTTAA TGATTTTAGA AAAACCTGTT GCTGAAATAC AACCCGAATT TATTCAGCGT
CTGGGTTTTG TAGAATATGA TCCTTGCATT ACCGTGATGG CAGGATATTC TCCAGCATTA
CAAAAAAAAT TACCAGAATG GCAAGCGATC GTTCTCCCAA ATGATGAATA TCTCGACTGG
ATAGGTCTTG ATAGTAGCAA AAGACCTTAT CCTACTCAGC CTATATTTGT GATTCAAAGT
AGCGCCAAGT TTGCTACGAC CCACCTTGAC TCCCCAGACT TACAACCATT AGGGAAAGAA
TTATTAGAAT ATACAGCACA ACAATTATTA CCATGGTTAA GTAGCCCAGA ATGGTTGCAA
GTTCACCGAT GGCGTTATGC TTTCTGTCGT CAACCTTTAG ATGTTGCTTG TTTGACTACA
AAATTACCGT TGCCATTACT GGGTGCTGGT GACTGGTGTG GTGGGAATAA TATTGAGAGT
GCTTTTGCCT CTGGGTTGGC TGCAGCAAAT TCCTTGAGTC AGCTAATTGC GAATAGCTGA
 
Protein sequence
MNKSQKFDIA VIGAGIAGLV CAQELQQAGY SVLVLEKSRG VGGRMATRRV LGSRADHGVR 
YLEPTNKFLQ QLINNLEIQK NSPDAEPILR LWTDKIYQLT QPQKPLFPIA KNCYVAPQGM
NSVGKFLAQN LDIWFSRRVQ TIQPIAEKNW CLKLEITNDA ATEKPTEVIA KAIVLAIPAP
QALMILEKPV AEIQPEFIQR LGFVEYDPCI TVMAGYSPAL QKKLPEWQAI VLPNDEYLDW
IGLDSSKRPY PTQPIFVIQS SAKFATTHLD SPDLQPLGKE LLEYTAQQLL PWLSSPEWLQ
VHRWRYAFCR QPLDVACLTT KLPLPLLGAG DWCGGNNIES AFASGLAAAN SLSQLIANS