Gene Tery_3332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3332 
Symbol 
ID4243503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5111276 
End bp5112709 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content44% 
IMG OID638108317 
Productvon Willebrand factor, type A 
Protein accessionYP_722908 
Protein GI113476847 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0350677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGGC AAAACTTGAC TCAAAAACTC CCTAAACCAA TTATCTTCGG AATATTCGGC 
GGTGCTGGAT GTCTTATAGC TGCGGCAATA TTTGGTGAAA TGTGGCTGTC TCTGACCAGG
CGACCTCCCC AACCTCAAAC TGTCGTTCTC CTCATCGACA CTTCTTCTAG CATGTGGGGT
GGTAAACTTC CAGAAGTCCA AGCAGCAGCT ACCGGATTCG TTGAACGACA AAATTTAACT
GTTAATAACT TAGCCATTGT AGAATTTTCC AGCAACTCGC AAGTTCTGAC CAATTTTGAT
GCTGATAAAA CTGAACTCAA ACAAGCGATC GCTAATCTCA CCCCATCTGG AGGTACAAAC
CTTTCTCAAG GCCTCAAAAC AGTCGCTTCT CTTCTGCGAA ACAGCAACAC TCCCAATATT
CTCCTATTTA CAGATGGTCA ACCCAACGAC CCTAGGGCCT CAAAATCAAT AGCTAGAGAA
ATCCGAGAGG CAGGAATTAA TTTAGTCACA GTGGGAACCG GAGATGCAAA CAGTAACTAT
CTCACTTCCT TGACAGAAAA TCCAGACCTA GTCTTTTTTG CTAACTCTGG AGAAATAGAC
CAAGCTTTCC GAGCTGCTGA AAAAGCCATC TCACAACTAT CTGACACAAG TGGTAATTAC
GGCTTAGTCT TCGGTATTTT CCGCATAGGG GCATGGACCG GTTTTCTGGC TCTCGGAATT
GGACTGGCCT TAATCCTTGG ACAAAACTAT AACCTCCGCC GTCGGTTGTT GTCGAAGCAA
GAAGTTGCTC TTGGAGGTGG GGGTGGTTTT CTCGCTGGAG TAGTAGGTGG AGCGATCGGT
CAATTGGCGC TTCTGTCAAG TACTAATCTC CCGACTTTAG CGATCGTAGC TCGAATGACC
GGCTGGACTT TTCTCGGAAC CCTTGTTGGT GGTGGAACGT CTTTATTTGT TCCTAACCTA
CCTCGTGAAA AAGCCTTGAT CGCCGGAGGG TTAGGAGGTG TGTTAGGAGC GACTTGCTTT
CTCTTGCTCA ATGCATTGGT AGGTGTGCTT CCAGCTCGTT TGGTAGGAGC AGGAATTTTA
GGATTTTGCA TTGGGTTGGC TATCGCTTTT AGTGAACAAC TAGACCGGGA GGTAGTATTG
TTGGTTCGCT GGAACAACTC AGAATTTACA ACTATTTCCT TGGGAAAGGA ACCCATTGAA
CTTGGTAGCT CCCGGAATGC TCATATTTAT CTATCAAGAG ATGCTGGTTT TCCCGCTAAG
TTTGCTAAGA TATTTATTGA AGAAGAAAAA ATTATTTTAG AATTTGACCC GTCAATTAGA
GAGCGCCCGA AGTTTCAAAA TATGAAAGTT TTGAAACAGG AACTTTCATA TGGCTCAAGT
CGTAAATTCG GAGATGTTTT ATTAGAAATT CCACAAAAAA ACATACTAAA ATAA
 
Protein sequence
MNWQNLTQKL PKPIIFGIFG GAGCLIAAAI FGEMWLSLTR RPPQPQTVVL LIDTSSSMWG 
GKLPEVQAAA TGFVERQNLT VNNLAIVEFS SNSQVLTNFD ADKTELKQAI ANLTPSGGTN
LSQGLKTVAS LLRNSNTPNI LLFTDGQPND PRASKSIARE IREAGINLVT VGTGDANSNY
LTSLTENPDL VFFANSGEID QAFRAAEKAI SQLSDTSGNY GLVFGIFRIG AWTGFLALGI
GLALILGQNY NLRRRLLSKQ EVALGGGGGF LAGVVGGAIG QLALLSSTNL PTLAIVARMT
GWTFLGTLVG GGTSLFVPNL PREKALIAGG LGGVLGATCF LLLNALVGVL PARLVGAGIL
GFCIGLAIAF SEQLDREVVL LVRWNNSEFT TISLGKEPIE LGSSRNAHIY LSRDAGFPAK
FAKIFIEEEK IILEFDPSIR ERPKFQNMKV LKQELSYGSS RKFGDVLLEI PQKNILK