Gene Tery_1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1667 
Symbol 
ID4245461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2534182 
End bp2535216 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content39% 
IMG OID638106802 
Productphotosystem antenna protein-like 
Protein accessionYP_721411 
Protein GI113475350 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00889751 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACTT ATGGTAGTCC AGAAGTTAAA TACGATTGGT GGGCTGGAAA TGCTCGCTTC 
GCTAAATTAT CTGGTTTATT CATATCTGCT CATGCAGGTC AAGCTGCTCT AATTACTTTT
TGGGCTGGAG CATTTACTCT ATTTGAAATA TCTTTATATT CCCCAGATTT ACCTATGGGT
GAACAGGGTT TAATCTTACT TCCTCACCTA GCAACTTTAG GCTTAGGTAT TGGAAAAGGT
GGAGCAGTTG TAGACACTTA TCCCTATTTT GTAGTTGGTA TTGTACATCT AATTTCATCA
GCAGTATTAG GAGCAGGAGC ACTTTTTCAC ACCTTTAAAG CTCCAGCAAA TTTAAAGGAT
GCAACAGGTC AAGCGAAGCA ATTTCACTTT GAATGGGATG ATCCAAAAAA GCTAGGTATT
ATCTTAGGCC ATCACCTACT TTTCTTAGGA TTTGGTGCTT TATTATTAGC AGCAAAAGCT
ATGTATTTCG ATGGGCTTTA TGATGCGACA ACTCAAACAG TTCGTTTAGT AACAAATCCC
ACTCTCGATC CATTGGTAAT ATATGGATAT CAAACTCATT TCGCCACTGT TAATAGTTTA
GAAGACCTAG TTGGAGGTCA CATTTATGTG GGATTTATTC TAGTTTTAGG TGGCATTTGG
CACATTATTA AGGAACCCCT TCCTTGGGCA AAACGGGTTC TCATCTTCTC TGGTGAAGCA
ATTCTTTCTT ACTCTCTAGG TGGAATTGCC CTAGCAGGGT TTGTTGCAAC TTATTTCTGT
GCTGTTAATA CTTTGGCTTA TCCTGTAGAA TTTTACGGTC CTGTTTTGGA TATTAAGTTA
GGAATTACTC CTTATTTTGC CGACAGTGTC AAGTTGCCTA ATGGTGCTCA TACTGCTCGT
TGTTGGTTGA CAAATGCTCA TTTCTTCTTA TCGTTCTTTT TCTTGCAAGG TCATTTATGG
CACGCTCTGA GAGCTATAGG TTTTGACTTT AAGCGCGTAG AAGATGCTTT GAATGGCGTA
GCAGAAGAAT CTTAA
 
Protein sequence
MQTYGSPEVK YDWWAGNARF AKLSGLFISA HAGQAALITF WAGAFTLFEI SLYSPDLPMG 
EQGLILLPHL ATLGLGIGKG GAVVDTYPYF VVGIVHLISS AVLGAGALFH TFKAPANLKD
ATGQAKQFHF EWDDPKKLGI ILGHHLLFLG FGALLLAAKA MYFDGLYDAT TQTVRLVTNP
TLDPLVIYGY QTHFATVNSL EDLVGGHIYV GFILVLGGIW HIIKEPLPWA KRVLIFSGEA
ILSYSLGGIA LAGFVATYFC AVNTLAYPVE FYGPVLDIKL GITPYFADSV KLPNGAHTAR
CWLTNAHFFL SFFFLQGHLW HALRAIGFDF KRVEDALNGV AEES