Gene Tery_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0072 
Symbol 
ID4242455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp100518 
End bp101846 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content31% 
IMG OID638105435 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_720054 
Protein GI113473993 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0791445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAATC TTCTTGAACA GGCAAAAATT GCTGCCCAAA AACAAAACTG GTCTTTAGTC 
AATTATTATC TACAGCAATT TATCTTAGAG AGTAAGTCAA ATAAAACTCC TTTAATATTC
CCAGACAACA ATGTTGTTAT AGATGAGGTA ATTGACTTAG CAATAGAAGT TATGGAAAAT
GGAGATTTTC AAGAAAAATG GGATATTAAT AAACTGTTTA AGCAAATTGG AAAACCAGCT
ATCGCTCCCT TAATTGAAAT TCTTAACGCC CAAGAAGTAG ATCTAGAAGA ACGTTGGTTT
ATAACTAGGA TATTGGCAGA GTTTAACACT GAAGAATTCC TAGAAGTTAT GGGGAATATA
GTTATTAGAT CAAAGCCAAA AGATATACAG GAAATAGCAG CAGAAATAAG AGCAAATTGT
GGTAATTCGG GAGTTGATTT ATTAACAAAT TTATTAGCAA AGCCAGAGTT AAAATTATTG
GTCATTAAAG CTTTAGCTAA AATTAACTGT ATTAGTATGA TTCCAGCATT GTTAACTTTC
GTTAAAGATG AAAATTATGA AGTGAGAATC ATGGGAATTT CTGCTTTGAA TAATTATTGT
GACTCACGCA TTCCTATTGT ATTAATTTCG GCTTTAAAAG ATAGAGTGGC AAAAGTTAGA
AAAACAGCAG TGATTAGTTT AGCTAGTTAT GCTAATTTAC ACCAAGAATT AGGTTTGGTG
AAGTTACTAC AACCCTTACT ATGGGATATG GATGTTGAAG TTTCTCAACA AGCAGCGATC
GCTCTGGGAA AAATTGGTAC CAATTTAGCA GCAACAGCTT TATATGAACT ATTAAAAACT
GCCACAGTGC CAATATTTTT AAAGATAGAT GCAGTGCGTG CTTTAGGTAG AATTGAAACT
CAAGTATCCT TAGAATATTT GCAAAATTTA CTAGAGTATA ATTTCTTAGT TGAAGATTAT
GATATGGCAC AAATTGTTAA TGAAATTATC ACGGCATTGG GTAAGTTAGA AAAACCTGAA
TTAAAACTTC AAGCAACAGA TATTTTAATT GAATTTATTA CCAGTCATAA CTCTAATCTA
GAAAATATTT GTGTGAAAAA ATCTTTGGCT TTAGCTCTGG GATACTTAGG AAATATTCAT
GCTTTAGAAT CTCTAATTCA ATTATTAGAA GATGAAGATA ATAGTGTTAG ACTTCATTCT
GTAGCAGCGA TGAAACAATT AGATTCAGAA AAAGCATATC AAAGACTTAT TTTTTTATCT
GCACAAGCAA GCCTTAACTC ACAGTTAAAA ACAGGAATAG CAATAGCTCT TGCAGAATTG
AATAATTAA
 
Protein sequence
MSNLLEQAKI AAQKQNWSLV NYYLQQFILE SKSNKTPLIF PDNNVVIDEV IDLAIEVMEN 
GDFQEKWDIN KLFKQIGKPA IAPLIEILNA QEVDLEERWF ITRILAEFNT EEFLEVMGNI
VIRSKPKDIQ EIAAEIRANC GNSGVDLLTN LLAKPELKLL VIKALAKINC ISMIPALLTF
VKDENYEVRI MGISALNNYC DSRIPIVLIS ALKDRVAKVR KTAVISLASY ANLHQELGLV
KLLQPLLWDM DVEVSQQAAI ALGKIGTNLA ATALYELLKT ATVPIFLKID AVRALGRIET
QVSLEYLQNL LEYNFLVEDY DMAQIVNEII TALGKLEKPE LKLQATDILI EFITSHNSNL
ENICVKKSLA LALGYLGNIH ALESLIQLLE DEDNSVRLHS VAAMKQLDSE KAYQRLIFLS
AQASLNSQLK TGIAIALAEL NN