Gene Tery_4062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4062 
Symbol 
ID4242090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6274292 
End bp6275599 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content32% 
IMG OID638108966 
Producthypothetical protein 
Protein accessionYP_723547 
Protein GI113477486 
COG category[R] General function prediction only 
COG ID[COG2607] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000639264 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTATT TAGATTCTCC AGTGTTAGCC CAAATTAAAA GCTTACAACA GCAAATAGCA 
TCTCTATTGT TGTATCAGTC GGTTTTACAA AAAACAGTGG GTGAAGCTTT TATTAAGTTA
CTAGAATCTT TGTCTAAAAA TGATGGAGTT GCTATAGAAT GCCTTCGAGC TTATGGTAAC
TGGTTTAGAA CTATTGCTAG TCAAAATATT TCCTGGCAAG AATATCTAAT TGATCGGATA
ATTCAGGATG ATAATCCCTT TAGTCAACAA GTTCAAACAA TCAATATTAC TAACCTATCT
CCTAATCTAA TTGCAGCAGC AAAAAATGAT TTGCAAATTT TACAAAATCT TTATAAGTGC
AATAGTAAAA CAATTGGTCA GTGGGTAAAA ACAATAGCTA AATTAGAAAA ACTTCCTATT
ACTTGGGAAG AAAAAATAGA AAGAAAAGAA CCTATCTATC AAAAGTTCAT AAAGTCAGAA
AATTGGGCAG ATTCTATTGA GGCTTTAGCT ACTCATTATC GTGATTTTGG TATAGGTTTA
TTTGCTGAAT CAATTGCTTT TGAATGGCGT AATGATCAAC TATTAACTAT TACTTATCCT
GATCCAGTTA AATTAAAGGA ACTAGTAGGA TATGAATTCC AACGAGATAC TTTAATTAAA
AATACAGAAT TTCTTCTGGC AGGATATCCA GCTCTTAATG TTTTACTTTA TGGTAGTCGT
GGTTCTGGGA AATCTTCCTT AGTAAAAGCT TTATTAAATG AGTATAGTCA GAGGAATCTC
CGTTTAGTTG AAGTAGCTAA GTCTGACTTA AAAGAGCTAC CATTAATTGT AGAAAAATTA
CGGAATGTTC CACAAAAATT TATTATCTTT GTTGATGACC TTTCTTTTGA AGAAGATGAT
GATACTTTTA AGGCTTTAAA AGTTGTTTTA GAAGGTAATT TAACTGCTAG ACCAGCAAAT
GTGGTAGTTT ATGCTACTTC AAATAGAAGG CATTTAATAA GAGAATTTTT TAATGATCGC
CCTTCCCCAA AAGATAGTGA TGAGGTGCAT AATTGGGATA CAGTTCAAGA AAAGCTTTCT
TTTAGCGATC GCTTTGGTTT AACTTTAACT TTTGAACCTG CTAACCAAGA TACCTATTTA
AAAATTGTTA GACATCTAGC CAAGCAAGAA AAGGTAAATC TAAATCCTGA AGATTTAGAT
TATCAAGCCT TACAATGGGC AACTAGAAAT AATGGTCGTT CTGGGCGAGG GGCGCGACAA
TTTATTGATT TTATCAAAGC TAATTTAGCA GTTTTTGAGA AGTGTTGA
 
Protein sequence
MSYLDSPVLA QIKSLQQQIA SLLLYQSVLQ KTVGEAFIKL LESLSKNDGV AIECLRAYGN 
WFRTIASQNI SWQEYLIDRI IQDDNPFSQQ VQTINITNLS PNLIAAAKND LQILQNLYKC
NSKTIGQWVK TIAKLEKLPI TWEEKIERKE PIYQKFIKSE NWADSIEALA THYRDFGIGL
FAESIAFEWR NDQLLTITYP DPVKLKELVG YEFQRDTLIK NTEFLLAGYP ALNVLLYGSR
GSGKSSLVKA LLNEYSQRNL RLVEVAKSDL KELPLIVEKL RNVPQKFIIF VDDLSFEEDD
DTFKALKVVL EGNLTARPAN VVVYATSNRR HLIREFFNDR PSPKDSDEVH NWDTVQEKLS
FSDRFGLTLT FEPANQDTYL KIVRHLAKQE KVNLNPEDLD YQALQWATRN NGRSGRGARQ
FIDFIKANLA VFEKC