Gene Tery_3585 details

Gene Information

Locus tag: Tery_3585
Symbol:
ID: 4244218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5513152
End bp: 5515257
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 30%
IMG OID: 638108550
Product: GCN5-related N-acetyltransferase
Protein accession: YP_723139
Protein GI: 113477078
COG category:
COG ID:
TIGRFAM ID:
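
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length (neither assumption is stated in this record). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 5513152
END_BP = 5515257
GENE_LENGTH_BP = 2106
PROTEIN_LENGTH_AA = 701


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one stop codon, assuming a complete unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2106
print(expected_protein_length(GENE_LENGTH_BP))  # 701
```

Under these assumptions both results agree with the values in the table.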


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAAA AAATAAAAAC TATAGAAATT GACGAAAAAT CTCCCCATTT ACAAGCTGTT 
ATTAAATTGG GCGATGCTAA TACAAAAACC CTAGGTCATT TATCCTACGA TACTTTTTTT
GATTATGCAA AACGTGGCCA AATTATTATA GCTTTAGATC CAAAAGAAAA TTTTATTGGT
TATCTGATGT ACCGAAAGGT TCGTCGAAGT AATCATCTTG TAATAGTTCA TTTATGTGTT
GCTGAATCAT CCCGCGGTAG GGGTTTGACT AGGACACTAG TAAATTATTT AAGTCAAAGG
CATCAAGATT TTTATGGTAT TAAGTTAAAA TGCCGTCGCG ATTATGGATT AGACGAAATG
TGGTCTAAGT TAGGCTTTGT TCCCTTAAAT GAAAAGCCTG GAAAAAGTAA GGAAGGAAAA
CCCTTAACAG TTTGGTGGCT AGATCATGGT CATCCCAATT TATTTTCTGT TAATGCTACC
GAAAAGCTCA ATTCTAAACT ATGTGCAATT ATTGATATAA ATGTTTTCTT TGATTTGTCT
GAAAATGATA GTTTTAAAAG TGAAGAATCA GAGTCTTTAC TTGCTGATTG GCTACAAGAT
GACTTAGAGT TATGTGTCAC TGATGAAATT TTTAATGCTA TTAATAATAT TACTGATTCC
CAAGATAGAA AATATCAACG TCTCAATGCA GAAAAGTTTA CAAAATTACC ATATAATCAG
CAAAATTTTG AGGTTTTTTT AAACTCTTTT ATAGAGTTTA TGGAAGGAAG TAATCTCAAA
CTTGATCAAT CACAAATACG TCAGCTAGCT ATGACACTTG CTTCTGAATG CCTAATTTTT
GTAACTCGTG ATTCACATAT TTTAAATTTA ACCGATGAAA TCTATAAAAT TTTTAAGCTA
GAAATAACTA ATCCTAGTAA CCTCATTATT AACCTAGATG AAATTCGTAG AAGCACTACA
TATCAACCTG TACGATTAGC GGGTTCAAAT CAAATCAATC TTCAATCTGT ACCGATCAAT
CAAATCAAGG GGCTGACTGA TTATTTTTGG AGTCAAAAAA ATGGTGAGAA AAAAGAATAT
TTTCAGCAAA AAATTCGCAG TTTTATTAGT AATTGTTCTA AGTTTATATG TCATCAGCTT
GTAATGGAAG AAAATCAACC AGTCGGTTTA ATTGTTTATA ATAAATCAAA ATCTTCTGAA
TTAGAAATTC CATTACTAAG AGTTAAAGAT AATTCTTTAG GTGGAACACT TGCACGTTAT
TTAATTTTTC ATGCCATAAT AATTTCAGCT CAGGAGAACA GATATTTTAC AAGAATCACT
GACCCCTATT TAGAAGAAAT AGTTGTTAAA GCTATTCAAG AAGATGCTTT TATTAAGAAT
AAGGGAGAAT ATATAAAGGT AAATATTCCC GTAGCAGAAA CAGCATCTAA TCTATCTAAG
CGTCTCAATG ATTTAGCTAA ATTTGAATTA GAATATCAAT CAAATTTTTG TATAAAATTT
GCAAAAACTA TTAGTCAATC TGAATCAACG AAAAATTCAC AAACTACAAT AGAAATAGAA
CGTTTTCTGT GGCCTGCAAA AATAATTGAT GCTAATGTGC CTACATGGAT TATTCCTATT
AAACCATTTT GGGCAAAAGA TTTATTTGAT GAAGAGTTAG CTAATTATTG GTTATTTGGC
TCTAAAACTG AATTAGCACT TAAACGTGAA CTTGTATTTT ATCGTTCTAA GGGCGGTTTA
AAACCTGGTG TAATTGGTAG GATTATATGG TATGTCAGCA ATGATAAATC TTTTCCATAC
GGGACAACAA AAGTAATTAA AGCTTGTTCT CGATTAGATG AAGTCATAGT AGACAAACCA
GAAAAATTAT ATAGACAATT TCGTAATCTA GGTATATATA AATTAGAAGA TTTAATAAAA
ATCACCAACA ATGATCCTAA TGAAGATATA ATGGCTGTTC GATTTAGTGA TACTCAAGTG
TTCACTAATA CTATAACTTT AAAAGAACTT CAAGATATAC TAAAAAAACA GATAACTGTT
CAAGGACCTT TTAAAATAAC ACCAGATCAA TTTGCTAAAA TATATGATCA AATCAACAAA
AATTAA
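
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to compute GC content as a percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First displayed line of the gene sequence, copied as shown.
first_line = "ATGACAGAAA AAATAAAAAC TATAGAAATT GACGAAAAAT CTCCCCATTT ACAAGCTGTT"
cleaned = clean_sequence(first_line)
print(len(cleaned))  # 60
print(round(gc_percent(cleaned), 1))
```

Running `gc_percent` on the full cleaned sequence, rather than this one line, gives the figure to compare against the GC content listed in the Gene Information table.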
 
Protein sequence
MTEKIKTIEI DEKSPHLQAV IKLGDANTKT LGHLSYDTFF DYAKRGQIII ALDPKENFIG 
YLMYRKVRRS NHLVIVHLCV AESSRGRGLT RTLVNYLSQR HQDFYGIKLK CRRDYGLDEM
WSKLGFVPLN EKPGKSKEGK PLTVWWLDHG HPNLFSVNAT EKLNSKLCAI IDINVFFDLS
ENDSFKSEES ESLLADWLQD DLELCVTDEI FNAINNITDS QDRKYQRLNA EKFTKLPYNQ
QNFEVFLNSF IEFMEGSNLK LDQSQIRQLA MTLASECLIF VTRDSHILNL TDEIYKIFKL
EITNPSNLII NLDEIRRSTT YQPVRLAGSN QINLQSVPIN QIKGLTDYFW SQKNGEKKEY
FQQKIRSFIS NCSKFICHQL VMEENQPVGL IVYNKSKSSE LEIPLLRVKD NSLGGTLARY
LIFHAIIISA QENRYFTRIT DPYLEEIVVK AIQEDAFIKN KGEYIKVNIP VAETASNLSK
RLNDLAKFEL EYQSNFCIKF AKTISQSEST KNSQTTIEIE RFLWPAKIID ANVPTWIIPI
KPFWAKDLFD EELANYWLFG SKTELALKRE LVFYRSKGGL KPGVIGRIIW YVSNDKSFPY
GTTKVIKACS RLDEVIVDKP EKLYRQFRNL GIYKLEDLIK ITNNDPNEDI MAVRFSDTQV
FTNTITLKEL QDILKKQITV QGPFKITPDQ FAKIYDQINK N
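
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration that builds the codon table from the usual compact TCAG-ordered encoding. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons it permits; start codon handling is not modelled here. The `translate` function is a hypothetical helper that stops at the first stop codon.

```python
from itertools import product

# Compact encoding of the codon table, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks and the final block of the gene sequence.
print(translate("ATGACAGAAA AAATAAAAAC TATAGAAATT"))  # MTEKIKTIEI
print(translate("AATTAA"))                            # N
```

The two outputs match the first ten residues and the last residue of the protein sequence shown above.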