Gene Tery_4380 details

Gene Information

Locus tag: Tery_4380
Symbol:
ID: 4246033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6747162
End bp: 6748454
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 38%
IMG OID: 638109266
Product: GCN5-related N-acetyltransferase
Protein accession: YP_723843
Protein GI: 113477782
COG category:
COG ID:
TIGRFAM ID:
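
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a coding sequence that ends in a single stop codon; the function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Tery_4380
print(gene_length(6747162, 6748454))  # 1293
print(protein_length(1293))           # 430
```

Both results match the Gene Length and Protein Length fields listed for this gene.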


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.693403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.287127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCTT TTATCCGCCA AAAGCAAAGC CAGGTGGTTA TCCGTTCTCT AGAGTACCGA 
GACCTAGAAG CTATCAAAAA GCTATGTCGG GAAACTCCAG AAGCCCAGAA TTGCCACCAA
GATGTTACTT TTAATCTCTC TCCCATTCAC CTTAGCCAAA AATTGCCAGA AATTCATCAT
TGGTACTGGC CTATAAAGCT GTTGAGCTTT TTCCCTAATC CCTGTAAATA TTTGTTAACT
TCTTATGTTG CAGAAGTAGA TGGACAAGTC AAGGGGGTAA TTCAAGTTTG TCCAAGTAAT
CGCACTCGCA GCAGTTGGCG GGTAGAAAAG ATATTTATCG AAGAAACAGG AAATACAAGA
GGAATAGGTT CTCAATTACT TCGATACTGC TTTGAAAAAA TTTGGGAAGC ACGGACATGG
TTGTTGCAAG TTAATGTTAA TGATAAAGAT ACAATGGCAC TGTATCGCCA AAATGGTTTT
CAGCCATTGG CTCAAATTAC TTACTGGACA ATTACTCCCC AACAACTACA AGATTTAGCC
CTTTCACAGC CAAATTTACC CAACTTACTA CCTGTTAGTA ATGCTGATGC TCAGTTGCTC
TATCAATTAG ACACAGCATC AATGCCACCC CTGGTACGCC AAGTATTTGA CCGCCATATC
CAAGATTTTC AAACTAGTTT TGTTAGTGGT TTGATAGAAG GAATCAGGCA ATGGCTTAAT
CATACTGAGT TGGTAAGTGG TTATGTGTTT GAATCTCAAC GTAAGGCCGC AATAGGTTAC
TTCCAAGTAA AATTATGCCG TGACGGTACA CAACCACACA TAGCTGAGTT AACTGTACAT
CCAGCTTATA CTTGGTTATA TCCAGAATTA TTATCACAAA TGGCAACATT GGCTCAAGAT
TTTCCTAATC AATCTTTGCA ACTTGCTTGT GCTGATTATC AACCTGAACG TGAAGCTTAT
TTAGAACAGA TAAAAGCAAA ACAAATTGAA CATACTTTGT TAATGTCTCG TTCTGTCTGG
CATAAAATTC GGGAGTCTAG ATTAGTTATT GCCGAACTCC AGTTTTCAGG AGTTCTTCAA
AGTTTACAAC CAGTTCGCAA ACCAGTTCCT AGTCGTAGTT CTTGGTTGAA AATGATGCGA
AATAAAAAAG AGATGCGTAG TAACTCCCAG AAAAATACTA ATACTGATAA TAATCATGAT
TGGAAAGGGA AAAATGAAAA ATTAAATTCG GTATCGGAGT TTATACCTCA TCATTCAAAA
AGTGCAGAGA ATAATGATGA TAAAAAAATC TAA
 
Protein sequence
MKPFIRQKQS QVVIRSLEYR DLEAIKKLCR ETPEAQNCHQ DVTFNLSPIH LSQKLPEIHH 
WYWPIKLLSF FPNPCKYLLT SYVAEVDGQV KGVIQVCPSN RTRSSWRVEK IFIEETGNTR
GIGSQLLRYC FEKIWEARTW LLQVNVNDKD TMALYRQNGF QPLAQITYWT ITPQQLQDLA
LSQPNLPNLL PVSNADAQLL YQLDTASMPP LVRQVFDRHI QDFQTSFVSG LIEGIRQWLN
HTELVSGYVF ESQRKAAIGY FQVKLCRDGT QPHIAELTVH PAYTWLYPEL LSQMATLAQD
FPNQSLQLAC ADYQPEREAY LEQIKAKQIE HTLLMSRSVW HKIRESRLVI AELQFSGVLQ
SLQPVRKPVP SRSSWLKMMR NKKEMRSNSQ KNTNTDNNHD WKGKNEKLNS VSEFIPHHSK
SAENNDDKKI
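
The gene and protein sequences can be checked against each other by translating the DNA. The following is a minimal sketch using only the Python standard library; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons), and it assumes the sequence is in frame starting at its first base.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above
print(translate("ATGAAACCTT TTATCCGCCA AAAGCAAAGC"))  # MKPFIRQKQS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) checks the whole record; `gc_percent` can be compared with the GC content field in the same way.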