Gene Tery_3801 details


Gene Information

Locus tag: Tery_3801
Symbol:
ID: 4242251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5842271
End bp: 5844271
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 32%
IMG OID: 638108735
Product: glycosyl transferase family protein
Protein accession: YP_723319
Protein GI: 113477258
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
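
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
start_bp, end_bp = 5842271, 5844271
gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # prints "2001 666"
```

Under these assumptions, the computed values agree with the reported Gene Length (2001 bp) and Protein Length (666 aa).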


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.361968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAA TACAACAGTT AGAAAATTTC AAAAAAATTA AAGCCAAACA AAAAAAATAT 
CTTTTAATAT TAACACTTTC TATTTTATAC CTCTGTTTAC TTGCTTCCTT AGCCTTCTTC
TATAACCTTG GTAATATTGG TTTAATTGAT GAAACAGAAC CAATTTTTGC CGAAACTGCT
CGTCAAATGG TAAAAACTGG AGATTGGATT ACACCTTATT TTAATGGAGA AACTCGCTTT
GATAAACCAC CTTTAATTTA TTGGTTAATA GCAATTTCTT ATCATCTTTT TGGTATTAAT
GAATGGTCTG TCCGTCTCCC ATCAGCAATA TCTGGAACCG GCTTAATGTG CCTTGGTTTT
TATACCCTCT ACAGATATGG TTATTATCAC CTGAATCCTC AAATATATAC CCCTAAAAAT
AAACTTTTAA TAGTGAAATT ATTAATAGGA TATATTGGGG CAGCGATGAT AGCCATAAAT
CCAGAAACTA TTGTTTGGGG ACGTATAGGA GTTTCTGATA TGTTATTAAC AAGCTGTATG
TGTTCAGCAC TACTAGCCTT TTTTATAGGA TATGCTTCAC AAACAGAAAA TGCTCTTATA
CATCAGCAAA AAAACAGCAA AATCATCATA CAGAAAACCT CTTATTTACC AAATCAAAAT
CAATCCTCAA AACCTAGAAA ATCTCCCCTA TTCAACAAGT GGTATTTAGC CTTTTATATA
CTAATATCTT TAGCAGTTCT CACCAAAGGT CCCATTGGAA TTGTTTTACC AGGAATAATT
ATTGGTTCCT TTTTATTATA TGTAGGTAGA CTTTTTAAAG TCTTGCAAGA AATCAAAATT
TGGTATGGAA TTTTAATTTT TTTTACCATT ACATTCCCTT GGTATTATCT AGTTACCTTG
GTAAATGGAA AAGAATACAT TGATAGTTTT TTTGGGTATC ACAATTTTGA ACGTTTTACC
AGAGTTGTGA ATCACCATCA AGGACCGTGG TATTTTTACT TTTTAGTCGT ACTAATTGGT
TTCGCTCCCT GGTCTATTTA TTTACCAGTA GCCATAGCTA AAACTAAGTT TTGGCAACCC
TATTATTGGC GTCATAAACC GAGAAATAAA CAGTTAGGTT TATTTGCCTT TTTCTGGTTT
ATTTGTATCT TTGCTTTCTT CTCCATCTCT GCTACTAAAC TACCTAGCTA TGTCTTACCA
ATAATGCCCG CCGCAGCAAT ATTATTAGCA CTATTTTGGA GTAATATTAT TCTTCACAGA
TATTCTCTAT CTAGTCAGAC TAATAAACCT GAAAATAACT CCACTCAATC ATCATTTAAA
GCCACAAATA ACCCTACTCA ACCTATTTCC AGATTAACGA GAAATACTTC TAAATCAAAA
AGTAAATTTT TATCTTTCTC AGTTGTCGCC AACATTATTT TTTTGTTGAT TTTAGCCTTA
GCAATTATCT ACAGTTTTAA CTGGTTAGAT AGAGACCCAG CCATGCCATA TTTCTCAGAA
ATAATTAGAA AATCTGGCTT ATTAATTCGT GGTGGCTTAA TTTTAATAAC CACAGCAATA
GTCATTGGAT TTTTTGTCAT AAAAAAACAA AATTCTTGGG TTTGGAGTGC TAATTTTATT
GGGTTAGTAG CTTGTTTAAT TTTTACTATT AACCCGATCA TGTTTTTAGT AGATCAAGAA
CGTCAATTAC CTTTACGTCA GCTAGCTCAA ACTATTATTC AAGCCAGACA ACCAGGAGAA
GAAATAATTA TGGTTAGCTT TGAAAAACCT AGTTTAGTTT TTTATACTAG GCAACAAGTA
AAATTTTTTC GACGTGCTAC AGATGCCAGA GAATATCTAG GGAAAAATCT CTCAAAAAAC
TCTTCTGATA ATGTATTGAT AATTGGCTAC CCAAAAAAGT TTATTCATAT AGGATTAAAA
CCAGGGCAAT ATCAATATTT AGACAGTCGT GGTGCTTATC AATTAGGTAA AGCTCCTAAA
AATATCTTTT TACCAAAATA A
 
Protein sequence
MKLIQQLENF KKIKAKQKKY LLILTLSILY LCLLASLAFF YNLGNIGLID ETEPIFAETA 
RQMVKTGDWI TPYFNGETRF DKPPLIYWLI AISYHLFGIN EWSVRLPSAI SGTGLMCLGF
YTLYRYGYYH LNPQIYTPKN KLLIVKLLIG YIGAAMIAIN PETIVWGRIG VSDMLLTSCM
CSALLAFFIG YASQTENALI HQQKNSKIII QKTSYLPNQN QSSKPRKSPL FNKWYLAFYI
LISLAVLTKG PIGIVLPGII IGSFLLYVGR LFKVLQEIKI WYGILIFFTI TFPWYYLVTL
VNGKEYIDSF FGYHNFERFT RVVNHHQGPW YFYFLVVLIG FAPWSIYLPV AIAKTKFWQP
YYWRHKPRNK QLGLFAFFWF ICIFAFFSIS ATKLPSYVLP IMPAAAILLA LFWSNIILHR
YSLSSQTNKP ENNSTQSSFK ATNNPTQPIS RLTRNTSKSK SKFLSFSVVA NIIFLLILAL
AIIYSFNWLD RDPAMPYFSE IIRKSGLLIR GGLILITTAI VIGFFVIKKQ NSWVWSANFI
GLVACLIFTI NPIMFLVDQE RQLPLRQLAQ TIIQARQPGE EIIMVSFEKP SLVFYTRQQV
KFFRRATDAR EYLGKNLSKN SSDNVLIIGY PKKFIHIGLK PGQYQYLDSR GAYQLGKAPK
NIFLPK