Gene Tery_3937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3937 
Symbol 
ID4244020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6090836 
End bp6093529 
Gene Length2694 bp 
Protein Length897 aa 
Translation table11 
GC content37% 
IMG OID638108859 
Productglycosyl transferase family protein 
Protein accessionYP_723441 
Protein GI113477380 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0634036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000739311 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGCCC AAGAAAAAAA ACAACCCAAT ATTAATCATA TTCTCACCCT AGCTATCCTG 
TGGTCATTAG GGGCCGTGAG CGATCGCCTC TGGTTTACCT TCGATAAATC AGTTCCAGCT
TGGGATCAGG CCGACTATCT CACAAGTTCC CTAACCTACT GGCGTGCTTT GCAAAACCTA
CAATTATTCT CTGGAGAATG GTGGAAAAAC CTTTGGCAGC TTTCCCCAAA AGTACCCCCC
TTAACTTATA TTCTTGCAGT TCCCTTTCAA AATATTTTTG GTAGAGGAGC TGACCAAGCC
ACCTTAGTAC ATCTATTATT TAGTGCTATT TTACTAACCT CAGTTTACAG TTTAGGCAGC
AAACTATTTA ATCAAAAAGT AGGTTTTTGG GCAGCAGTAT TATGTATATT ATTTCCAGGG
TTATATAGAT ATCGTTTACA ATTTTTACTC GACTATCCCC TAACAGCAAT AGTAACCTTA
AGCTTCACTT GTTTAACACT TTGGTATTTT TCAAAACCGA ATCATAAACA GGAAATAGAC
AACAGCGAAA AGACAACAAA AAAAACCGAA AAAAAAGCAA TTAACTTAGA CTTAGAAAAA
ACCAACCTTA CCAAAACCAA CATCACAAAA ATATCTCCTC AAATAGATCA ACTATCTCAA
TCCAAAAATC TCAAAAATTG GCTATTAGCA ACTGCTTTTG GCTTAACTTT TGGCCTAGCA
ATTTTAGTTA AGCAAACCAC AATATTTTTT CTGTTTTTTC CCTTACTTTG GGTAACTATA
AATATTCTCA AAAAAAAGCA ATGGAACCAA CTAATTCAAC TCACTTACTC TCTATGCTTA
TCCATCACAA TTTTTTACCC TTGGGCAAGA ACAAATTGGC TATTAATGTT AACCTCCGGT
AAACGAGCAA CAATAGACTC TGCCATAGCC GAAGGAGACC CCAACCTATT AAGTTTAGAT
GCTTGGATTT ATTATGGAAA ACTACTCCCC AACCATATTT CCTTACCTCT ATTAATCATT
CCAATTAGTG GATTAATTTT TTACTTAATC CGGCTCAATA AAAAACAAAA AAACCTTTCC
ACTCAGCTCC ATTCTTTTCA ATGGCTATCC ATATTTTTAA TCGGCGGATA TCTAATTTGC
TCCCTCAACA TCAACAAAGA CTTTCGTTAC ACCTTACCAT TATTGCCAAC CTTATCAATT
TTATTAGCCT TTGGTTTAAT GCAATTTCCA AGAAAAGTAG GAAAACAAAT TCGCTATCTT
ACTATTAGTT TAGCAATTAT TTTAATGCTA TTAAATATCT GGTCTGTAGG TGGAAACTTC
CCCAGAAAAA TCACCGCTTG GTTAAGTCCT GGTGCAGCTT ATTCTGCTCA TTTAGGCAAA
GAATGGCCCC ATAAACAAAT AATTGCTGAA ATCATCAAAA CTAACCCCTA TTTACGGTCA
ACCTTAGGAG TTTTACCTTC CACTCCAGAA ATTAATCAAC ATAACTTAAA TTATTACGGT
GCTCTGCAAA ACTTACAAGT CTATGGTCGG CAAGTAGGAA CCAACTTCAA ACAAGTAGAA
CAAGATGCGC GATCGCTATC TTGGTTTATC ACAAAAACCG ATGAACAAGG TTCTGTCAAA
AGAATCAAAA AAGCTCAAGC TGCTATTGTT AACCTCATCG AAAAACACCC AAACTTTCAA
CTACAAAAAA GCTGGCAGTT ACCAGATAAT AGTAACTTAA ATCTATATCA TAATCAGTTT
TTACCAGTAG AAGTTCATCC CCTGACCAAA CCACAAAAAA AAGTGAAACT AGACTATATA
ATTTTATCCC CAAAAATTCG CTCAGGAACA ACCATACCTA TTACCTACAA ATGGTCCGGT
TCTTGGCAAA AACTACAGTC AGGTTTAGTC TTATTAACCT ATCGCCGAGA AAATATTAAT
CAAACAGAAC TTCAGCCATT AACTTCTAAC AATAATGCAT CAGAAAAACC TCTTACCAAA
TTCATTCACG ACCACGGTAT CGGCATGGGC AGTTTACACC CTAGTCTACT TCAAGCAAAT
AAGTCTGAAG TCGGGTTCGA AATTATTGAA CGAACAGGAA TGTTACTCCC ATCTAACATT
GTACCGGGAA CTTATATATT AGAAGCCACC TATCTTAACC GAGAAACCGG GGAAAATTAT
CCCATTTCTG TAGAACCAAC CCTGCGAGTA CAGGTTGAAC CCCAAGCACA AGCGTTACCA
GCACCAGAAC TAGATCTAGT AACCCAGTTG CGGACGTGGG GCCAAAAGTT ACCCCAGGGA
ATAAAAGGTT TGGAAACCAT ATTTGCGGAA GTAGCGCGAG TTAACCAATA CGATCCAGTT
CAAGACTATA CTCAGCAAGC AGAAAAAACT TTAACCTATC GTTTGCAACA GGAACCAGAT
AATTTAGAGT GGCTCTATAG TTTAGCTTTA GCCCAAGTTT TACAAAAAGA TGCCCAGGGA
GCGATCGCTA CCTTAACTAG AGTAACTAAA CTAGATGCTC AAAACGCCTT TGCTCATGCT
TATTTAGCTT TTGTCTATTT ATACGATCTT AATCCTGGAG CAGCGAAAAT AGCATTAAAG
TCAGCTTTAG AAATTGATCC CCATCAACCA GAAATTAGGT CATTAAATGG TATTGCTGCT
TTAATGCAAG GAAATTTTAT TCAAGCATGG CATAACTTGA TAACTCAAAA ATAA
 
Protein sequence
MTAQEKKQPN INHILTLAIL WSLGAVSDRL WFTFDKSVPA WDQADYLTSS LTYWRALQNL 
QLFSGEWWKN LWQLSPKVPP LTYILAVPFQ NIFGRGADQA TLVHLLFSAI LLTSVYSLGS
KLFNQKVGFW AAVLCILFPG LYRYRLQFLL DYPLTAIVTL SFTCLTLWYF SKPNHKQEID
NSEKTTKKTE KKAINLDLEK TNLTKTNITK ISPQIDQLSQ SKNLKNWLLA TAFGLTFGLA
ILVKQTTIFF LFFPLLWVTI NILKKKQWNQ LIQLTYSLCL SITIFYPWAR TNWLLMLTSG
KRATIDSAIA EGDPNLLSLD AWIYYGKLLP NHISLPLLII PISGLIFYLI RLNKKQKNLS
TQLHSFQWLS IFLIGGYLIC SLNINKDFRY TLPLLPTLSI LLAFGLMQFP RKVGKQIRYL
TISLAIILML LNIWSVGGNF PRKITAWLSP GAAYSAHLGK EWPHKQIIAE IIKTNPYLRS
TLGVLPSTPE INQHNLNYYG ALQNLQVYGR QVGTNFKQVE QDARSLSWFI TKTDEQGSVK
RIKKAQAAIV NLIEKHPNFQ LQKSWQLPDN SNLNLYHNQF LPVEVHPLTK PQKKVKLDYI
ILSPKIRSGT TIPITYKWSG SWQKLQSGLV LLTYRRENIN QTELQPLTSN NNASEKPLTK
FIHDHGIGMG SLHPSLLQAN KSEVGFEIIE RTGMLLPSNI VPGTYILEAT YLNRETGENY
PISVEPTLRV QVEPQAQALP APELDLVTQL RTWGQKLPQG IKGLETIFAE VARVNQYDPV
QDYTQQAEKT LTYRLQQEPD NLEWLYSLAL AQVLQKDAQG AIATLTRVTK LDAQNAFAHA
YLAFVYLYDL NPGAAKIALK SALEIDPHQP EIRSLNGIAA LMQGNFIQAW HNLITQK