Gene Tery_5018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_5018 
Symbol 
ID4246673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7669368 
End bp7671188 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content40% 
IMG OID638109827 
Producthypothetical protein 
Protein accessionYP_724403 
Protein GI113478342 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACTC CAGTAGAGTT TACTTTTTCC GACTTGATCG CCGGGTATGT TACAAACTTC 
GACTCAGGCA CAGATATTTT TGGGCTCAAA ACAACAGATG GCAGAGAATT TAAAGCCAAG
TTAACTCCTA CTAGTTATGC TAAGCTAGTA CAAAACTTAG ACGAAGCATA CCCAGATGCT
ACAGGTGCTA TGCGATCGAT GCTGGTGCCT GGTAGGTATG TATTCACTTA TGGCGTTTTT
TACCCAGATA GTTCCATATT TGAAGCTAAA CAAATAGTAT TTGTCGGTCG TCAAGCAGAT
GACTATATAT TTGAAAAATC AAACTGGTGG GTACACCAGG TTCGCTCCCT AGCTAATTTT
TATGTCAAAG CTCAATTTGC TGGTGAAGAA ATAGACTACC GCAACTATCG CACAACTTTA
AGTCTTTCTG GTGTAAGGTC TCAGGTGAAT TTCCGACAAG AAACAGATAC TATCTCCCGG
ATGGTTTATG GGATGGCTAC AGCATATATG ATGACTGGGG AAGAAATTTT CCTAGAAGCA
GCTGAGAAAG GTACTGAATA TCTGAGAGAC CACATGAGAT TTGTGGACTT AGATGAAGGT
ATAGTTTATT GGTATCACGG TATCGATGTG CAAGGGGAAC GAGAGCAGAA AATTTTCGCT
TCAGAATTTG GTGATGACTA TGATGCTATT CCCGCTTACG AACAAATTTA TGCCTTGGCA
GGTCCATTAC AAACTTATCG CATTAATGGC GACCCTAGGA TAATGGACGA TACCGAGAAA
ACTATTAAGT TATTCAACGA CTTTTTTCTA GATAAAACTG ATCGCGGTGG TTATTACTCC
CACCTAGATC CTATTACTCT AGATCCTCTG AGCGAGTCAT TAGGTCGGAA CAAAGGTACT
AAAAACTGGA ACTCAGTTGG TGACCATGCA CCAGCATATC TAATTAATCT CTGGTTAGCG
ACAGAAAAAC CTGAATATGC AGATATGCTC GAATACACTT TCGATACTAT TGAAAAGCGT
TTTCCAGATT ACGAAAACTG CCCCTTTGTC AATGAAAAAT TCTTTGAAGA CTGGAGTGCA
GATCATACTT GGGGATGGCA GCAAAACCGG GCAGTAATTG GTCACAATAT GAAAATTGCT
TGGAATTTGA TGCGGATGAA TAGCCTCAAA CCTAAGGATA CTTATGTTGA ACTGGCAAAG
AAAATTGCTG AGGTGATGCC AGCAGTAGGG AGTGATCAAC AACGAGGTGG TTGGTATGAC
GTGGTGGAAA GAGCATTAGG AGAAGATGAA AAAAATCATC GTTTTGTTTG GCATGATCGC
AAAGCTTGGT GGCAGCAAGA ACAGTCTATT CTGGCTTATT ATATTCTTGC AGGAACTCTT
AAAGATCAAG AATATCATCG TTTAGGGCGG GAAGCTGCAG CTTTTTACAA CGCCTGGTTT
CTGGATACAG AAGATGGTGG GGTTTATTTC AATGTTCTTG CTAATGGTAT CCCTTTCTTG
GCAAGTGGTA ATGAACGAGG TAAAGGTTCC CACTCTATGA GCGGTTATCA CTCTACAGAA
TTATGCTATC TAGCTGCAGT TTATACTAAT CTATTGGTTA CTAAGCAGCC AATGGATTTT
TATTTTAAGC CTATTCCTAG TGGTTTCCCT GATAATATTT TACGAGTATC ACCAGATATT
CTGCCTCCTG GTAGCATTAA AATTGGTTCT GTAGAAATTG ATGGTAAACC TTACAGTGAT
TTTGATGCGG ATAAACTTTT TGTGAAGTTG CCTGATACTA AGGAACGGGT GAAAGTTAAG
GTCAATATTG TACCTAATTA A
 
Protein sequence
MTTPVEFTFS DLIAGYVTNF DSGTDIFGLK TTDGREFKAK LTPTSYAKLV QNLDEAYPDA 
TGAMRSMLVP GRYVFTYGVF YPDSSIFEAK QIVFVGRQAD DYIFEKSNWW VHQVRSLANF
YVKAQFAGEE IDYRNYRTTL SLSGVRSQVN FRQETDTISR MVYGMATAYM MTGEEIFLEA
AEKGTEYLRD HMRFVDLDEG IVYWYHGIDV QGEREQKIFA SEFGDDYDAI PAYEQIYALA
GPLQTYRING DPRIMDDTEK TIKLFNDFFL DKTDRGGYYS HLDPITLDPL SESLGRNKGT
KNWNSVGDHA PAYLINLWLA TEKPEYADML EYTFDTIEKR FPDYENCPFV NEKFFEDWSA
DHTWGWQQNR AVIGHNMKIA WNLMRMNSLK PKDTYVELAK KIAEVMPAVG SDQQRGGWYD
VVERALGEDE KNHRFVWHDR KAWWQQEQSI LAYYILAGTL KDQEYHRLGR EAAAFYNAWF
LDTEDGGVYF NVLANGIPFL ASGNERGKGS HSMSGYHSTE LCYLAAVYTN LLVTKQPMDF
YFKPIPSGFP DNILRVSPDI LPPGSIKIGS VEIDGKPYSD FDADKLFVKL PDTKERVKVK
VNIVPN