Gene Tery_4279 details

Gene Information

Locus tag: Tery_4279
Symbol:
ID: 4245931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6600206
End bp: 6601207
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 40%
IMG OID: 638109171
Product: UDP-galactose 4-epimerase
Protein accession: YP_723749
Protein GI: 113477688
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID: [TIGR01179] UDP-glucose-4-epimerase
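
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive, which is consistent with the lengths reported in the record.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 6600206
end_bp = 6601207

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1002 333
```

Both values agree with the Gene Length and Protein Length fields listed in the record.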


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.277879
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0248708
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCACAAA CCAAGCCAAC GATTTTAGTG AGTGGGGGAG CAGGATATAT TGGTTCCCAT 
GCAGTTCAGG CTTTACAAAA TGCAGGTTAT GATATAGTGA TTTTGGATAA CCTCGTCTAT
GGACATCGGG ACATTGTAGA AAATGTCCTG AAGGTAGAAA TGATTGTTGG GGATACTAGC
GATCGCTCTT TATTAGATAA AATTTTTGCT ACTCACAACA TTGCTGCAGT AATGCATTTT
GCAGCATATA TTTTTGTCGG TGAATCAGTA AAGGATCCGC AGAAATATTA CCACAATAAT
GTAGTTGGGA CACTAACATT ATTAGAAGCG ATGCTTAAAG CATCCATAAA AAAGTTTGTT
TTCTCTTCAA CTGCTGCTAT TTATGGGAAA CCACAAACCA TTCCTATTCC CGAGGATCAT
CCGAAAAATC CAATTAACCC TTATGGTGCA AGTAAGCGGA TGATAGAGCA AATACTTGCA
GATTTTGAGA TCGCTTATGA TTTTAAGTCG GTTTGTTTTC GCTACTTTAA TGCAGCAGGA
GCACATCCTA ATGGTTTGAC TGGGGAAGAT CATAACCCGG AAACTCATTT AATTCCTCTG
GTATTGTTTG CAGCATTGGG CAAGCGAGAT TCTATATCAA TTTTTGGCAC AAATTATAAG
ACTCCTGATG GTACTTGTAT TCGAGATTAT ATTCATGTGT GTGATTTAGC GGATGCTCAT
GTTTTGGGGT TAGAATATTT GTTGAATGGT GGTGAGAGCA ATATTTTTAA TTTGGGCAAT
GGTAATGGGT TTTCGGTTAG GGAAGTGATA GAGACTGTGA AGCAAGTAAC TGGTAGAGAG
TTTAAAGTGG AGGAGCGCGA TCGCCGACCT GGAGATCCAC CTATTTTGGT AGGGAGTAGT
GAGAAAGCCA GGAAAGTGTT GGGTTGGTCT CCGAAATATC CAGAGGTTAA GGAAATAGTT
AGTCATGCTT GGCAGTGGCA TCAAAAACGA CATGGGAAAT GA
 
Protein sequence
MSQTKPTILV SGGAGYIGSH AVQALQNAGY DIVILDNLVY GHRDIVENVL KVEMIVGDTS 
DRSLLDKIFA THNIAAVMHF AAYIFVGESV KDPQKYYHNN VVGTLTLLEA MLKASIKKFV
FSSTAAIYGK PQTIPIPEDH PKNPINPYGA SKRMIEQILA DFEIAYDFKS VCFRYFNAAG
AHPNGLTGED HNPETHLIPL VLFAALGKRD SISIFGTNYK TPDGTCIRDY IHVCDLADAH
VLGLEYLLNG GESNIFNLGN GNGFSVREVI ETVKQVTGRE FKVEERDRRP GDPPILVGSS
EKARKVLGWS PKYPEVKEIV SHAWQWHQKR HGK
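
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: the function names are hypothetical, the codon table is the standard one (translation table 11 assigns the same amino acids to codons and differs mainly in which codons may initiate translation), and only the most common bacterial initiator codons are handled here, although table 11 lists additional, rarer ones.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified set of initiator codons (ATG plus the common
# bacterial alternatives GTG and TTG). An initiator codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as a start codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    return "".join(protein).rstrip("*")  # drop the terminal stop codon


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGTCACAAA CCAAGCCAAC GATTTTAGTG"))  # prints: MSQTKPTILV
```

The gene begins with GTG rather than ATG, which is why the first residue is M even though GTG encodes valine elsewhere in a reading frame. Passing the full gene sequence to `translate_cds` and `gc_content` would allow the protein sequence and the GC content field to be compared against the record.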