Gene Ava_1047 details


Gene Information

Locus tag: Ava_1047
Symbol:
ID: 3678599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 1276257
End bp: 1277267
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 44%
IMG OID: 637716383
Product: UDP-galactose 4-epimerase
Protein accession: YP_321566
Protein GI: 75907270
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID: [TIGR01179] UDP-glucose-4-epimerase
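
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1276257
END_BP = 1277267
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the span covers 1011 bp, which is 337 codons, or 336 residues plus one stop codon.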


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00240501
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATCAAA AAGTTTTAGT TACCGGTGGT GCTGGCTACA TTGGTTCTCA TGTGGTGCGT 
CAGCTAGGTG AAGCAGGTTA CGATGTTGTT GTGTATGACA ACTGTTCTAC TGGTTTACCC
CAAGCCGTAC TACATGGTGA GCTAATTATC GGCGATTTAA AAAATTCCGA ATGTCTTTCC
CAAGTATTTC ATCAACATCA ATTTGCGGCA GTTTTACACT TCGCGGCTAG TCTGAGCGTA
CCAGAATCTG TTGCCCGTCC CCTGGACTAC TACGCTAACA ACACACGCAA CACCCTCAAT
TTACTTCGCT GTTGTCACGA AACTGGTGTT AACCAAATTA TCTTTTCCAG CACAGCCGCA
GTTTATGGAC AACCGGAAAC TGCCGTTGTC ACCGAATCTA CACCAACTGA ACCGATTAAT
CCCTATGGAC GCTCAAAACT TTCTTGTGAA TGGTTGATTC GTGACCACGC CAAGGCTTCT
GATCTACGCT ATGTCATTCT ACGTTACTTT AATGTTGCTG GAGCCGAGCC TGGTGGGCGA
TTGGGACAGA TGTCAAAAGA CGCATCCCAT TTAATTCGTG TCACTTGTGA TGCTGCACTT
AAACGCAGAC TGGGAGTAAA AATTTTTGGT ACAGATTTTC CCACACCAGA CGGAACGGCA
ATTAGAGACT ACATTCATGT AGAAGACCTC GCAACAGCAC ATTTAGACGC TTTAGCTTAT
TTAGAGCAAG GTAATGCCAG CCAAATCCTT AACTGTGGTT ATGGACAGGG CTATAGTGTT
CGTCAAGTTA TCGAGCGGGT TAAGGCAATT TCTGGCGTAG ATTTTCCGGT GATAGAAGCG
GAACGTCGTT CTGGTGATCC GGTTTGTGTG ACAGCTTGTA GTGATAAAAT CCGTCACGTA
CTAGGATGGC AACCTAAGTA TGATGATATG AATCAGATAA TTCATAGTAC CCTCACCTGG
GAAATAAGTA AACAAGAATT GTTCCTCAAT TCATTAGCTG GAGTTGCATA A
 
Protein sequence
MNQKVLVTGG AGYIGSHVVR QLGEAGYDVV VYDNCSTGLP QAVLHGELII GDLKNSECLS 
QVFHQHQFAA VLHFAASLSV PESVARPLDY YANNTRNTLN LLRCCHETGV NQIIFSSTAA
VYGQPETAVV TESTPTEPIN PYGRSKLSCE WLIRDHAKAS DLRYVILRYF NVAGAEPGGR
LGQMSKDASH LIRVTCDAAL KRRLGVKIFG TDFPTPDGTA IRDYIHVEDL ATAHLDALAY
LEQGNASQIL NCGYGQGYSV RQVIERVKAI SGVDFPVIEA ERRSGDPVCV TACSDKIRHV
LGWQPKYDDM NQIIHSTLTW EISKQELFLN SLAGVA
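
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code (the tables differ only in which start codons they allow), and it strips the spaces used to group the sequence into blocks of ten. The helper names `translate` and `gc_fraction` are hypothetical, and this is not necessarily how the database derived its values.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # A trailing incomplete codon, if any, is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence listed above.
    print(translate("ATGAATCAAA AAGTTTTAGT TACCGGTGGT"))
```

Applied to the first thirty bases of the gene sequence, `translate` yields `MNQKVLVTGG`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.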