Gene Francci3_3550 details


Gene Information

Locus tag: Francci3_3550
Symbol:
ID: 3904489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4245428
End bp: 4246495
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 68%
IMG OID: 637880871
Product: UDP-galactose 4-epimerase
Protein accession: YP_482631
Protein GI: 86742231
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID: [TIGR01179] UDP-glucose-4-epimerase
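
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single untranslated stop codon. Both are assumptions about this record's conventions, not something the record states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4245428
end_bp = 4246495
gene_length_bp = 1068
protein_length_aa = 355

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions the three fields agree: the span is 1068 bp, which is 356 codons, or 355 amino acids plus a stop codon.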


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.211522
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCGA CGAGTCGAGC CGAGACGGTC CTGGTGACCG GGGCGACGGG GTTCATCGGC 
TCTCATACCT GGGTGGACCT GCTCGCCGCG GGTCATCGGG TAGTGGGAGT GGACAACTTC
GTCAACAGCT CACCGCGGGT GCTGGACAGG CTGCGTAAGG TGGTGGACGG CGACATCGAC
TTCGTGCGGC TCGATGTCCG TGACCGGGCG GCGCTCGGTG ACGTGTTCCG CCGATGGAAG
ATCGATTCCG TTATTCACTT CGCCGCTCTC AAAGCCGTCG GCGAGTCGGT CGACATCCCG
CTGGAGTACT ACGACACGAA CGTCAACGCG ACGTTGGGTC TGGTCCGCGT GATGGCGGAG
CACGGCGTGC GCCGGCTGGT CTTCTCGTCC TCGTGCGCAA TCTACGGAGC GGCGGACAAG
GTACCGATCG CCGAGGACAC GCCGGCCCGC CCGACCAATC CCTACGCGCG CACCAAATGG
ATGTGCGAGC AGATCCTCGC CGACCTCTGC GCCCGGGATC CGTCCTGGCA CGTGACGTCC
CTGCGATACT TCAACCCCGC CGGGGCGCAC GAGTCGGGCC TGCTCGGCGA GGATCCCCGT
GGGGTGCCGA ACAACGTCAT GCCCTACCTG GCCCAGGTGG CGGTCGCCCG GCGCCCGGAG
CTGTCCATCT TCGGCGACGA CTACCCCACG CCCGACGGCA CGGGCGTACG CGACTACATC
CACGTGGTCG ACCTGGCGGA GGGCCATCGA CTCGCTCTCG ATCATCTCGA TGACCAGGCG
GGACATCGGG TCATCAACCT CGGGACCGGC GCTGGCACCT CCGTGCGGGA ACTGCTCGCG
GCCTTCTCCG CGGCCTGCGC TCGTGATCTC CCCAGTCGCG TCGTGGCGAG GCGGCCGGGG
GACGTCGCCG CCCTGGTCGC CGACGCGGCG CTCGCCCGTA CGGCACTCGG CTGGTCAGCC
CGCCGGGATG TCGCGGACAT GTGCCGGGAC GCCTGGGAGT TTCAGCGTCT CAATCCAGGG
GGGTACGACG ATGAGGAGGA GCCTGATGAG CTCGTCGGAC AGCCTTGA
 
Protein sequence
MTSTSRAETV LVTGATGFIG SHTWVDLLAA GHRVVGVDNF VNSSPRVLDR LRKVVDGDID 
FVRLDVRDRA ALGDVFRRWK IDSVIHFAAL KAVGESVDIP LEYYDTNVNA TLGLVRVMAE
HGVRRLVFSS SCAIYGAADK VPIAEDTPAR PTNPYARTKW MCEQILADLC ARDPSWHVTS
LRYFNPAGAH ESGLLGEDPR GVPNNVMPYL AQVAVARRPE LSIFGDDYPT PDGTGVRDYI
HVVDLAEGHR LALDHLDDQA GHRVINLGTG AGTSVRELLA AFSAACARDL PSRVVARRPG
DVAALVADAA LARTALGWSA RRDVADMCRD AWEFQRLNPG GYDDEEEPDE LVGQP
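
The protein sequence can be checked against the gene sequence by translating it, and the GC content can be recomputed the same way. The following is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 shares these assignments; alternative start codons are NOT handled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above, as a small worked example.
first_blocks = "ATGACGTCGA CGAGTCGAGC CGAGACGGTC"
print(translate(first_blocks))  # MTSTSRAETV
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives an end-to-end check; `gc_fraction` on the full sequence can be compared with the GC content field in the same way.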