Gene Francci3_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3549 
Symbol 
ID3904488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4243822 
End bp4245465 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content73% 
IMG OID637880870 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_482630 
Protein GI86742230 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.230497 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGGA GCCTGATGAG CTCGTCGGAC AGCCTTGATG ATGCGGCGGC GCCCGGAACC 
GGGCCTCGTC CACGCCTGAC CGTCATCGGC ACCGGGTACC TGGGGGCCAC CCACGCGGTG
TGCATGGCGG AGCTTGGCTT CGAGGTGCTC GCCGTGGACG TCGACCATTC GAAGATCGAG
CGGTTGTCCG CGGGGGAGAT CCCGTTCTTC GAGCCCGACC TCGCCGACCT CCTGCGCGCG
AACCTCCGGA CCGGACGGCT GCGTTTCACC ACCTCCTTCG AGGAGATTGC CGAGTTCGGC
GACGTGCACT TCGTCTGTGT GGGCACCCCC CAGCGTGCCG ACGGTTACGG GGCGGACCTG
AGCCACCTGC ACGCCGCCAT CGAGCGCCTC GCGCCGCTGC TGACCCGGCC GTGCCTCGTG
GTCGGCAAGT CGACGGTGCC GGCCGGCACG GCTGCGGGGC TCGCCCGGAC GATCGCGCAG
CTCGCCCCGG CGAACCGCCC GGTCGCGGGC GAGACGGCGC CCGGCGCGGC GGGGGCCGAC
GGGTTGGGCC GCGTCGAGGT GGGTCATGAC GGGTTGGGCC ATGACGAGGT GGACGAGCCC
GCCGCCGACA TCTCCAGCGA TCTCGCCGGC GTCCAGCTGG CCTGGAACCC GGAGTTCCTC
CGGGAGGGCT TCGCGGTGGC CGACACCCTG CGACCGGACC GGCTCGTCTT CGGTGTCGCC
TCCCCGGCTG CGGAGGGAGC GCTGCGGGCC GCTTTCGCGC CGGTGATCGC CCAGGGGGTG
CCGGTCATCG TGACCGACTA CGCGACGGCC GAGCTGGTGA AGACGGCGGC CAACTCCTTT
CTCGCGACGA AGATCTCCTT CATCAACGCG ATGGCGGAGG TGTGCGAGGC GGTCGATGCC
GATGTCCTGA CGCTGGCCGA GGCGCTGTCC CACGACGTGC GCATCGGCGG GAACTTCCTG
CGTCCGGGGG TGGGGTTCGG GGGCGGCTGC CTACCCAAGG ACATCCGGGC CTTCCAGGCC
CGCGCAGACG AGCTCGGGGT CGGTGCGGCG CTGCGTTTCC TGCGTGAGAT CGACGAGATC
AACAATCGTC GCCGGGACCG TGTGGTGGAC CTGGTCACCG CGGCCCTCGA CGGCACGCTG
GTCGGCCGGC GCCTCGTCGT GCTGGGCGCC GCGTTCAAGC CCAACTCCGA CGACGTGCGG
GACTCCCCGG CGCTCGCGGT CGCGGGGCTA CTGGCCGAGA CCGGCGCCGG GGTGACGGTC
GTGGATCCGG TCGCGACGCA CAACGCCCGG CAGGCCCTGC CCGGCCTCGC CTACAGCGAC
TCCGTCGAGG CCGTGGTCGA GGGGGCCGAC GCGCTGGTCC TGCTCACCGA GTGGCGCCAG
TTCGCCGACC TGGACCCGGC CCGGCTCGGG GCCGTGGTCC GGCGCAAGGT CGTGGTGGAC
GCCCGGCACG CGCTCGACGC CGACCAGTGG CGCCAGGCCG GATGGGTCTA CCTGGCCCCC
GGCCGGCCGA CCGGCGTCAT CCCGAGCTCG TTGCCGGAGC AGCGGCACCG GGGGGACGGA
GCGCCGCCGG CGCCGGTCGA GCAGTGCGGC GAGACGGGTG CGCGGCTCGC GGAGGTCGAG
GCGCGCGCGG GGCACCCGGT GTGA
 
Protein sequence
MRRSLMSSSD SLDDAAAPGT GPRPRLTVIG TGYLGATHAV CMAELGFEVL AVDVDHSKIE 
RLSAGEIPFF EPDLADLLRA NLRTGRLRFT TSFEEIAEFG DVHFVCVGTP QRADGYGADL
SHLHAAIERL APLLTRPCLV VGKSTVPAGT AAGLARTIAQ LAPANRPVAG ETAPGAAGAD
GLGRVEVGHD GLGHDEVDEP AADISSDLAG VQLAWNPEFL REGFAVADTL RPDRLVFGVA
SPAAEGALRA AFAPVIAQGV PVIVTDYATA ELVKTAANSF LATKISFINA MAEVCEAVDA
DVLTLAEALS HDVRIGGNFL RPGVGFGGGC LPKDIRAFQA RADELGVGAA LRFLREIDEI
NNRRRDRVVD LVTAALDGTL VGRRLVVLGA AFKPNSDDVR DSPALAVAGL LAETGAGVTV
VDPVATHNAR QALPGLAYSD SVEAVVEGAD ALVLLTEWRQ FADLDPARLG AVVRRKVVVD
ARHALDADQW RQAGWVYLAP GRPTGVIPSS LPEQRHRGDG APPAPVEQCG ETGARLAEVE
ARAGHPV