Gene Francci3_4361 details


Gene Information

Locus tag: Francci3_4361
Symbol:
ID: 3907333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5208561
End bp: 5209856
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 74%
IMG OID: 637881692
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_483436
Protein GI: 86743036
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
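
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 5208561
end_bp = 5209856
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon and contributes no amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1296 0 431
```

Under those assumptions the span matches the listed gene length (1296 bp) and the implied protein length matches the listed 431 aa.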


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.171088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTTCGGG CGGTGGTCAC CGGCGCGGCC GGCTTCCTCG GCGGAGCGCT GGCCCACGCG 
CTGCGGCGGC GCGGTGACGA CGTTGTCGCC ATCGACATCC GGCGCGCGCC GGGTGTCATG
CAGGCCGACG TCAGCCGTCC CGGTGAGTGG GAGAAGGCCT TTTACGGCAC GGAGCTGGTG
ATCCATGCCG CCGCCATCGG GACCGGTGGC GTCGCGGAAC TGACCCCGGT GCGTCCCGGC
CGAGCCGGAG CGCCGGTACG GGTGTCCGAC GCCGAGATCC GGCGGGTGAC CCTCGGCGCC
ACGGCGGCGT TGCTCGACGC CGCGGGTCGG GCCGGGGTCG ACCGCTTCGT CCATCTCTCC
TGCGTGGACG TGCTTGCCCG GACGAGTGGC CCGGTCCCCG GCGGGACCGG GGACCGCAGG
CCCGCGCGAG CCGTCGGCCG ATCATTGGAC GAGACCACGC CGATCGGACT GACCGGCGAC
GTCCGCGCCG ACGCCATGGC CGCCGCGGAA CAGGCGGTCG GGTCGGCCGC TGCGCACGGA
CTCGGCGCGA CCGTCCTACG GATCGGGGAT GCCTACGGTC CCCGGGCCGG TCGCTGGACC
CTCTGGCCGG TGCTGCTGAT GCGGGCCGGG CGGTTCGTCC TCGTCGACGG CGGCTGGGGT
GTGCTCAGCC CCGTGCATGT CGACGACGTG GTCACGGCGG TGACCACCGT CGCCGGTGCG
GACCGGTCGG CGGTCGCCGG TGAGGTGCTG CACGTGACCG GGGGGGAGTC GGTGTCTGCC
GCCGACTTCT TCACCTTTTA CTGTCAGATG CTCGCGATCG CGGCGCCCCG CTCCGTACCG
GCCAGGGTCT TCGGAGCGGT CGACGCGATT GATCGTGTCT GTTCGTTCGC GGAGGGTTAC
CGAAGGCGAG GTGTGGACGA TTCCGCGGCC GAGACGCCAC CCCTCCCCAC CCACCCGGTC
CCGGCGGTCT CGACGACCTC GGTGCGGCCA GCGGGCTCGG CGGCCCCGGC GGCCCCGGCC
GTTCCGGTGA ACTCGGCGAC GTCGCGGCGG GGACCGGGCC TGATCCGCGG TGTAGGTGCC
CGTCTGGTGG CCGGAGTCGA TCCCCGCGCC CGGGTGGACC TCGGTCCGCT CACCGTGCGC
GCCCTGACCC GCGACGCGGG CCTCTCCATC GAGAAGATCC GGGCCCGGAC GGGATGGTCG
CCGGTCGTGC GGCTGCCCGA GGGCATGAAC CGCACCGAGT CCTGGCTCCG GGAACGGGGT
CTGCTCGGGG TCCGGGAGCC GTCCCGTCGT GGGTGA
 
Protein sequence
MVRAVVTGAA GFLGGALAHA LRRRGDDVVA IDIRRAPGVM QADVSRPGEW EKAFYGTELV 
IHAAAIGTGG VAELTPVRPG RAGAPVRVSD AEIRRVTLGA TAALLDAAGR AGVDRFVHLS
CVDVLARTSG PVPGGTGDRR PARAVGRSLD ETTPIGLTGD VRADAMAAAE QAVGSAAAHG
LGATVLRIGD AYGPRAGRWT LWPVLLMRAG RFVLVDGGWG VLSPVHVDDV VTAVTTVAGA
DRSAVAGEVL HVTGGESVSA ADFFTFYCQM LAIAAPRSVP ARVFGAVDAI DRVCSFAEGY
RRRGVDDSAA ETPPLPTHPV PAVSTTSVRP AGSAAPAAPA VPVNSATSRR GPGLIRGVGA
RLVAGVDPRA RVDLGPLTVR ALTRDAGLSI EKIRARTGWS PVVRLPEGMN RTESWLRERG
LLGVREPSRR G
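
The relationship between the gene sequence, the protein sequence, and the GC content can be explored with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino-acid string (which translation table 11 shares with table 1), and the start-codon set is the one NCBI lists for table 11. An initiation codon at position 0 is translated as M, which is why the gene's leading TTG corresponds to the protein's leading M.

```python
# Standard NCBI codon ordering (T, C, A, G); table 11 uses the same
# amino-acid assignments as table 1 and differs only in start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any initiation codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_30 = "TTGGTTCGGG CGGTGGTCAC CGGCGCGGCC"
print(translate(first_30))  # MVRAVVTGAA
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the listed GC content.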