Gene Francci3_3825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3825 
Symbol 
ID3905573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4585943 
End bp4587385 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content75% 
IMG OID637881151 
Productaldehyde dehydrogenase 
Protein accessionYP_482904 
Protein GI86742504 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.344076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0683109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC CGACGCCGTT CTGGCTGGCA GGCAAGCCGG CCACCGGGTC GGCGACGACC 
ACCGTTCGGC ACCCGTTCGA CGGCGCGGAG GTCGCCGTCG TCGCGCAGCC GGACCCCGAC
CAGGTTGAGA CGGCCGTGGC CGCCGCGGCG GCGGTCGCCC CGGCGTTCGC GGCGCTGCCG
GCCCACGTCC GCGCCGGCGC GCTCGCCGGC GTGTCGAAGG AGATCGCCCG TCGAGGCGAC
GAACTGGCCC GTCTCATCAC CGCGGAGAGC GGTAAGCCGC TGACCTGGTC GCGTGCCGAG
GTGGCCCGGG CAAGCTCCAC CTTCCGCTGG GGGGCGGAGG AGGCCCGGCG TTTCGCCGGG
GAGCTGACCA GGCTGGACAC CGATCCCCCC GGGGAGGGAC GGCTCGCGCT CACCCGGCGT
TTCCCCCGCG GGCCCGTGCT GGGCATCACC CCCTTCAACT TCCCGCTCAA CCTGGTCGCC
CACAAGGTGG CGCCGGCGCT GGCGGTCGGC GCGCCGATCA TCGTGAAGCC GGCCCCGCGC
ACGCCGCTGT CGGCGCTGTT CCTCGGCGAC CTGCTCGCCG ACGCCGGCTT GCCCGAGGGG
TCGTGGTCGG TCCTCCCGAT CCCCAACGAC CGCCTCGGCC CGCTCGTCGC GGACCCTCGG
CTGCCCGTGG TGTCCTTCAC CGGCTCCGGC CCGGTGGGCT GGTCCATCCG CGACACGGTC
CCCCGCAAGC ATGTGGTCCT CGAACTCGGC GGGAACGCGG CGGTGCTGGT CGCCGCCGAC
TACGCGACCC CCGCCGATCT GGCCCGGGCC GCCGGCCGCA TCGCCCTGTT CGCCAACTAC
CAGGCCGGCC AGTCGTGCAT CGCCGTGCAG CGGGTCTACG CCGACCGCAC GATCGTCGAC
GAGCTGCTCG CCGAGATCGT CGCCGCCGTG CGGGCGCTGC ACGACGGGGA TCCGGCCGAC
CCGGCCACCG ACGTGGGGCC GCTGATCGAC GTCGCGGCGG CGGAACGGGT CGAGGCGTGG
ATCACCGAGG CCGTCGAGGC GGGGGCGACA CTCGTCTGCG GCGGGACGCG CCACGGCACG
AGTCTCAGCC CGGCCGTGCT GACCGGCGTT CCGCCCACCG CCAAGGTGGT GAGCGAGGAG
GTCTTCGGCC CGGTGATCGT CGTGGCGGCG GTCGACGGCG TTGACGAGGG CTTCGCCCGG
ATCAACGACA GCGCCTACGG CCTGCAGGCC GGGGTGTTCA CCCACGACCT GGCCACCGCC
TTCCGCGCCC ACCGGGAGCT CCAGGTCGGG GGGGTCGTCA TCGGCGACGT CCCGTCGTAC
CGGGCCGACC AGATGCCCTA CGGCGGGACG AAGGGTTCCG GCATCGGGCG GGAGGGGGTC
CGCTCGGCCA TGACCGACCT CACCGAGGAC CGGGTGCTGG TCCTGACCGG TCTGGACCTG
TAG
 
Protein sequence
MSEPTPFWLA GKPATGSATT TVRHPFDGAE VAVVAQPDPD QVETAVAAAA AVAPAFAALP 
AHVRAGALAG VSKEIARRGD ELARLITAES GKPLTWSRAE VARASSTFRW GAEEARRFAG
ELTRLDTDPP GEGRLALTRR FPRGPVLGIT PFNFPLNLVA HKVAPALAVG APIIVKPAPR
TPLSALFLGD LLADAGLPEG SWSVLPIPND RLGPLVADPR LPVVSFTGSG PVGWSIRDTV
PRKHVVLELG GNAAVLVAAD YATPADLARA AGRIALFANY QAGQSCIAVQ RVYADRTIVD
ELLAEIVAAV RALHDGDPAD PATDVGPLID VAAAERVEAW ITEAVEAGAT LVCGGTRHGT
SLSPAVLTGV PPTAKVVSEE VFGPVIVVAA VDGVDEGFAR INDSAYGLQA GVFTHDLATA
FRAHRELQVG GVVIGDVPSY RADQMPYGGT KGSGIGREGV RSAMTDLTED RVLVLTGLDL