Gene Francci3_1038 details


Gene Information

Locus tag: Francci3_1038
Symbol:
ID: 3906706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1232996
End bp: 1234588
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 70%
IMG OID: 637878371
Product: aldehyde dehydrogenase
Protein accession: YP_480149
Protein GI: 86739749
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
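
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and only serve the illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1232996, 1234588)
print(length)                  # 1593
print(protein_length(length))  # 530
```

Both values agree with the Gene Length and Protein Length fields of this record.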


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.15563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCA ATATCACCGG TATGGCCGCA GCATCCAGCA GCTCCCAGCC ACGACTCCAC 
ATTAAGCCGG GTACCACCTG GTCCGAGGCG TTCACCACCT GCCGCCAGGT TGCCCCCGAG
GCGTTCGACG AGGACCGCCT CCGTAATCTC TGGGGCGGGC ACTGGCATCG AACTGGAAAT
CCACTGCACA GCCTGTCCCC CGTCGACGGC ACACCGATCG CCGGGCCCCC GATGATCGAG
CAGGACGAGG CCCGGGATGC GATCCGGGCC GCTCTCGACG ACCACAAGCA CTGGCGCGAC
GTGGCGCTGC CCGAGCGCAA GGCCCGAGTA CGCGCCGCCG TTGACGCCAT GGACGCCCAC
CGCGACCTGC TGGCTCTCCT GCTCGTCTGG GAGATCGGCA AGCCCTGGCG ACTGGCGCGC
ACCGACGTCG ACCGTGCCCT CGACGGGGTG CGCTGGTACA TCGACGAGAT CGACGGCATG
ATCGCCGGGC GCACCCCGCT ACCCGGTCCG GTGAGCAACA TCGCCAGCTG GAACTACCCG
ATGAGCGTGC TCATGCATGC CGTGCTCGTC CAGATCCTCG CCGGCAACGG CACCATCGCC
AAGACCCCGA CCGACGGCGG CGCCGCCTGC CTGACCCTGG CGAGCGCACT GGCCCGCCGG
GAGGGTCTGC CGGTCTCGCT GGTCTCCGGA TCGGGTTCCC GGCTGAGCTC GGCGCTGGTC
CGCGCCCCCG AGATCGGCTG CCTGGCCTTC GTGGGCGGCC GGTCGGCGGG TGGCCAGGTC
GCCGCTGCCC TGGTCGACAC CGGTAAGCGC CACTTCCTAG AGCAGGAGGG CCTCAACGCC
TGGGGTATCT GGGATTTCTC CCAGTGGGAC GTGCTCGCCG CCCATCTGAA GAAGGGCTTC
GAGTACGGCA AGCAGCGCTG CACCGCCTAC CCCCGCTACG TCGTCCAGCG CCAACTGTTC
GACAGGTTCC TCCAGATGTA CCTGCCGGTG GTCTCCGGCG TCCGGTTCGG CCATCCCCTC
GCCGTCGCCG ACGACACCGA CCAGCTGCCC GAGCTGGACT ACGGCCCGGT CATCACCGCG
GACAAGGCGA CCGAGCTGGC CGCGAAGATC GACGAGGCGA TCACGAAGGG CGGCGTGCCG
ATCTACCGCG GAGACCTCGC GGACGGCCGG TTCATCCCCG GGCAGGACAC CTCCGCCTAC
GTGCCGCCGG TGGCGATCCT GAGCCCGCCG GCGTCGTCCG CGCTGCACCA CGCGGAGCCG
TTCGGCCCGG TCGACACCAT CGTCGTCGTC GACTCGGAGG CTGAGCTGCT GGCGGCGATG
AACGCCTCCA ACGGCGCGCT GGTTGCCTCA CTGGCCTGCG ACGACGAGTC CCATGCCCGC
CGGCTCGCCG GGGAGCTGCA GGCGTTCAAG GTCGGAGTCA ACAAGCCGCG CTCCCGCGGC
GACCGGGACG AGCCGTTCGG TGGCCGCGGC GCCTCCTGGA AGGGGGCCTT CGTCGGCGGT
GTCCATCTGG CCAGGGCCGT CACCGTAGGC ACGGACGCCG ACGAGCGGCT CGTCGGCAAC
TTCCCGAGCT ACTCCCTCTA CCCGGCCGTC TGA
 
Protein sequence
MTVNITGMAA ASSSSQPRLH IKPGTTWSEA FTTCRQVAPE AFDEDRLRNL WGGHWHRTGN 
PLHSLSPVDG TPIAGPPMIE QDEARDAIRA ALDDHKHWRD VALPERKARV RAAVDAMDAH
RDLLALLLVW EIGKPWRLAR TDVDRALDGV RWYIDEIDGM IAGRTPLPGP VSNIASWNYP
MSVLMHAVLV QILAGNGTIA KTPTDGGAAC LTLASALARR EGLPVSLVSG SGSRLSSALV
RAPEIGCLAF VGGRSAGGQV AAALVDTGKR HFLEQEGLNA WGIWDFSQWD VLAAHLKKGF
EYGKQRCTAY PRYVVQRQLF DRFLQMYLPV VSGVRFGHPL AVADDTDQLP ELDYGPVITA
DKATELAAKI DEAITKGGVP IYRGDLADGR FIPGQDTSAY VPPVAILSPP ASSALHHAEP
FGPVDTIVVV DSEAELLAAM NASNGALVAS LACDDESHAR RLAGELQAFK VGVNKPRSRG
DRDEPFGGRG ASWKGAFVGG VHLARAVTVG TDADERLVGN FPSYSLYPAV