Gene Francci3_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3042 
Symbol 
ID3904395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3609899 
End bp3611809 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content71% 
IMG OID637880362 
Productacyl-CoA dehydrogenase-like 
Protein accessionYP_482128 
Protein GI86741728 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.699042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGG TGGGAGAACG GGAAGCGCGC AGGGTCGCCG AGGCGGCCCG GGAGGCCGAG 
TGGCGGCTTC CCTCGTTTGG CAAGGAGCTT TTCCTCGGTC ATCTCCGGCT CGATCTGATC
TCCCCGCACC CGCGGCCGAA GGAGGGCGAC CGGCGGCGGG GGGAGGCGTT CCTGGCCCAG
CTCGAGTCGT TCCTGCGGGA AACGGTGGAC CCGTTGCAGA TCGAGCGGGA CGGGCGGCTG
CCCGAGCAGG TGATCGACGG GCTGCGGCGG ATAGGCGCCC TCGGCATGAA GATCCCCGAG
GAGTACGGCG GACTCGGGTT GACCACCGTG TACTACAACA GGGCGCTTGA ACTGTCCAGC
TCGTGGCATT CGAGCCTGCC CACGCTGCTC TCGGCGCACC AGTCGATCGG CGTGGCCGAG
CCGGTCCTGG GGTTCGGTAC CGAGGAGCAG AAGCGGACGT ATCTGCCGAA GGTGGCCCGC
GAGGAGATCA GCGCGTTCCT GCTGACCGAG CCCGATGTGG GCAGCGACCC GGCCCGGATG
AGCAGCCAGG CGGTGCCGGT CGAGGGCGGC GCCGCCTACG AGCTGAGCGG CCGCAAGCTG
TGGACCACCA ACGGTGTGAT CGCCGATCGG CTGGTGGTCA TGGCGGTCGT CCCGCGCTCG
GCCGGTCATC GCGGTGGCAT CACGTGTTTC ATCGTCGACG CCCACGCACC CGGCGTGAAG
GTGGAGCACC GCAACGAGTT CATGGGTCTG CGCGGCATCG AGAACGGGGT GACCAGCTTC
GAGCGGGTCC GGGTCCCGGC CGCGGACCGC CTCGGGCGGG AAGGCGATGG GCTCAAGATT
GCCCTGACGA CGCTGAACAC CGGCCGCCTC GCCCTGCCGG CGACCTGCGT GGGCGGCGCC
AAGTACGCGC TGCGCATCGC CCGGGAATGG TCGCGGGACC GCGTCCAGTG GGGCCGGCCC
ATCGGCCAGC ACGACGCCAT CGCCCAGAAG ATCGCGTTCA CGGCCGCCAC CACGTTCGGG
CTGGAGGCGA TGCTGGAGCT GTCCGGCCTG ATGGCCGACG AGGGACGCGG GGACATCCGG
ATCGAGGCTG CCCTGGCCAA GCTGTATGCC AGCGAGATGG CCTGGCTGAT CGCCGATGAG
CTGGTCCAGA TCCGCGGCGG CCGCGGGTAC GAGACCGCCC AGTCCCTGGC CAACCGGGGG
GAACGACCGG TACCGGCCGA GCAGATGCTA CGCGATCTGC GCATCAATCG GATCTTCGAA
GGCTCCACCG AGATCATGCG GTTGCTCATC GCGCGGGAGG CCGTGGACAC CCACCTCGCG
GTGGCCGGCG ACATCATTTC CCCGGACGCG CCGGTCGCGG CGAAGGCCCG CAGCGCCGGA
CGGGCGGCTG CCTTCTACGG CCGCTGGCTG CCCGGCCTCG CCGTCGGCAG TGGGACGTCG
CCGCGCTCGT TCCTCGAATT CGGCCGGCTC AGCACGCACC TGCGCTTCGT GGAGCGCTCG
GCGCGCAAGC TCGCCCGGTC GACCTTCTAC GGGATGGCTC GCTGGCAGGG CGGGCTGGAG
AAGAAGCAGG CGTTCCTCGG GCGTCTCGTC GACATCGGCG CCGAACTGTT CGCCATCAGC
GCGGCGGTCG TGCGGGCCCG GATGCTCGAC GAGGAGGGCG AGCCGACCGC GACCGACCTG
GCCGACCTGT TCTGCCGGCA GGCGCGACGC CGGGTCGAGG CGTCCTTCGC GGCGCTGTGG
CGCAACGACG ACGCCCGCAA CTACACGGCG GCGCAGGAGG TGCTCGCGGG CCGCTTCAGC
TGGGCCGAGT CCGGCGTGAT GGACCCGAGC GGCGAAGGGC CGTTCCTCGC CGCGGAGCGG
CAGCGGGCGG TCCCGACTCG ACCGGCCGTC GACCTGCTCG CCGCGGGCTG A
 
Protein sequence
MAQVGEREAR RVAEAAREAE WRLPSFGKEL FLGHLRLDLI SPHPRPKEGD RRRGEAFLAQ 
LESFLRETVD PLQIERDGRL PEQVIDGLRR IGALGMKIPE EYGGLGLTTV YYNRALELSS
SWHSSLPTLL SAHQSIGVAE PVLGFGTEEQ KRTYLPKVAR EEISAFLLTE PDVGSDPARM
SSQAVPVEGG AAYELSGRKL WTTNGVIADR LVVMAVVPRS AGHRGGITCF IVDAHAPGVK
VEHRNEFMGL RGIENGVTSF ERVRVPAADR LGREGDGLKI ALTTLNTGRL ALPATCVGGA
KYALRIAREW SRDRVQWGRP IGQHDAIAQK IAFTAATTFG LEAMLELSGL MADEGRGDIR
IEAALAKLYA SEMAWLIADE LVQIRGGRGY ETAQSLANRG ERPVPAEQML RDLRINRIFE
GSTEIMRLLI AREAVDTHLA VAGDIISPDA PVAAKARSAG RAAAFYGRWL PGLAVGSGTS
PRSFLEFGRL STHLRFVERS ARKLARSTFY GMARWQGGLE KKQAFLGRLV DIGAELFAIS
AAVVRARMLD EEGEPTATDL ADLFCRQARR RVEASFAALW RNDDARNYTA AQEVLAGRFS
WAESGVMDPS GEGPFLAAER QRAVPTRPAV DLLAAG