Gene Franean1_4294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4294 
Symbol 
ID5672649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5131585 
End bp5132691 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content71% 
IMG OID641243167 
Productacyl-CoA dehydrogenase domain-containing protein 
Protein accessionYP_001508584 
Protein GI158316076 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.83932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.79155 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACATCG CGGAGTTCCG GTCGGACGTG CAGGCCTGGC TGGCCGAGAA CGACCTGACC 
CCAGGACCCG ACCACTCCCT CGACGGGCAG GTGGCCCAGC TCGCGCGGGT GCGCCGTGCC
CTGTACGACG CGGGCTGGAT GCGCCACGGC TGGCCCACCG AGGTCGGCGG CCTCGGTGGC
CCTGCGGTGC TGCGCGCCGT ACTCGGCGAG GAGGTCGCGT CCCGGGACCT CGCCGAGCCC
GGCATCTACT CGATGATCGA GGTGCTGGCG CCGACCATGA TTTCCTATGC GTCGCCGGCG
CTGGCGGCCG AGATGGTGCC GCTGCTGCTC TCCGGCGCCG AGCAGTGGTG CCAGGGGTTC
TCCGAGCCGG GGTCGGGCAG TGACCTGGCG TCGCTGTCCA CCCGGGCGGA GCCCCGTGGG
GACGACTGGG TGGTCAACGG GCAGAAGGTC TGGACGAGCC TGGCCCAGTA CTCCCAGCGC
TGCGTGCTGC TCACCCGCAC CGCACCGGGC CACAGCGGGA TCACGGCGTT CTTCGTCGAC
ATGGACACCC CCGGCATCAC CGTCCGCCCG CTGCGCACCA TGCACGGCAT CGACGAGTTC
GCCGAGGTGT TCTTCGACGA CGTCGTCGTC CCCGGCGACC GCATGCTCGG AAAGCCCGGC
GACGGCTGGC AGCTCGCGAT GGACCTTTTG CCCCACGAGC GCTCCACCTG CTTCTGGCAC
CGGATCGCGT TCCTCTACGA GCGGCTCGAG CGCCTGCTGG ACGAGACCAC CAGCAGAGAC
AGCAGGGACA GCACGGACGA CGCGGATCTG GGTGCGGCCT ACCTGGCGCT GCACACCATC
CGCTGCCGCT CGCACGCCAC CCAGCGCCGC CTCGGCGAGG GCGGGCGGAT CGGGCCGGAG
ACGTCGATCG ACAAGGTGCT GCTCGCCACC GCCGAGCAAC GCCTGTACGA CACCGCCCGC
GACCTGCTCC CCGGCACCAT CGAACTCACC GACACCCGCT GGCGCTCCGA GTACCTCTAC
TCCCGCGCCG CCACCATCTA CGGCGGAACC GCCGAGATCC AGCGGAACAT CATCGCCCGC
CGACTCCTCG ACCTCGGCAA GGAATAA
 
Protein sequence
MNIAEFRSDV QAWLAENDLT PGPDHSLDGQ VAQLARVRRA LYDAGWMRHG WPTEVGGLGG 
PAVLRAVLGE EVASRDLAEP GIYSMIEVLA PTMISYASPA LAAEMVPLLL SGAEQWCQGF
SEPGSGSDLA SLSTRAEPRG DDWVVNGQKV WTSLAQYSQR CVLLTRTAPG HSGITAFFVD
MDTPGITVRP LRTMHGIDEF AEVFFDDVVV PGDRMLGKPG DGWQLAMDLL PHERSTCFWH
RIAFLYERLE RLLDETTSRD SRDSTDDADL GAAYLALHTI RCRSHATQRR LGEGGRIGPE
TSIDKVLLAT AEQRLYDTAR DLLPGTIELT DTRWRSEYLY SRAATIYGGT AEIQRNIIAR
RLLDLGKE