Gene Franean1_3147 details

Gene Information

Locus tag: Franean1_3147
Symbol:
ID: 5671524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3702942
End bp: 3704090
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 64%
IMG OID: 641242042
Product: acyl-CoA dehydrogenase domain-containing protein
Protein accession: YP_001507462
Protein GI: 158314954
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
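
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that contributes no residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3702942
end_bp = 3704090
gene_length_bp = 1149
protein_length_aa = 382

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the terminal stop codon adds no residue.
codons = span_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span vs. reported gene length
print(span_bp % 3 == 0)               # CDS length should be a multiple of three
print(residues == protein_length_aa)  # codons minus stop vs. reported protein length
```

Under these assumptions all three checks print `True` for the values listed here.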


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.880335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCA CGCTTTACAC CGAAGACCAC GAGATCTACC GGCAGACCGT CCAGGAGTTC 
CTCGAGCGCG AGATTGTCCC CCATAAGGAC CGCTGGGACG ACGAGGGCAT GATCGACCGC
GGCGTCTACG GATCCGCCGC GCAGACAGGG CTCCACGCGC TCGCCGTCCC CGAGGAGTAC
GGCGGCGCGG GCGAGCGGGA CTTCCGCTAC CGACTGGTCG TCTGCGAGGA GATCGGACGG
ATCAACGCCC TGTCGTTCGG GCTCACTCTG AGCGTTCAGG ACGACCTCGT GCTGGCCTAC
CTGGCCGATC TGACCAACAA TGAGCAGCGT AAGCGCTGGC TTCCCGGGTT CGCCTCCGGC
CAGCTCCTCG GCGCTCTTGC CATGACCGAA CCGGGGGCGG GTAGTGACCT GCGCGGAATA
TGCACCTCGG CTAGGCGCGA CGGTAACAGC TGGGTGATCA ACGGGCAGAA GACTTTCATC
TCCAACGGTA TCAGCGCAGA TCTGATTGTC CTCGCGGCCT GCACCGACCC CGACGCCGGG
TCACGCGGAT TCAGTCTGTT CGTCGTCGAG CGCGACACCC CCGGCTTCGA TCGCGGGTGC
AGGCTCGACA AAATCGGTCT CTCGGGTCAG GACACCGCCG AACTGTTCTT CGGCGATGCG
CGGGTGCCCG CCGAGAACCT GCTGGGTGAA GAGGGTCGCG GCCTGCAGTA CCTGATGAGC
CACCTGCCCC GCGAACGCCT GGGGATCACC GCCATGGCGA TCGGATCGGC TCGCGCGATC
TTTGATGCGA CGCTCGAGTA CTGCAAGCAA CGCTCGGCCT TCGGTCGACC CATCTCGGAC
TTCCAGAACA CCCGGTTCGA GCTTGCGACG ATGGCCACCG AGCTCGACCT CGCCGAGACA
TACGTCGACG CGTCGGTGCG TGCCTACAAC GAGGGCACGC TCTCTCCCGT GGATGCGGCC
AAGGGTAAAT GGTGGATCAC CGAACTGCAG AGACGCGTGA TCGACCGGTG CCTTCAACTC
CACGGTGGTT ACGGTTTCAT GCGCGAGTAC CCGGTCGGCA GGGCCTTCGT CGACTCTCGC
GTCCAGACGA TCTATGGCGG GACGACCGAG ATCATGAAGG ACCTGATCGG CCGCGACCTC
ACAAGGTGA
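
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the space-separated 10-base blocks used here; `gc_fraction` is a hypothetical helper name, not something provided by the source database.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    # Remove all whitespace so the 10-base blocks join into one string.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Small illustrative input (not taken from the record): 2 of 4 bases are G or C.
print(gc_fraction("AT GC"))  # 0.5
```

Passing the full gene sequence text to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the reported GC content.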
 
Protein sequence
MKRTLYTEDH EIYRQTVQEF LEREIVPHKD RWDDEGMIDR GVYGSAAQTG LHALAVPEEY 
GGAGERDFRY RLVVCEEIGR INALSFGLTL SVQDDLVLAY LADLTNNEQR KRWLPGFASG
QLLGALAMTE PGAGSDLRGI CTSARRDGNS WVINGQKTFI SNGISADLIV LAACTDPDAG
SRGFSLFVVE RDTPGFDRGC RLDKIGLSGQ DTAELFFGDA RVPAENLLGE EGRGLQYLMS
HLPRERLGIT AMAIGSARAI FDATLEYCKQ RSAFGRPISD FQNTRFELAT MATELDLAET
YVDASVRAYN EGTLSPVDAA KGKWWITELQ RRVIDRCLQL HGGYGFMREY PVGRAFVDSR
VQTIYGGTTE IMKDLIGRDL TR
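
The protein sequence follows from the gene sequence by codon translation. The sketch below builds the codon table from the standard 64-character amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes an unambiguous A/C/G/T sequence already in the coding orientation.

```python
from itertools import product

# Codons are enumerated in T, C, A, G order so that they line up with the
# conventional 64-character amino-acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to lay out the 10-base blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAAACGCA CGCTTTACAC CGAAGACCAC"))  # MKRTLYTEDH
# Final block of the gene sequence, ending in the TGA stop codon.
print(translate("ACAAGGTGA"))  # TR
```

Both outputs match the first and last residues of the protein sequence shown here.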