Gene Franean1_4572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4572 
Symbol 
ID5672919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5454842 
End bp5455981 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content75% 
IMG OID641243435 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_001508851 
Protein GI158316343 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.417277 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGG CGCTGGGCGA CGTCACGGTC GTGTGCCTCA GCGCGCTCGG GCCGGTCCCG 
TTCGCGACCA TGCTGCTCGC CGACCTGGGC GCCCGGGTGA TCCGGATCGA CCGGGCCGAC
CGGCCCGGTG GCGTCACCGG CCTGCGGCTC GAGGACGATC CCCGTACCCG GGGGCAGCGC
GGCATCGGGG TCGACGTCCG GCATCCGTCG GGCCGGTCCG TGGTGCTGCG GCTGGTCGAG
ACGGCGGACG TGTTCCTGGA GGGAATGCGC CCCGGTGTCG CCGAGCGGCT CGGGCTCGGC
CCGGCCGAGC TGCTGGCCGT CAACCCCCGC CTGGTGTACG GGCGGGCGAC GGGGTGGGGG
CAGTCGGGCC CGCGCGCTCA ACAGGCCGGG CACGACATCA ATTACGCCGG GCTCGCCGGC
GGCCTGTACC CGACCGGACC GGCCGAGCTG CCGCCGCTGC CGCCGCTCAA TCTGCTGGCC
GACTTCGCCG GCGGCGGTTC CTACCTCGCC CTCGGCGTGC TCGCGGCCCT GCACCACCGC
ACGGGGACCG GACGCGGCCA GGTGGTCGAC GCGGCCATGG TCGACGGCGT CGCCAACCTC
ACGGCGATGA TGCACGGGAT GCTCGCGGCC GGCCTGTGGA GCGATCGCCG CGGCGACAAC
CTGCTCGACG GCGGCGCCCC GTTCTACCGC ACCTACCGCA CCGCCGACGA CGGCTTCGTC
GCCGTGGGCG CGCTGGAGCC GCAGTTCTAC CGGCTGCTGC TGGAGAACCT CGGGCTCGAC
CCCGCGCGGT GGCCGCAGCA CGACCGGTCG ACCTGGCCCG AGCAGGAGCG CGTCCTGGGG
GATCTGTTCG CCGCGCGCAC CCGGGACGAG TGGACGAAGC TGTTCGACGG CGTCGACGCC
TGTGTGACGC CCGTGCTGAG CCTGGCGGAG GCGGCGGCCT CGGCCGAGCT GCGCGAGCGC
GCGACCTTCG TCGAGTGGGA CGGCGTCGCG CAGCCGGCGC CGGCACCCCG GCTGTCGGCG
TCCCCGGCCG TCGAACGTCC CCGGTCGGGC TGGTGCAGCC ATTCGGCCGA GATTCTGACC
GAGCTCGGGC TGACCGAGAC GGAGCGGGCG GCGCTGCGCG ACGCGGGTGT GATCGCTTAG
 
Protein sequence
MTQALGDVTV VCLSALGPVP FATMLLADLG ARVIRIDRAD RPGGVTGLRL EDDPRTRGQR 
GIGVDVRHPS GRSVVLRLVE TADVFLEGMR PGVAERLGLG PAELLAVNPR LVYGRATGWG
QSGPRAQQAG HDINYAGLAG GLYPTGPAEL PPLPPLNLLA DFAGGGSYLA LGVLAALHHR
TGTGRGQVVD AAMVDGVANL TAMMHGMLAA GLWSDRRGDN LLDGGAPFYR TYRTADDGFV
AVGALEPQFY RLLLENLGLD PARWPQHDRS TWPEQERVLG DLFAARTRDE WTKLFDGVDA
CVTPVLSLAE AAASAELRER ATFVEWDGVA QPAPAPRLSA SPAVERPRSG WCSHSAEILT
ELGLTETERA ALRDAGVIA