Gene Franean1_4517 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4517 
Symbol 
ID5672866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5389656 
End bp5390843 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content70% 
IMG OID641243382 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_001508798 
Protein GI158316290 
COG category[C] Energy production and conversion 
COG ID[COG1804] Predicted acyl-CoA transferases/carnitine dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.196524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGC TCGAAGGCGT GAAGGTGATT TGTGTGGGGC AGTTCTACTT TGCCCCTTAT 
TGCTCGATGC TGATGGCGCG CCTCGGCGCG GACGTCATCA AGGTCGAGGC GCCGGAGGGG
GACCCGTACC GCCGGCTGCC CACCGTCGAC CACGACGGCT TCCCGATCCA GTTCCGGTTC
CTCAACTCCG GCAAGCGCGC CATCCGGCTG GACCTCAAAC AACCTGCCGG GCAGGAGATA
CTCCGTAACC TCGTCCGGAC CGCCGACGTG CTCGTCCAGA ACCTGTCGCC GGGGGCGATG
GACCGGCGCG GTCTCGGCTA CAAGCAGCTC AGCGCGATCA ACCCGGGCCT GATCATGGCG
TCGGGCACAG GCTTCGGGTC GTTCGGGCCC TATGCCGGCG AGCCGGCGAT GGATCTCACG
ATCCAGGCGC GCAGCGCGAT CATGAGCACC ACGGGGTTCG CCGACGGGGC GCCGGTCCGC
ACCGGCCCGT CGGTCGTCGA CTTCGTCGCG GGCACGCACA TGCTCGGGGG TGTGCTCGCC
GCGCTGTTCC AACGCACCCG CACCGGCCGC GGTCAGCATG TCGAGGTGGC CCTGCAGGAC
GCCATCGTCC CGTCGCTGAC GTCCAACATC GCCGGGCTGC TGAGCAGCGC GACCGAGAGC
CACGAACGCA CGGGCAACCG GCACGGCGGG CTGGCCGTCG CCCCGTACAA CGCCTACCGC
ACCAACGACG GGTGGATCGC CGTACTGTGC CCGACCGACG CGCACTGGCG GCGGCTGTGT
GAGCTGATGG GGGATCCCGC CACCGACGAC CCGCGCTTCG CGGACATGAG CAGCCGGTGC
GCCCACATAG ACGACGTCGA CGCGGTCGTC GAGAACTGGA CGAGGGCCCG CCCCAAGGAC
CTGCTGGCGC GGATGCTGGT GGAGGCACGC ATCCCCTCCG CTCCGGTCGT CACCCTGCCG
GAGCTGCTCG AGGACCCGCA CGTACGCGAG CGCGGCGTGC TTCGCACTGT CACCGACGAG
CAGGGCTCGT TCATGACGCT CGGCAGTCCG CTGTTCCTGT CGGACTCGCC CATGGTGGAG
CCGTGGCGGG CGCGCGAGGT CGGCGCCGAC ACCGACGAGG TCCTTACCGC GGAGCTGGGC
ATGTCCGTCG ACGACATCGC CAAGCTGCGG GAGGCCGGGG TCATCTGA
 
Protein sequence
MTALEGVKVI CVGQFYFAPY CSMLMARLGA DVIKVEAPEG DPYRRLPTVD HDGFPIQFRF 
LNSGKRAIRL DLKQPAGQEI LRNLVRTADV LVQNLSPGAM DRRGLGYKQL SAINPGLIMA
SGTGFGSFGP YAGEPAMDLT IQARSAIMST TGFADGAPVR TGPSVVDFVA GTHMLGGVLA
ALFQRTRTGR GQHVEVALQD AIVPSLTSNI AGLLSSATES HERTGNRHGG LAVAPYNAYR
TNDGWIAVLC PTDAHWRRLC ELMGDPATDD PRFADMSSRC AHIDDVDAVV ENWTRARPKD
LLARMLVEAR IPSAPVVTLP ELLEDPHVRE RGVLRTVTDE QGSFMTLGSP LFLSDSPMVE
PWRAREVGAD TDEVLTAELG MSVDDIAKLR EAGVI