Gene Franean1_3018 details


Gene Information

Locus tag: Franean1_3018
Symbol:
ID: 5671400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3552596
End bp: 3554455
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 70%
IMG OID: 641241920
Product: acyl-CoA dehydrogenase domain-containing protein
Protein accession: YP_001507340
Protein GI: 158314832
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
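
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3552596
end_bp = 3554455


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1860 619
```

Under those assumptions the result agrees with the listed gene length (1860 bp) and protein length (619 aa).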


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.238191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAGGC TGCCAGGGGA GGAGTCGTTC ATGGGGCACT ACCGGAGCAA TCTCCGCGAC 
ATCGAGTTCA ACCTGTTCGA GGTGTTCGGC GTGGACCGGA CGCTCGGGAC CGGGCCCTTC
GCGGAGATGG ACCGCGACAC AGCGCATGAG GTGCTCGCCG AGCTGGAGCG GGTCGCCGTC
GGGCCGATCG CCGAGTCCTT CGAGGACGCC GACCGGCACC CGCCGACACT CGACCCCGCC
ACCGGCACCG TCACACTGCC CGAGTCGTTC CGCAAGTCCT TCGCGGCGGC GCAGGAGGGC
GAGTGGTGGC GGCTCTACCT CCCGCCGCAC CTGGGCGGCA TGGGTGCCCC GCCGACCATC
GCCTGGGCGG CGTCCGAGCT GCTCCTCGGG GCGAACCCGG CGGTGTTCAT GTACATGGCC
GGGCCGACGT TCGCCGCGAT CCTCGAGGGC ATCGGCACGC CATGGCAGAA GAAGATCGCG
AACTGGGCCA TCGAGCGGCA CTGGGGCGCG ACCATGGTGC TGACCGAGCC GGACGCCGGC
TCCGACGTCG GCGCAGGCCG GGCGCGGGCC GTCGAGCAGC CGGACGGCAC CTGGCACATC
GAAGGTGTGA AGCGCTTCAT CACCAGCGGT GACTGGGACA TCCCGGAGAA CATCTTCCAC
CTGGTGCTCG CCCGCCCCGA GGGGCACGGC CCCGGCACCA AGGGCCTGTC GATGTTCGTC
GTGCCCAAGT ACCTGCCCGA CCCGGAGACC GGCGCCCCGG GCGCCCGCAA CGGTGCCTAC
GTCACGAACC TTGAGAAGAA AATGGGCCTG AAGGTCTCGA CCACCTGCGA GCTGCGCTTC
GGCGAGCGTG AGCCGGCCGT CGGATGGCTC GTCGGCGACG TCCACGACGG CATCGCGCAG
ATGTTCCGGG TCATCGAGTA CGCCCGGATG ATGGTCGGCA CGAAGGCGAT CGCCACGTTG
TCCACCGGGT ACCTCAACGC GCTGGCCTTC GCCCGCGAGC GGATCCAGGG CGCCGACCTG
ACCCGGGCGT CGGACAAGAC CGCGCCGCGG GTGGCCATCA TCAACCATCC CGACGTGCGC
CGGATGCTCA TGCAGCAGAA GGCCTACGCG GAGGGGATGC GTGCCCTCGT GCTGTACACC
GCTACCTTCC AGGACGCCGT CACGCTGGCG GCGGCCCGCG GCGAGGTGGA CGACAACGCG
GTGCGGATGA ACGACCTGCT GCTCCCGATC GTGAAGGGAG TCGGCTCGGA GCGCTCCTAC
GAACTGCTGG CGAGCTCGCT GCAGGTCCTC GGCGGCTCCG GATACCTGCA GGACTACCCG
ATCGAGCAGT ACATCCGCGA CGCGAAGATC GACACCCTGT ACGAGGGCAC GACGTCCATC
CAGGGCCAGG ACCTCTTCTT CCGGAAGATC GTCCGGGACA AGGGTCGGGC GCTGACCACT
CTGTTCACGC AGGTCCAGGA GTTTGTGAAG GGCGAGGCGG GCAACGGCGC GCTCCGCGCG
GAGCGGGAGT CGCTGGGCGC GGCGCTCGAC GAGTGCCAGG CCATGGTCGG TTCGATGGTC
GGGTTCCTCA CCGCCACCGC CCAGGACAAG TCCCAGGTCT ACCGGGTCGG GCTGAACACC
ACGCGGCTGC TGATGAGCAT CGGCGACCTC GTCGTCGGCT GGCTGCTGCT GCGCGGTGCC
GACGCGGCGC TACGCGGGCT GGACGGCTCC CCGTCCGAGC GCGACCGGGC GTTCTACGAG
GGCAAGATCG CGGCCGCGCG CTGGTTCGCG GCCAACATCC TGCCGGAGCT GCGCACACGC
CGGACGGTGC TGGAGGGCAC CGGTCTCGGG CTGATGGAGA TGTCCGAAGC GGCGTTCTGA
 
Protein sequence
MPRLPGEESF MGHYRSNLRD IEFNLFEVFG VDRTLGTGPF AEMDRDTAHE VLAELERVAV 
GPIAESFEDA DRHPPTLDPA TGTVTLPESF RKSFAAAQEG EWWRLYLPPH LGGMGAPPTI
AWAASELLLG ANPAVFMYMA GPTFAAILEG IGTPWQKKIA NWAIERHWGA TMVLTEPDAG
SDVGAGRARA VEQPDGTWHI EGVKRFITSG DWDIPENIFH LVLARPEGHG PGTKGLSMFV
VPKYLPDPET GAPGARNGAY VTNLEKKMGL KVSTTCELRF GEREPAVGWL VGDVHDGIAQ
MFRVIEYARM MVGTKAIATL STGYLNALAF ARERIQGADL TRASDKTAPR VAIINHPDVR
RMLMQQKAYA EGMRALVLYT ATFQDAVTLA AARGEVDDNA VRMNDLLLPI VKGVGSERSY
ELLASSLQVL GGSGYLQDYP IEQYIRDAKI DTLYEGTTSI QGQDLFFRKI VRDKGRALTT
LFTQVQEFVK GEAGNGALRA ERESLGAALD ECQAMVGSMV GFLTATAQDK SQVYRVGLNT
TRLLMSIGDL VVGWLLLRGA DAALRGLDGS PSERDRAFYE GKIAAARWFA ANILPELRTR
RTVLEGTGLG LMEMSEAAF
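
The relationship between the two sequence blocks can be illustrated with a short sketch that strips the 10-base grouping, computes GC content, and translates codons. It is not the database's own tooling: the helper names are hypothetical, and the codon table is the standard one, which translation table 11 shares for amino-acid assignments (table 11 differs in the start codons it permits).

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
first_groups = "ATGCCAAGGC TGCCAGGGGA GGAGTCGTTC"
print(translate(first_groups))  # MPRLPGEESF
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would, under the same assumptions, allow the listed GC content and protein sequence to be checked.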