Gene Franean1_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2010 
Symbol 
ID5670411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2415955 
End bp2417340 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content69% 
IMG OID641240931 
Producthypothetical protein 
Protein accessionYP_001506353 
Protein GI158313845 
COG category[I] Lipid transport and metabolism
[S] Function unknown 
COG ID[COG1946] Acyl-CoA thioesterase
[COG2343] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00189] acyl-CoA thioesterase II 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.347959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.266903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTCG ACTCCGGATT CCTCGTCCCA ACCCTGGGCA CAGTGTTAGC TTCCTGTGAC 
GTGCAGCCTG TAACGGCCGA CGCGACGTCG GCAAGCGATC CGGCCTTAAC TCGGCAGCCA
TGCCCCTACC CGGCCCGAGC GTGGTGGGGT GACACCCTGG TCGCGGAGTC GACAGCCGCC
GTCCGGGTGC AGGAACCGGG GCAGGTCCCC GTCCTTTACT TTCCACGCTC CGACATCCGG
TTCGACTCCT TCGGAGACGA GGGCCGGACG GTGTCGTGCC CGGTGAAGGG CACGGCGCGC
CTGTGGTCCC TCGAGGGGAA GGTCGACGCC GACTCCGTCG ACTGGCACGA CCCCGCCAGC
ACGACGACGG TCGACGGCCG GGACGCGCTG TGGGCCTTCA CCCAGCCGGC TTCCGGCCTG
GAGTGGCTGA CCGACTTCGC CGCCTTCGAC CACGACCGGG TCCGAGTGGA GATCGTCGAC
GAGCTCCCCG GCGAGCAGCC CCGCGACAGC ACGATCAAAC GCTTCCCCAC CTGGGGCGAC
GCCTCAGATC TCATCGACAT CATGAATGTC CGCCAGGCCG GCGACCGCAG ATACGTCAGC
GTGGCCCTTG CTGATCCCCG GCGGCCAGTC GTCGAAGGCA GCCAGATGCT CGGGCAGGCT
GTGGTGGCCG GCGGCCGCCA CGCGCCCGGG CGGCGGGTTG TCTCGGCGCA CATGGTGTTC
TACCGGGCAG CGGACGCGCG CGAACCTCTC GAGTTCGAGG TGGACGAGCT CAGCTCCGGC
CGGAGCTTCA CAACGCTGGC AGTACACGTC TCCCAGCGCG GAAAGCGGCG CGCCAGCGGC
ACCCTGCTGC TCGATGTCAC CGCACCGGAC GTCGTCCGCC ACTCCGCGGC CCCGCCGCCA
TCCGCGGGCC CCTACGACAG CAAGCCCTAC GACATGTCCG TGACCGGACG CGACATCCGC
ATGGTCGACG CGGCGTACAC CGACGACCCG GCCGCGCCCG TGGGTCCTCC GGTCATCGGC
ACATGGGTCC GGTTCCGCAC GGTGCCGGAC GACCCGTGCC TACAGGCCGG GCTGCTCGCC
CAGTTCACCG GGCACATCTC CATCGCCGCC GCCCTGCGCC CGCATGAGGG GGTCGGCCAG
AACCAGGCCC ATCGCACGCT GTCGATGGGG ATCAACGCGA TTGGCATCTC CTTTCACTCC
AATGTCCGGG CCGACCACTG GATGCGCTAT CATCATCTGT CGACCTTCGC CGGCGACGGA
ATGACCCATT CCGAATGCCG GGTCTACGAC GAGGCCGATG CGCTGATCTC GTCGTTCACC
GTCGACGCCA TGCTGCGGGG CTTCAGGAAT GACGGGCGCG CGGTCGACGA CAGGACCGCG
CTGTGA
 
Protein sequence
MPLDSGFLVP TLGTVLASCD VQPVTADATS ASDPALTRQP CPYPARAWWG DTLVAESTAA 
VRVQEPGQVP VLYFPRSDIR FDSFGDEGRT VSCPVKGTAR LWSLEGKVDA DSVDWHDPAS
TTTVDGRDAL WAFTQPASGL EWLTDFAAFD HDRVRVEIVD ELPGEQPRDS TIKRFPTWGD
ASDLIDIMNV RQAGDRRYVS VALADPRRPV VEGSQMLGQA VVAGGRHAPG RRVVSAHMVF
YRAADAREPL EFEVDELSSG RSFTTLAVHV SQRGKRRASG TLLLDVTAPD VVRHSAAPPP
SAGPYDSKPY DMSVTGRDIR MVDAAYTDDP AAPVGPPVIG TWVRFRTVPD DPCLQAGLLA
QFTGHISIAA ALRPHEGVGQ NQAHRTLSMG INAIGISFHS NVRADHWMRY HHLSTFAGDG
MTHSECRVYD EADALISSFT VDAMLRGFRN DGRAVDDRTA L