Gene Franean1_2496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2496 
Symbol 
ID5670892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2973214 
End bp2974509 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content74% 
IMG OID641241413 
Productalpha/beta hydrolase fold 
Protein accessionYP_001506834 
Protein GI158314326 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG0456] Acetyltransferases
[COG2267] Lysophospholipase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.98231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACTG ATGCGGATCA CCCCGTCGGG GATCGCGCGG AGCTGACCCT GCGCGCCGCC 
GGGCCGGGTG ACGTCGCCGC GCTTGTCGAG CTGATCGAGT CCGCGTACCG GGGCGAGCGC
AGCCGGGTTG GCTGGACCAC CGAGGCCGAC CTGCTGGGCG GCCAGCGCAC CGACCCGGAG
ATGCTCGCCG CCGCGCTCGC CGAGCCCGAC ATCCGGATGC TCCTCGCGCT CGGCCCGGCC
GGGGAGCCGG TCGGCTGCTG CCAGCTCCAG CGGCGCCCGG ACGGCGCCTA CTTCGGCATG
TTCGCCGTCC AGCCGGACAT CCAGGGGCAA GGGATCGGGG ACCGTCTGCT CACCGCCGCG
GAGGCACTCG CCCGCGACGA GTGGGCGGCG GCCCGGATGG AGCTGTACGT CATCTCGCTG
CGCGCCGAGC TGATCGCCTG GTACGAGCGG CGCGGCTACC GGCGTACGGA CCGCCACGAG
CCCTTCCCCT ACGGGGACAC CCGGTTCGGC GTGCCGCTGC GCGACGACCT GGTCTTCGCC
GTCCTGGAGA AGGACCTCGG CCACCGGGTC GACGTCGGCG GCCTCGCGCT GCACGTCGAG
ACCTGGACGG GCCAGGCGCA CTCATCCACC CCGCTGCTGC TCCTGCACGG CATCGGTGGG
AGCACCAGGG ACTGGGCCGG GGTGTCCCGC GAGCTCGCCG GCGCCGTGTC GAGCCGGGTC
GTGGCCTACG ACCACCGCGG CCACGGAACC AGCGGGCGGG CGGCCCGCCC GGAGTACACC
TTCGACCACC TCGTCCGCGA TCTCGAGACC GTCGTCGCGA CGCTGGAGCT GGCGCCGCTG
CACCTGCTGG GGCATTCGAT GGGCGGGGTG GTCGCGCTCC GGTACGCGCT GGCCCACCCC
GAGGCCGTCC GGTCGCTGAT CCTGATGGAC ACCGCCGCGG CGCCGGCGGC GGGTGATCAT
CTGCTGTCCC GGCTGGGCAT GGGCGCGCTC ATGGAGGGCA TCGCCGCCGC GACCGCGCTG
CTGGGGCACG GGGACCACGC CGACCCCGCC GCCCTCGCCG CCTTCGGCCA CGAGCTCAAC
GCCTACCCCT CGATGATCGA CCGGCTGGGC GAGATCCGCT GCCCCACGAC GGTCATCGTC
GGCGAGCGGG ACGTCCTGCT GCGTGGTGCC GCGCGGGATC TGGCCGGCGC CATCGAGGGT
GCCCGGCTCG CGGTGATCGC CGGTGCCGAT CACAACCCGC AGGCCAGTCA CCCACAGGCC
TGGCTCAGCG CGGTGGAGCG GCACGCCGCC TTCTGA
 
Protein sequence
MATDADHPVG DRAELTLRAA GPGDVAALVE LIESAYRGER SRVGWTTEAD LLGGQRTDPE 
MLAAALAEPD IRMLLALGPA GEPVGCCQLQ RRPDGAYFGM FAVQPDIQGQ GIGDRLLTAA
EALARDEWAA ARMELYVISL RAELIAWYER RGYRRTDRHE PFPYGDTRFG VPLRDDLVFA
VLEKDLGHRV DVGGLALHVE TWTGQAHSST PLLLLHGIGG STRDWAGVSR ELAGAVSSRV
VAYDHRGHGT SGRAARPEYT FDHLVRDLET VVATLELAPL HLLGHSMGGV VALRYALAHP
EAVRSLILMD TAAAPAAGDH LLSRLGMGAL MEGIAAATAL LGHGDHADPA ALAAFGHELN
AYPSMIDRLG EIRCPTTVIV GERDVLLRGA ARDLAGAIEG ARLAVIAGAD HNPQASHPQA
WLSAVERHAA F