Gene Franean1_4191 details


Gene Information

Locus tag: Franean1_4191
Symbol:
ID: 5672546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4985673
End bp: 4986881
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 74%
IMG OID: 641243064
Product: stearoyl-CoA 9-desaturase
Protein accession: YP_001508481
Protein GI: 158315973
COG category: [I] Lipid transport and metabolism
COG ID: [COG1398] Fatty-acid desaturase
TIGRFAM ID:
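
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4985673, 4986881)
print(length)                  # 1209
print(protein_length(length))  # 402
```

Both results agree with the Gene Length and Protein Length fields of this record.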


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0555236
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCATG ACCGTACGGG CGTGAGTCCG CCCGTGCTAC TGACTGTCGG TCCGGCCACA 
GCCACGTCCC GGGCATCACT GCCGGCCGTG CCGGTGGAGC CAGCCCCGCC AGCCGTCGCC
GCCGTGCCGG TGGAGCCGTC CCGGCCGGTC GCGACGGCCG TCGCGGTGCG GCCGCCCACG
CCGGTGGGCC CGGCGGCGGC GTGCCCACCG CCGGTGAGCA CCGCGGGCAG GCGGACCTTG
GCCAAGGCCG CCGTGGGCGC GGCCGCCGCG GGGGAGAGCG CCGCGTCGCC GCGGGCGGCG
GGCGCCGCGC CCGAGCTGGA TCCGCGGCCG AAGTCCCGCG CCGAGCAGAT CCTGCTCGCG
CTCTTCCTGG GTGGGCCGCT GCTGGCGGTC GCCCTCGCTG TCCCGTTCGC CTGGGGCTGG
GGTCTGAGCT GGCACGACGT CGTCATCGGC GCGGTGATGT ACGTCATCGG CGGTCTTGGG
ATCACCGTCG GCTACCACCG GCACTTCACC CATGGCGCGT TCAAGGCGCG CCGGGGCCTG
CGGATCGCGC TCGCTCTCGC CGGCAGCATG GCGATCGAGA TGAGCGTGAT CGACTGGGTG
GCCGCGCACC GGCGCCACCA CCGCTTCTCC GACCGTGACG GCGACCCGCA CTCCCCGTGG
CGTTTCGGGC CCGGTACCCG GTCGCTGGCG CGCGGTCTGC TGCACGCGCA CGTCGGGTGG
CTGTTCGCGC CGCGGCGGAC CAATGCCCAG CGGTACTGCC CGGACCTGCT CGCCGACCGG
GACATCCGGC GGATCTCGGA CCGCTTCGGC TGGCTCGTCG CCGTGTCGAT GCTTCTCCCG
CCGCTGGTCG GCGGCCTGTG GGCGGGGTCG TGGACGGGCG CGCTCACCGC GTTCTTCTGG
GCGTCGCTGG TCCGGGTCTT CCTGTTGCAC CACGTGACCT TCTCGATCAA CTCGATCTGC
CACGTCCTGG GCGCCACGCC GTTCGCCACC CGGGACCACT CGGGCAACGT GTGGTGGCTG
GCCGTCCCGT CGTTCGGGGA GGCGTGGCAC AACCTGCACC ACGCGGACCC CACCAGCGCG
CGGCACGGTG TGCTGCGCGG CCAGATCGAC CTCAGCGCCG GGCTGATCGC GGTGTTCGAG
CGTCTCGGCT GGGCCCATGA CGTGCGCTGG CCCGACAGCG AGCGGATCGC GGCGAAGCGC
GCCGCCTGA
 
Protein sequence
MSHDRTGVSP PVLLTVGPAT ATSRASLPAV PVEPAPPAVA AVPVEPSRPV ATAVAVRPPT 
PVGPAAACPP PVSTAGRRTL AKAAVGAAAA GESAASPRAA GAAPELDPRP KSRAEQILLA
LFLGGPLLAV ALAVPFAWGW GLSWHDVVIG AVMYVIGGLG ITVGYHRHFT HGAFKARRGL
RIALALAGSM AIEMSVIDWV AAHRRHHRFS DRDGDPHSPW RFGPGTRSLA RGLLHAHVGW
LFAPRRTNAQ RYCPDLLADR DIRRISDRFG WLVAVSMLLP PLVGGLWAGS WTGALTAFFW
ASLVRVFLLH HVTFSINSIC HVLGATPFAT RDHSGNVWWL AVPSFGEAWH NLHHADPTSA
RHGVLRGQID LSAGLIAVFE RLGWAHDVRW PDSERIAAKR AA
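
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such text could be joined into one string and its GC fraction computed, the snippet below uses two hypothetical helpers, `join_blocks` and `gc_fraction`. The example input is only the first two blocks of the gene sequence, so the value it prints is not the whole-gene GC content reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from block-formatted sequence text and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = "ATGTCGCATG ACCGTACGGG"
seq = join_blocks(example)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.6
```

Pasting the full gene sequence into `join_blocks` would allow the same check against the Gene Length and GC content fields.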