Gene Franean1_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2020 
Symbol 
ID5670421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2427856 
End bp2429040 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content75% 
IMG OID641240941 
Productacyl-CoA dehydrogenase type 2 
Protein accessionYP_001506363 
Protein GI158313855 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.692035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.294025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCG AGCTCGGTAC GACGACGCCG ACCGTCGACG CGCGGTCGGC CCTCCCGCCG 
GTGGACGTTA CCGACGACGC CCTCGCCACG GTGACCGCCC AGCTCGCCGC CACCGCGGAG
GAACACGACC GCTCGGCCGA GTTCCCCTGG CGGGGCCTGC GGGTGGTCCA CGACGCGGGC
CTGCTGCGCC TCGGCATCGC CCCGCGCTAC GGCGGCGCCG ACCTGAGCGC CGTCGACAGC
ATCCGGGTGT TCGGGGCGTT GGGGCAGGGC GACCCGTCCG TCGCCCTGAT CACCGCGATG
ACCGTGTTCC AGCACGTCCT GCAGAGCAGG GACCCGTGGT GGCCCGACGA GCTCTACCGC
ACGGTCGTCC GCGACTCCCT CGAGCGGCCG GTGCTGGTCA ACGCGATCCG CGCCGAGCAC
GAGCTGGGCG CGCCGGCCCG CGGCGGGCTG CCGGCCACGA CGATCCGGCG GACGGCGTCC
GGCTGGGTCC TCAACGGGCA CAAGGCGTAC GCGACAGGCT CGGAAGGGCT GAGCTACCAC
CTGGTGTGGG CGGCGACCGA GGACGCCGAT CCGCTTCAGG GCCACGCGAT CGTCCCCGGG
GACTCCCCCG GGATCAGGAT CGTGCGCACC TGGGACCATC TCGGGCTGCG GGCCAGCAGC
ACCCACGACG CGATCTACAC CGACGTGGAG ATCCCGGCCG GGAACTTCCA GGGGGCGCCG
GCTTCGGAGC AGCGGGGCGA CGTCGCGATC GGCGGCGTGG CGCTGGGCGC GGCCGCCCTG
TACCTCGGGG TCGCCCGGGC CGCCCGGGAC TTCTTCGCCC GCTTCGCCCA GGAGCGCGTC
CCGACCGTGC TGGGACGTCC GATCGCCACC ACCGAGCGCA TCCAGGCCGT CGCCGGGGAG
ATCGAGGCCC AGCTCGTCCT GGCCGAGGAG CTCGCCTTCG GGCTGGGCCG CCGCATCGAC
GCCGGTGAGT CGATTCCCCC GCAGCGTCTG GTGCTCGCGA AGCCGCTCAT CGTCCGGGCC
GCCGTCACGG CGGTGCAGAC GGCCGTCGCC GCGATCGGAA ATCCCGGGCT GACCAGGCAC
AACCCGCTCG AGCGCCATCT TCGGGACGTG CTGTGCGCCC GGGTCCACCC GCCGCAGGAG
GACACCGCGC TGCTGGTGGC CGGCCGCCGC GCGCTCGGCC TCTGA
 
Protein sequence
MTVELGTTTP TVDARSALPP VDVTDDALAT VTAQLAATAE EHDRSAEFPW RGLRVVHDAG 
LLRLGIAPRY GGADLSAVDS IRVFGALGQG DPSVALITAM TVFQHVLQSR DPWWPDELYR
TVVRDSLERP VLVNAIRAEH ELGAPARGGL PATTIRRTAS GWVLNGHKAY ATGSEGLSYH
LVWAATEDAD PLQGHAIVPG DSPGIRIVRT WDHLGLRASS THDAIYTDVE IPAGNFQGAP
ASEQRGDVAI GGVALGAAAL YLGVARAARD FFARFAQERV PTVLGRPIAT TERIQAVAGE
IEAQLVLAEE LAFGLGRRID AGESIPPQRL VLAKPLIVRA AVTAVQTAVA AIGNPGLTRH
NPLERHLRDV LCARVHPPQE DTALLVAGRR ALGL