Gene Franean1_4552 details

Gene Information

Locus tag: Franean1_4552
Symbol:
ID: 5672899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5431216
End bp: 5432514
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 73%
IMG OID: 641243415
Product: cyclase/dehydrase
Protein accession: YP_001508831
Protein GI: 158316323
COG category:
COG ID:
TIGRFAM ID:
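
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single untranslated stop codon; the function names `cds_span` and `expected_protein_length` are illustrative and not part of the record.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = cds_span(5431216, 5432514)
print(span == 1299)                          # matches the Gene Length field
print(expected_protein_length(span) == 432)  # matches the Protein Length field
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length.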


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCG CCACCGCCTA CCTGCTGATT CACGAGGGCA TCCGCGCGGA GACGCGCCGC 
CTGGCCGACT TCGCCGCGCA GCTGGCCGCC GGCCGCCGCT ACGCCGGACC GGCCCAGCTC
ACCGCGCTGC GGACGCACCT CGACGAGGTC GTCAACGTGA TCCATCACCA CCATGTCGGC
GAGGACACGC ACCTGTGGCC GCTGCTGCGA CGGTTCGCCG ACCCCTTCGA CCGCGTGGAC
GGGCTCGACG TCCTGGACGG GCTCGCCGGT GACCACGACG TACTGGACCC GCTGATGGAA
CGGGTCCGCA CGGCACTCGC CCGACTGGCA GCGACCGCCA CCACCGGCGC GTCGGACGAC
ACCTCCGACC GGGCCACAGC TGAGGGCTCG GCTGAGGGCT CGGCCGGCGC CGCCGCCGAG
TTTGCCGCGG CCGCGACTGT GCTGTTCACC CTGATGGACG AGCACCTTGC CGTCGAGGAG
TCCATCGTCG TGCCGATCCT GCGCGAGCGG GTACCGGACG ACGAACTGGC CGCCATGGAG
AAGCGGATGC AGCGCGGGAG CAAGATCCGG CTCGGGTTCG TACTGCCGTG GCTCGACGCG
GCCGACCCGA CCCGGATGGC CGAGACAGCC GCCCAGCTCG GGCCGGTCTT CCCGGCGCTA
CTCACCCTCA CCCGCCGTGG CTACCAGCGC CGCGTCCGCG CCGCCTACGG CGTCACCGCC
GCGAGCCCAG GACCGGTCAC CCTGCGAGGA CAGGCCGAGA TCGTCATCGA GGCCACCCCG
GAGCAGGTGT ACGAGGCGAT CGCGGACGTC ATCCGGATGG CGCGCCACAG CCCGGAGTGC
TACCGCTGCG CGTGGCTCGA CGGCGCGGCT GCTCCACTGC CGGGCGCCCG CTTCCGCGGC
TGGAACCGTT TCCGGGGCGC CCGCTGGAGC CGGGAATGCG AGATCGTCAC CGCCGAGCCG
GGCGTGGCCT TCGCCTACCG CACCGTGCGT ACCAGTACCA GGCCGGACAG CACGCTGTGG
CGCTTCGAGC TGACCCCGAC CGCTGCCGGC ACCCGGCTCC GCCAGACGTT CGAGCTCTCC
GGCGCGGCGC CCGTCATGGT GTTCGAGCGG CTGAGCGGCC GCACCACCAG CACTCCGAAG
GCGATGGCGC GCACGCTCGC CCGACTGCGG GACGACCTCC GCACCAGGTC GGACCGCGTG
GGCGGGGCCG ACATTACCGC CGGATCGGAC ACCGCCCGCG GATCCGAGAC CCGCAGCCGC
CTCGGCGAGG ATCTGGTCAG CGCGCGAAGC GGCCAGTGA
 
Protein sequence
MASATAYLLI HEGIRAETRR LADFAAQLAA GRRYAGPAQL TALRTHLDEV VNVIHHHHVG 
EDTHLWPLLR RFADPFDRVD GLDVLDGLAG DHDVLDPLME RVRTALARLA ATATTGASDD
TSDRATAEGS AEGSAGAAAE FAAAATVLFT LMDEHLAVEE SIVVPILRER VPDDELAAME
KRMQRGSKIR LGFVLPWLDA ADPTRMAETA AQLGPVFPAL LTLTRRGYQR RVRAAYGVTA
ASPGPVTLRG QAEIVIEATP EQVYEAIADV IRMARHSPEC YRCAWLDGAA APLPGARFRG
WNRFRGARWS RECEIVTAEP GVAFAYRTVR TSTRPDSTLW RFELTPTAAG TRLRQTFELS
GAAPVMVFER LSGRTTSTPK AMARTLARLR DDLRTRSDRV GGADITAGSD TARGSETRSR
LGEDLVSARS GQ
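
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing and compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input shown is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used as a short example.
fragment = clean_sequence("ATGGCCAGCG CCACCGCCTA")
print(len(fragment))
print(fragment[:3])  # start codon of the coding sequence
print(gc_fraction(fragment))
```

Applied to the full gene sequence, the same helpers would give a value to compare against the GC content field, and the cleaned lengths can be compared against the Gene Length and Protein Length fields.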