Gene Franean1_6244 details


Gene Information

Locus tag: Franean1_6244
Symbol:
ID: 5674563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7580438
End bp: 7581610
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 70%
IMG OID: 641245096
Product: epoxide hydrolase domain-containing protein
Protein accession: YP_001510492
Protein GI: 158317984
COG category:
COG ID:
TIGRFAM ID:
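
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 7580438
end_bp = 7581610

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single untranslated stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1173 390"
```

Both results agree with the Gene Length and Protein Length fields.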


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGTCG AGCCTTTTCG GATCCACATC AGCGAGGACC GTCTGGCCGT GCTCGGCGAC 
CGGCTGCGCA CCACCGACTG GGCGGAAGAT CCGGTACGGG ACGACAGCTG GCACTACGGC
GTGCCCGCTC CGTACCTGCG CGAGCTGACC GAGTACTGGG CCACGCGGTA CGACTGGCGC
GCGCACGAGG CGGCGATGAA CCGGTGGCCG CACGTCCGGG GCGAGATCGA CGGTGTCACC
GTCCATGCGC TCCACGAACG CGGCTCCGGC CCCGCACCGC TTCCGCTCGT CCTCTCCCAC
GGATGGCCGT GGACCTTCTG GGACTTCAGG AAGGTGATCG AGCCGCTCGC TCACCCCGAA
CGTTTCGGCG CCGATCCCTC GGACGCCTTC GACGTCGTCG TGCCGTCCCT GCCCGGCTCG
GTCTTCTCCT CGCCCACCCC GGCCGGGGTC GGATTTCGGC AGACCGCCGC CCTGTGGGTG
AAACTCATGA CCGAACTCGG CTACCAGCGC TTCGGCGCGC ACGGCGGCGA CTCGGGCGCG
TACGTCACCG CACAGCTCGC CCACGAGTTC GCCGACCGGC TCGTCGGCGC ACATCTGACG
TTCCCCGCTT TGCTCGGCAC CGACCTGGGC GGGGTGAGCC GGGACGACTT CGCGCCCGAG
GAGGTCGACG ACTTCGACCG GCAGCGTCCC GCGATGCTCA ACCTCACGCA TTTCCTGACG
CACACCTTCG AACCCCGGAC CCTGGCCTGG GCGTTGCAGG ACTCCCCCGC CGGCCTGGCG
GCCTGGATGG TTCAGCGGCG GCGGGCCTGG AGCGACTGCG GCGGCGACGT GGAACGCCGC
TTCAGCAAGG ACGATCTCAT CACGAGCTTC GCCCTCTACT GGCTCACCGG CACCGTCGGC
GGCTCGCTGC GGTTCTACGC CGACTCGTTC CAGCGGCCGT GGATCCCCTC GCACGACCGT
CGGCCCGTCC TGGAGTCACC GACGGGCATC GCCGTGTTCC CGTACGAGCT GACGCATGTG
CCACGCACTC TTGCGCAGCG CGAGGCGAAC CTCGTGCACT GGACCCGGAT GAGCCGTGGC
GGCCACTTCG CAGCGGCTGA GGAACCACAA CTCGTCGTGG CGGACATCCG CGCGTTCTTC
CGGCCGCTGC GCGCGACCGG ACTGTCCCGC TGA
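
The gene sequence above is laid out in space-separated blocks of ten bases, so any length or base-composition check needs to strip that whitespace first. A minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper, not part of the source database.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used here only as a small sample.
first_line = "GTGGCAGTCG AGCCTTTTCG GATCCACATC AGCGAGGACC GTCTGGCCGT GCTCGGCGAC"
sample_bases = "".join(first_line.split())
print(len(sample_bases))  # prints 60
print(gc_fraction(first_line))
```

Pasting the full 1173 bp sequence into the same function gives a value that can be compared against the GC content field.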
 
Protein sequence
MAVEPFRIHI SEDRLAVLGD RLRTTDWAED PVRDDSWHYG VPAPYLRELT EYWATRYDWR 
AHEAAMNRWP HVRGEIDGVT VHALHERGSG PAPLPLVLSH GWPWTFWDFR KVIEPLAHPE
RFGADPSDAF DVVVPSLPGS VFSSPTPAGV GFRQTAALWV KLMTELGYQR FGAHGGDSGA
YVTAQLAHEF ADRLVGAHLT FPALLGTDLG GVSRDDFAPE EVDDFDRQRP AMLNLTHFLT
HTFEPRTLAW ALQDSPAGLA AWMVQRRRAW SDCGGDVERR FSKDDLITSF ALYWLTGTVG
GSLRFYADSF QRPWIPSHDR RPVLESPTGI AVFPYELTHV PRTLAQREAN LVHWTRMSRG
GHFAAAEEPQ LVVADIRAFF RPLRATGLSR