Gene Franean1_7272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7272 
Symbol 
ID5675573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8877861 
End bp8881166 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content76% 
IMG OID641246109 
Productpatatin 
Protein accessionYP_001511497 
Protein GI158318989 
COG category 
COG ID 
TIGRFAM ID[TIGR03607] patatin-related protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.341762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.239541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA CGTTTAGTCT TGCGGCCATG ACGGAGGGCA TGGCGGCAGT CGCCGACGAG 
TCCGACGACG GGTCGACCCA GGAGGTCCGG CTCGCGGTCG TGATGACCGG CGGCGCGAGC
CTGGCGGTGT GGATGGGCGG CGTCGCCCGG GAGATCAACC TCCTGACCGG CGCCTCCGCG
CACCGTGACC GCGCCGCGGG TGACGGTCAC GTCGGCGTGG CCGGGACCGG AGACCAGGGC
CTGTCTGCCG CCGACGCGGC GGCCGCGTCC CGCTGGGCGG CGCTGCTGGA GATCCTCGAC
GTCGAGGTGT CGGTGGACGT CCTGGCCGGG GCCTCGGCGG GTGGGATCAA CGCGGCGCTG
CTCGGCTACG CCAACGCCCA CGACGCCGAC CTGGGCCCGC TGCGCGATCT GTGGATCACG
CTGGGATCAC TGCGCGAGCT GATGCGACGC CCGGGCGAGC GGAGCTTCCC GTCGCTGCTG
CGCGGCGACG CCGTGATGCT GCCCGCCCTG CAGACCGCGC TGCGCAGCCT CGAGCCCTCG
TCCCGAAGCG GTCACACCGC CGAGGCGAGG CGCGCCAGAT CCGCGCGCAC GGTTCGCCCG
ACCTCGGTGT TCATCACGAC CACGATCCTG CGCGGCGAGT CGGCGCGCTG GTCGGACGAC
CTCGGCGGCA TCGTTCGCGA CCTCGACCAC CGCGGCCTGT TCGTCTTCGG CGAGGACGAC
CTGACCGACC CCGAGGCGGC CAACCGCCTG GCGCTTGCCG CCCGGGCCAG CGCGGCCTTC
CCCGGTGCCT TCGAGCCGGC CTACATTCCC GTCCAAAGCT CGCCCGACGC GGACCATCCC
GACATGGCCG ACTACGTCAA CGCGGCCAGC GGCTTCTACG GCTCCGACGG CGGCATCCTG
GTGAACCGCC CGATCGGGCC GGCCCTGCAA GAGATCTTCG ACCGGCCGGC GCACGGGCGG
CAGGTCCGCC GCGTGCTCGC CTACGTGGTG CCCGCGCCCA ACCCGCCGGA CGACCCCACG
CCGAGCCGAG ACGACGGGTC GACGCCGACA GTCTCCGACA CGCTGCTCGG CGCCCTGGGC
GCGGCGCTGA ACCAGTCGAT CGCGAGCGAC CTGCGGCGCA TCAGGGACCA CAACGAGGCG
GTACGCGGCG TCCGCGCGAG CCGGCGCGGG CTGTTCCGCC TCGCCCCGGC CGGCGGCCCC
CGCCTCGCCG ACGAGGCAAT CTACGAGGCC TACCGCCACC GCGAGGCCGA ACGCGCCGCG
GACGTCGTCC TGAGCACGCT GGACCGGATC GTCACCGGCG CGAACGCCGG CGCGGACGCG
GCGGCGCTGC TCGCGGACCC GGCCGGACGG ACGGACCTGC GGGCCGCGGC GGTCACGGCC
GCGTTGGCGG AGTTCCCGGA CCGGCTGCCC GGCCCGGACG ACCTCGGTGA CCTTTGGCGG
CTGGGCCGCC CCGCGGTGGA CGCCGCCAAG GGCATCCTGA TTTCGATGAT CAACGAGGCC
TACGTCCTGT CGACCGACCC GGACGACCGG GTGGCGCTGG CGACGCTGGC CCGGGAGGTG
CACGCGGCGG TCGTCGCGCC CCGCGAGCAG GCCGGAGCAC ATCTGCTCCC GCTCCCGTCG
ACGCCGGCGA ACCGGGGCGG CTTCACCGTC GTCCGCCACA CCCTGCGCGA CCTGGCGGGG
GCGTCGTCCG CCGCCGTGGT GGCCGAGGCG ACCCGCCGCT GGCTGCGCGG CGCCGGCGAC
GAGGAGACCA GCCGGACCGA CCTCACGCTC GCCTGGAGCA CGCTGGGCTC GGTCACCGAG
AACCTGCGCG GTCTTCTGAC GGGCGTCCTC GCCCGCTCAG CACCATCGCG GACGGCCACA
ACCGACCTCG CGGACAGGCC CGGCCACGGG GACGGGGAGC ACGGGGGACG ACACGGGCGG
GACGGGCAGG ACGAGGACCG GCCCTTCGGG CCCGCGGCGC TGATCGCACC AGGGCCGCGC
ACCCACGCGA ACGGGCTGAG CCTCACCGCG CGGCGCCGGG TCGCCGCCCG CACCCTGCGG
GACTTCGTCG GCTACCTCCC CACACGGCGT TCGCGGGCCG CGCTGGCCCT GCTCGACCTG
CACCTGGTCA CCCACGCCCT GACCGGCGGG GACGTCGTCG ACCAGCCGGT GGAGCTGGTC
CAGGTGAGCG CGGACATCCG CTGCGGTCTG GACCCGTCAC GGGCGAGCGC CGACCGGAAG
CTGACCGGGC TCCAACTCGG CAACTTCGGC GCGTTCGCGA AGTCGTCCTG GCGGGCCAGC
GACTGGATGT GGGGCCGGTT GGACGGCGCC GCCTGGCTGG CCCGCATCCT GCTCGACCCC
CGGCGGCTGG TCCACTTACG CGATGCCGCC GCGCCGGGCG CGGAACCGCC CGACAGCACC
GCGACCTGGC TCACCGAATT CGTGGAGCTG CTGGCGACGG TCGCCGCGGG CCCCGTCACC
GCGGACGTCC TCGACGAGCT CGCCTGGCTC GACGACCCGG ACGCGGCCGT GCCGGCCGCC
CTGCCCGAGA CCGCGGCCTG GGTCGCCACC GGGATCCAAC GCGAGATCGC CGCCGGCGAG
CTGACCAGCG TCGCCGACGC CGTCCGGGCG GACATCGCCG GTGCCAGGGT CGGCTCCGCG
CCCACCCGGG AGTTCCTTGA CGCGATGGCG CCCGGCCCGG CCCAGCTCGA TCCCGCCGAC
ACCGCCCGGG TGCTGCGGGC GTGCCGCGTC TCGGACGAGC GGCTGACCGA TCGGGCCAAC
AGCCCGATCC TCGCCATCAC GGTCGCCCAG GTGCTGGCGG TCCTCACCGG GTGGCTGGCG
TCGCTGCGCG CGCTGCCGCG GCCGCTGCGT CCGGCGGTCG CGGCGGTGCG CGCCGTCGCT
CGCGTCGCCT ACGCGCTGGT GGACGACGTG ACGCGCGGCC GGCGCCGGGC GACGATCGCC
CTCGGCACGG TGCTGCTCGC CGCCGGGCTC GCGGGCGCGC TCGTCCTGTC CGGCCCGATG
GGCGGCGTGG GGCTGCTGGT GGCCGTGACG GGGCTGCTGC TGATCAGCCT CACGGGCTGG
CGGGTGCTGC CCGCGGGGCT GGCCGTCGTG GGCGTGGCGG GGCTGGCGGC GGTGGCCGCG
GCGGGGGTGA TCCCGGTCGT GGGTGACCAT CTCTTCCCCT GGCTACACGA CGACGCGGTG
CCCTACCTGG CCGATCATCC CTGGGCGTGG GCGGCCGTCT TCGGCCTGCT GATGCTCCCG
CCGGTGTGGT CACTCGCCGA ACTCCTGCGC CCGCGGCGGC GGAGCCGACA CCGGCCGGCA
AACTGA
 
Protein sequence
MSGTFSLAAM TEGMAAVADE SDDGSTQEVR LAVVMTGGAS LAVWMGGVAR EINLLTGASA 
HRDRAAGDGH VGVAGTGDQG LSAADAAAAS RWAALLEILD VEVSVDVLAG ASAGGINAAL
LGYANAHDAD LGPLRDLWIT LGSLRELMRR PGERSFPSLL RGDAVMLPAL QTALRSLEPS
SRSGHTAEAR RARSARTVRP TSVFITTTIL RGESARWSDD LGGIVRDLDH RGLFVFGEDD
LTDPEAANRL ALAARASAAF PGAFEPAYIP VQSSPDADHP DMADYVNAAS GFYGSDGGIL
VNRPIGPALQ EIFDRPAHGR QVRRVLAYVV PAPNPPDDPT PSRDDGSTPT VSDTLLGALG
AALNQSIASD LRRIRDHNEA VRGVRASRRG LFRLAPAGGP RLADEAIYEA YRHREAERAA
DVVLSTLDRI VTGANAGADA AALLADPAGR TDLRAAAVTA ALAEFPDRLP GPDDLGDLWR
LGRPAVDAAK GILISMINEA YVLSTDPDDR VALATLAREV HAAVVAPREQ AGAHLLPLPS
TPANRGGFTV VRHTLRDLAG ASSAAVVAEA TRRWLRGAGD EETSRTDLTL AWSTLGSVTE
NLRGLLTGVL ARSAPSRTAT TDLADRPGHG DGEHGGRHGR DGQDEDRPFG PAALIAPGPR
THANGLSLTA RRRVAARTLR DFVGYLPTRR SRAALALLDL HLVTHALTGG DVVDQPVELV
QVSADIRCGL DPSRASADRK LTGLQLGNFG AFAKSSWRAS DWMWGRLDGA AWLARILLDP
RRLVHLRDAA APGAEPPDST ATWLTEFVEL LATVAAGPVT ADVLDELAWL DDPDAAVPAA
LPETAAWVAT GIQREIAAGE LTSVADAVRA DIAGARVGSA PTREFLDAMA PGPAQLDPAD
TARVLRACRV SDERLTDRAN SPILAITVAQ VLAVLTGWLA SLRALPRPLR PAVAAVRAVA
RVAYALVDDV TRGRRRATIA LGTVLLAAGL AGALVLSGPM GGVGLLVAVT GLLLISLTGW
RVLPAGLAVV GVAGLAAVAA AGVIPVVGDH LFPWLHDDAV PYLADHPWAW AAVFGLLMLP
PVWSLAELLR PRRRSRHRPA N