Gene Franean1_5830 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5830 
Symbol 
ID5674153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7072658 
End bp7073893 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content74% 
IMG OID641244680 
Productpatatin 
Protein accessionYP_001510082 
Protein GI158317574 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGGA AGCGGGCTTC GCGCAGGGGG CTGGTGCTTG GCGCCGGCGG AGTGCTCGGC 
TCAGCCTGGA TGATCGGCGC ACTGTGGGCC GTCGAGTCCG AACAGGCCAT CGACGTCCGG
GATCACGACC TGGTACTCGG GACATCCGCC GGATCGGTCA TCGCGGCGCT GCTCGGCCTG
GGGGTCGGCG CGGACGTGAT GGTCAATTCC GAGCGTGGGA TCTTCGAGCC GGGCTACCCC
GTGCTGGACT ACCGCGATCT CGGAGCGTCC CTGCCGCCCA GGCCGCGGAT GCGGATGGGC
TCGCCGCGGC TGCTGACCGC GACCGCTCTG CATCCCCGCC AAGCGACGCC GATGGTCGCG
CTGGCGGCGC TGCTCCCGCA GGGCCGTGGC CAGATCGCCG CGATCGGTGA TCTCGTGGCC
GAGGCGGCTG CCCGGATCGA GCATCCGCTG GACGCGCTGG CCGCCAAGCG CTCGGCCGCG
GTGCCCCTCG GCGCGCGCCG GGGACGTCAG CCCGCCTCGT TGGCGGCGAA CGGCTGGCCG
CGGTCCCCCC GGCTGCGTGT CGTCGCGATG GACTTCGACT CGGGCGGCAG GGTGCTCTTC
GGCTCCCCGG ACGCGCCGCG GGCGGCCCTG CCCGACTCGG TGATGGCCTC GTGCGCCATC
CCCGGCTGGT ACGCGCCGAT CAAGATCGCC GGCCGGCGCT TCGTGGACGG CGGCACCAGA
TCACCGGCGT CCCTGGACCT GCTGGTCGAG GAGGAACTGG ACGAGGTGCT CGTGCTCGCG
CCGGCCTGCT CCTTCGACAG TGACCGCCCG CGTGGCGCCA TCGCCCGGGT GGAGCGGCAG
ATGCGTCGGG CCGCGACGCG ACGGCTGGCC CGCGAGATCG AGCTGCTGGA GGCCGCCGGC
ACCCGGGTGA CAGCGCTGTG CCCGGGTCCG GAGGATCTAG AAGTGATCGG CGGGAACGTC
ATGGACCTGA CACGCCGCGC GGAGGTGTTC GAGACCTCGC TGCGAACCTC GGCCGCCGCC
CTGCGAACCG CGCTCGGCAC CGACACGCGC CCCACCCGTC CCATCCGCGC CACGCGGGCG
GCGGGAACAG CGACGGCGGG AACAACGGCT GGCCCGGGTG ACTCCGGTGC CACCGCGAGC
GGTCCGGGAC GTCGCGCCGG TGGCCGGGAG GCACTTGGCC GCGCGGCGGC GACCACCGCC
GCGGACCAGA AGGCGGACCT CGGTCTGGTC GGCTGA
 
Protein sequence
MPRKRASRRG LVLGAGGVLG SAWMIGALWA VESEQAIDVR DHDLVLGTSA GSVIAALLGL 
GVGADVMVNS ERGIFEPGYP VLDYRDLGAS LPPRPRMRMG SPRLLTATAL HPRQATPMVA
LAALLPQGRG QIAAIGDLVA EAAARIEHPL DALAAKRSAA VPLGARRGRQ PASLAANGWP
RSPRLRVVAM DFDSGGRVLF GSPDAPRAAL PDSVMASCAI PGWYAPIKIA GRRFVDGGTR
SPASLDLLVE EELDEVLVLA PACSFDSDRP RGAIARVERQ MRRAATRRLA REIELLEAAG
TRVTALCPGP EDLEVIGGNV MDLTRRAEVF ETSLRTSAAA LRTALGTDTR PTRPIRATRA
AGTATAGTTA GPGDSGATAS GPGRRAGGRE ALGRAAATTA ADQKADLGLV G