Gene Franean1_4213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4213 
Symbol 
ID5672568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5018124 
End bp5019374 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID641243086 
Product4-phytase 
Protein accessionYP_001508503 
Protein GI158315995 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.798604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.63897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGT CCTTCGCGCT GAAGCTCCGT CCGGACCTGG CTTTCTCCGA CGGCACCCCA 
TTCGACGCCG CGGCGGTGAA GTTCAACTGG GACCGGCTCA AGGACCCGGC CAGCGCCTCG
CCCAGCGCGA CGGAAGCGGC GATGGTCGCC TCGGCCGAGG TAGTCGACGA CGTCACCCTG
AAAGTCACGA TGACCACTCC CGTGACCGCG TACACGCAGG CGATCGTCGG CTCGGCGATG
AACTGGATCG CCTCACCGGC GGCCCTGCAG AAGGGTCAGC AGGCCTTCGA CGAGAGCCCT
GTCGGCGCCG GCCCCTTCAC TCTGCAGAGC TGGACCAGGC AGGCCGAGAT CAGGCTCGTC
AAGAACCCCC GCTACTGGGA CGCACCCAAG CCCTACCTCG CCGGCATCAC GATGCGCGCG
GTGCTCGACG CCGACCAGCG TTACAACACC CTGATCAGTG ACGGCGCCGA TGTTGCCGTC
GAGACGAACT GGATCAACCT GGCCAAGGCC GAGAAGGCGG GTCTGCCGAC CGACCTCCTG
CCGCTCAGCG GTGGCTACTT CATCGCCCTG AACACGCGCA GAGAGCCGTT CAACGATATT
CGCGCCCGAC AGGCCGTGGC CGCGGCACTC GACATCGATG CGCTGAACCT CGCCGTCTAC
AACGGCGAAG GCCAGGTGGC TGACACGCTG TTCACCAAGA ACTCCCCCTT CTACTCGGAC
AAGCCACTGA CGACCGTGGA CCAGGCGAAG GCCCAGAAAC TCTTCGACGA GCTGGCCGCC
GAGGGCAAGC CCGTGTCCTT CACGTTCTCC ACCTATCCGT CCAGCGAGAA CAGGGCGATC
GCGGAGAACG TCCAGGCCCA GCTCGACAGC TTCAAGAACG TCAAGGTCGA GGTCGCGACC
GTCGACTACT CGCAGGTCGG CGCGATGCGC ACGACCCACG ACTTCGACGC GATCGTATCC
GCCGCGGCCT TCCAGGACCC CGAGCCGCGG CTGTTGGCGA ACTTCACCGG GAACTCGCCG
GCGAACATGC CCGGCCCCGT GGACCCGGAG CTCGACAAGA ATCTGCTGGC CGGCCGGACC
GGAACGTCGT TGGAGCAGCG TAAGGCGGCC TACGACGCGG CGCAGGCGCG GTTGACCGAG
GCGATGCCGG CCATCTTCCT CACCCGGTCG GCGCCTGCCG TCATCACGGG CAAGAACGTG
GGCGGCATCG TGCAGTACGG CGCGGGTTCC CTGCTACCCG AGGATCTGTG A
 
Protein sequence
MAESFALKLR PDLAFSDGTP FDAAAVKFNW DRLKDPASAS PSATEAAMVA SAEVVDDVTL 
KVTMTTPVTA YTQAIVGSAM NWIASPAALQ KGQQAFDESP VGAGPFTLQS WTRQAEIRLV
KNPRYWDAPK PYLAGITMRA VLDADQRYNT LISDGADVAV ETNWINLAKA EKAGLPTDLL
PLSGGYFIAL NTRREPFNDI RARQAVAAAL DIDALNLAVY NGEGQVADTL FTKNSPFYSD
KPLTTVDQAK AQKLFDELAA EGKPVSFTFS TYPSSENRAI AENVQAQLDS FKNVKVEVAT
VDYSQVGAMR TTHDFDAIVS AAAFQDPEPR LLANFTGNSP ANMPGPVDPE LDKNLLAGRT
GTSLEQRKAA YDAAQARLTE AMPAIFLTRS APAVITGKNV GGIVQYGAGS LLPEDL