Gene Franean1_0352 details

Gene Information

Locus tag: Franean1_0352
Symbol:
ID: 5668776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 420297
End bp: 421421
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 69%
IMG OID: 641239284
Product: phosphate ABC transporter, periplasmic phosphate-binding protein
Protein accession: YP_001504724
Protein GI: 158312216
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0226] ABC-type phosphate transport system, periplasmic component
TIGRFAM ID: [TIGR00975] phosphate ABC transporter, phosphate-binding protein
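
The coordinates and lengths listed in this record can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the coding sequence ends with a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 420297
end_bp = 421421

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1125 0 374
```

Under these assumptions the result agrees with the listed gene length (1125 bp) and protein length (374 aa).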


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00462659
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACGAAG GATTGCTAGT GAACCTAGGC AAGCGTCAGA TCGTCGCGGG CCTCGCCGCC 
GCGGCGCTGC TCTCTCTGGC CGCGTGTGGC TCGGACGACA ACACCGACAC CGCGGACGGG
ACCGCGCCCA CGGCCTCCGC CGGCGCGATC GACTGCGCCA AGGGGTCGAT CACCGGGTCG
GGGTCCTCCG CGCAGAAGAA CGCGATGGAC GAGTGGGTCT CGGCCTACCA GGACGCCTGT
GACGGCGCCA CCATCAACTA CCAGGGCTCC GGCTCGTCGA CGGGACGCCA GCAGTTCATC
GACAAGCAGG TCACCTTCGC GGGGTCCGAC TCCGCACTGA AGGATGCCCA GAAGACCGCG
GCGGACGCCC GCTGCACCGG CGGCGCGGCC GTCGACATCC CGATGGTCGT CGGGCCCATC
TCGCTCATCT ACAATCTTGA CGGTGTCGAC AAGCTGAACC TCAGCCCGTC GGCCATCGCG
AAGATCTTCT CCGGTGCGAT CACCAAGTGG AACGACCCGG CGATCGCCGC CGACAACTCC
GGCGTCAGCC TGCCGGACGC TCCCATCCAG GCGGTCCACC GCTCGGACGG CTCGGGCACG
ACGGACAACT TCACCAAGTT CCTCAAGGGC GCCGCGGCGT CCGACTGGAC CTTCGAGGGC
GGCTCCGACT GGACCGCGCC GGGTGGCCAG GGCGCCAAGG GCAGTGACGG TGTGACCTCG
ACCGTCAAGT CCACCCCGAA CGCGATCGGC TACGTCGAGC TCTCCTACGC GGAGAACGCC
AGCCTGCCGA CCGCGCTGGT CGGCAACGCC TCCGACGAGT TCATCGCGGC GAGCACCGAC
GCCGCCTCGA TCGGCATCAG CTCCGCCAAG GTCGCCGACG GCGACGACCT CAAGCTCACC
TTCGACTACA CCACGGCGAC CAAGGGTGCC TACCCGATCT ACCTGGCCAC CTACGAGATC
GTCTGCACCG CCGGCACTCC CGCCGACCAG GCTCCGCTCC TCAAGAGCTT CCTGACCTAC
ATCGCGTCCG CGGACGGCCA GGCCGCGATC GGTGACCTCG GCTACGCCCC GCTGCCGGAC
GAGATCGCCA CGAAGGTCCG CGGCGTCATC GCCAAGATCG CCTGA
 
Protein sequence
MDEGLLVNLG KRQIVAGLAA AALLSLAACG SDDNTDTADG TAPTASAGAI DCAKGSITGS 
GSSAQKNAMD EWVSAYQDAC DGATINYQGS GSSTGRQQFI DKQVTFAGSD SALKDAQKTA
ADARCTGGAA VDIPMVVGPI SLIYNLDGVD KLNLSPSAIA KIFSGAITKW NDPAIAADNS
GVSLPDAPIQ AVHRSDGSGT TDNFTKFLKG AAASDWTFEG GSDWTAPGGQ GAKGSDGVTS
TVKSTPNAIG YVELSYAENA SLPTALVGNA SDEFIAASTD AASIGISSAK VADGDDLKLT
FDYTTATKGA YPIYLATYEI VCTAGTPADQ APLLKSFLTY IASADGQAAI GDLGYAPLPD
EIATKVRGVI AKIA
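
Both sequences are displayed in whitespace-separated blocks of ten characters. The following is a minimal sketch of how one might strip that layout, compute GC content, and translate the reading frame. The function names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (the two differ only in the permitted start codons).

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown in this record.
first_line = "ATGGACGAAG GATTGCTAGT GAACCTAGGC AAGCGTCAGA TCGTCGCGGG CCTCGCCGCC"
print(translate(first_line))  # prints: MDEGLLVNLGKRQIVAGLAA
```

The printed residues match the first twenty residues of the protein sequence. Applying `gc_content` and `translate` to the full gene sequence would be expected to give values close to the listed 69% GC content and the 374-residue protein, though that has not been verified here.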