Gene Franean1_2464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2464 
Symbol 
ID5670860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2933385 
End bp2934401 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content68% 
IMG OID641241381 
Productextracellular solute-binding protein 
Protein accessionYP_001506802 
Protein GI158314294 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00586162 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGCG CCGCGCTGGT GGCGGCGGCG CTGATGACCG CAGCGGCGCT GGTTGGGTGC 
TCGACCGTTT CCGGTGACCT GCCCCGCGCG GCCGACCAGC CATGGGAGCC GCTGGTGACT
GCGTCGGGCG AGGCGGCCGA CGCAGCCCAG GACGCCCGTC CTGAGGCTGC GGTGCCCACG
AACCTCGTCC TGCCCACCGA CGCCGGGGAC TTCGCGTCGG GAGGATCACT CGACAAGATC
CGCAAGCGCG GCTTCCTGCG GGTGGGTGTC TCGCGCGACA CCCAGACCCG CGGCGCATGG
AGTCCGGTGT CGCACCGGTT CGAGGGCTTC GACGTCGAGC TCGCCCAGCG GATCGCGGAG
GCCCTGTTTG GTCCGGGCAC GGCGGACGAG AAGGTGCGGT ACCGCCCGGT CAGCTACGCG
GAACGCCTGC CGGCGGTCGA GAACGGGGAG GTGGACATCC TCGTCAGCAC CCTGACCTAC
TCGGAGTCAC GGGCCGAGCG TGTGGGTTTG TCCGCGGCTT ACTTCACCGC CCATCCACGG
CTGCTCTCCC ATCGGGAGAG CACGCACTCG GGAGCGGGGC CCGGCATCGA TTCCCCGGAG
GAACTCGCCG GGAAACGGGT GTGCGCCCCG CGGGGGACGA CGACGCTGAC GAACCTGGAG
AATACCCATC AAACCCATCC CACGTTCGAG ATCGTGGATC ACCTTAACGA GCTGTCCGAC
TGTCTGGTTG CCTTCCAGCA GGGTGAGGTC GACGTCGTGG CCGCGAATGA CGCAAGTCTG
GTGGGGATGC TGGAACAGGA TGCCACCGCC GTACTTGGAA CGTTCTCAGT CGGGCGGGAC
GAGTACTACA GTGTCGCATT CGAGCGGGAC GACACCGAGC TCGCCGGATT CGTCAACGGG
GTTTTGGAGC GTCTACGGCG CGACAAGCCG GAGTGGCTGA AATTATGCGA GAGGTGGAAG
GCTCCCGAGA TGCCCTGCGA GGAGTCGCTG CCACCCGAGC CGCAGTGGGC ACGCTGA
 
Protein sequence
MRRAALVAAA LMTAAALVGC STVSGDLPRA ADQPWEPLVT ASGEAADAAQ DARPEAAVPT 
NLVLPTDAGD FASGGSLDKI RKRGFLRVGV SRDTQTRGAW SPVSHRFEGF DVELAQRIAE
ALFGPGTADE KVRYRPVSYA ERLPAVENGE VDILVSTLTY SESRAERVGL SAAYFTAHPR
LLSHRESTHS GAGPGIDSPE ELAGKRVCAP RGTTTLTNLE NTHQTHPTFE IVDHLNELSD
CLVAFQQGEV DVVAANDASL VGMLEQDATA VLGTFSVGRD EYYSVAFERD DTELAGFVNG
VLERLRRDKP EWLKLCERWK APEMPCEESL PPEPQWAR