Gene Franean1_2459 details

Gene Information

Locus tag: Franean1_2459
Symbol:
ID: 5670855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2926945
End bp: 2928102
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 75%
IMG OID: 641241376
Product: extracellular solute-binding protein
Protein accession: YP_001506797
Protein GI: 158314289
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID:
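
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function and variable names are illustrative and are not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 2926945
end_bp = 2928102


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1158 385
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.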


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0529514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.605436
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAGAC GCGCCCGTGT CGTCGCCGTT CTCGTCGCGC TCGGGATCGG GCTGGGGCTG 
GCCGCGCTGC CGGGGTGCGC GACGACGGGC AGTCCGCTGC CCCGCCAGGC TCGGCCGCTG
CCGCCCGCCG ACGGCGTCGC CGCCCCGGCC CGTCCCGAGG CGGCCGCCGG CTCGGCGGGG
CCCCCGTCAA CACCAGGCCA GCCGGGTCCG TCCGCGCAGC CCTGTATCGC CCGCGCGAGT
GTGCCCCCGC TCGCCGCCCT GCCGCCGCCG GGAGCGCGCG GATCGCGGAT CGAGGCGATC
CGGCGGTACG GCTACCTTCG CGTCGGGGTG ACCACGGTGG CCCCGCCGTT CGGGTCGATG
AACTGGCGCA CCATGGAGGT GGAGGGCTTC GACCCGGCCA TCGCCGGCGA GATCGCCGGG
GCCATTCTCG GCGACCCCGA ACTGGTGCAG TTCCGCGCGG TCGACACCCG GGACAGGGAG
GCGCTGGTCG CCGACGGCAC CGTCGACATC GTCACCGGCA CGATGACGAT GACCTGCGCC
CGCAAGGAGC GGGTCCGCTT CTCCGGGGTG TACTACGAGG CCGCGATGCG CATCCTGGTG
CCGGCCGGCG CGGGCCTGCG CACCGTCGGT GACCTCGCTG GACGGCCGGT CTGCTCGTCG
CAGGGAAGCA CCTCGTTCGA GAAGGTCGCG AACCTGGTGC GCGGCCCCGG CCGGCCGGTC
GCGGTGAACC GGGTGGGGAT CGTGGACTGC CTGGCCGCGC TCCAGCGCGG CGAGGTGGAC
GCGGTGGCCA CCGACGACAC GATCCTCGCC GGGATGCGCG CCGAGGACGC CACCGTCACC
GTCCTCGGTC CCGAGGCGTT CGACGGCGTT CTGGGCCCGG CGGGCCGCGC CGGCCTCGAC
GAGCCCTACG GTGTGGCGAT CGGCCGGACG GATCTCGCGC CGGGCCTCTC CCCGGCCGAG
GCCGCGGCGA ACCGGGCCGC CGACGACGCC TTCGTCGCCT TCGTCAACCG GGTGCTGCTC
GACATGATGA CCGGCCCGAC CTGGAACAGG CTCTACACCC GCTACCTGCG CGACGTCCTG
CGGGTCCCGG GCGTGCCCCC GAACGCCATC CAGCCGAACT GGCCGGACGG CGTGCTCGTC
ACGGGCGGCG GATCGTGA
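
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and only the first line of the sequence is used as an excerpt, so the printed value describes that excerpt and not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a short excerpt.
excerpt = "GTGAACAGAC GCGCCCGTGT CGTCGCCGTT CTCGTCGCGC TCGGGATCGG GCTGGGGCTG"
print(round(gc_percent(excerpt), 1))
```

Passing the full sequence, with its line breaks, to `gc_percent` would give a figure to compare against the GC content field in the record.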
 
Protein sequence
MNRRARVVAV LVALGIGLGL AALPGCATTG SPLPRQARPL PPADGVAAPA RPEAAAGSAG 
PPSTPGQPGP SAQPCIARAS VPPLAALPPP GARGSRIEAI RRYGYLRVGV TTVAPPFGSM
NWRTMEVEGF DPAIAGEIAG AILGDPELVQ FRAVDTRDRE ALVADGTVDI VTGTMTMTCA
RKERVRFSGV YYEAAMRILV PAGAGLRTVG DLAGRPVCSS QGSTSFEKVA NLVRGPGRPV
AVNRVGIVDC LAALQRGEVD AVATDDTILA GMRAEDATVT VLGPEAFDGV LGPAGRAGLD
EPYGVAIGRT DLAPGLSPAE AAANRAADDA FVAFVNRVLL DMMTGPTWNR LYTRYLRDVL
RVPGVPPNAI QPNWPDGVLV TGGGS