Gene Franean1_1224 details


Gene Information

Locus tag: Franean1_1224
Symbol:
ID: 5669637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1464725
End bp: 1465696
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 69%
IMG OID: 641240156
Product: extracellular solute-binding protein
Protein accession: YP_001505584
Protein GI: 158313076
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID:
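
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1464725, 1465696)
print(length)                     # 972
print(protein_length_aa(length))  # 323
```

Both results agree with the Gene Length and Protein Length fields of the record.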


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.228597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATGT CGCTCATCCG TAGCAAACGG CGAACGGCCC CCACGGACGC CATCGACTCC 
ACCACCCGCG CGGCCTCGGG CTCCCGCCCG CGGCGGCCGC TGCGCAGCGC GGCCCTGCTG
CTCGCCGCGG CACTGGCCGG CTCGCTCGCG CTGTCCGCCT GCGGCGACGA CGACAGCTCC
GACGAGCCGG CCGCCGCCAC CGCCACCACG TTCCCGGCGG GGAGCACGAT GGCCAAGCTC
CAGTCGGCGG GCACGATCAC CGTTGGCACG AAGTTCGACC AGCCGCTGTT CGGCCTGAAG
AACCTGCGCG GCGAGCCCGA GGGCTTCGAC GTCGAGATCG CCAGGATCAT CACCGACGCG
CTGGGGATCC CCGCGGACAA GGTCAAGTTC GTCGAGACGG TCTCGGCCAA CCGCGAGCCG
TTCATCGAGC AGCACCGCGT GGACCTGGTG GTGGCCACCT ACACCATCAA CGACAAGCGC
AAGCAGGTCG TCGACTTCGC CGGCCCGTAC TACGTGGCCG GTCAGACGCT GATGGTGCGG
GCCGGCGAGA CCGCCATCAC CGGGAAGGAC ACGCTCGCGG GCAAGAAGGT CTGCTCGGTG
AGCGGCTCCA CCCCGGCTGA GCGCATCCGC ACACAGGCAC CGGACGCCGA GCTGACCCTG
TTCGACGTCT ACAGCAAGTG CGCCGAGGCG CTCAAGGCCG GCCAGGTGGA CGCGGTCACG
ACCGACAACG CGATTCTGCT CGGCCTGATG GACTCCGACC CGGGCGCCTT CAAGCTGGTC
GGCGAGCCGT TCAGCACGGA GCCCTACGGC ATCGGCATCG CCAAGGGCGA TGACGAGTTC
CGCACGTTCA TCAACGACAC GCTCGAGGCG GCCTACACCG ACGGCCGGTA CGAGACGGCC
TACAAGGACA CTATCGGCAA GGTCGAGCCG GACATGCCCA CGCCTCCCGC GGTGGACCGC
TACACCTCCT GA
 
Protein sequence
MRMSLIRSKR RTAPTDAIDS TTRAASGSRP RRPLRSAALL LAAALAGSLA LSACGDDDSS 
DEPAAATATT FPAGSTMAKL QSAGTITVGT KFDQPLFGLK NLRGEPEGFD VEIARIITDA
LGIPADKVKF VETVSANREP FIEQHRVDLV VATYTINDKR KQVVDFAGPY YVAGQTLMVR
AGETAITGKD TLAGKKVCSV SGSTPAERIR TQAPDAELTL FDVYSKCAEA LKAGQVDAVT
TDNAILLGLM DSDPGAFKLV GEPFSTEPYG IGIAKGDDEF RTFINDTLEA AYTDGRYETA
YKDTIGKVEP DMPTPPAVDR YTS
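
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might clean such a block, compute its GC fraction, and translate it. It assumes the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ only in permitted start codons), and it assumes the input contains only the unambiguous bases A, C, G and T. All function names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) with the
# standard amino acid assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate("ATGCGCATGT CGCTCATCCG TAGCAAACGG"))  # MRMSLIRSKR
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would allow the GC content field to be compared against the record.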