Gene Franean1_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2067 
Symbol 
ID5670468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2490287 
End bp2492089 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content75% 
IMG OID641240989 
Productextracellular solute-binding protein 
Protein accessionYP_001506410 
Protein GI158313902 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACGCCT CTCCGGGGCG GCGGCTCGGT CATGCCCGAG GGATCGCCGC CCTGCTCACC 
GCCCTCCTGA CCACGGTCGC CCTGCTCACT GTCGCGGGAT GCAGCGGCGA GTCGGACCCG
ACGCCGGGCC CGATGGCGGC GAGCCCGACC ACGCCGCCGA CGACCGCCTC CCCGCTCGAG
AAGCCCGGCG GCACGCTGCG CCTGCTGACC GGGCGGATGC CGACCGGCGA CCCCGGGTGG
GCCGACGAGC TGGGGGAGCG GGCGTTCGCC CGGCTGGTGA CCCGCCAGCT CTACAGCTAC
CCCGCCGACG CCGACACCGC CAGGTCGACG ATCCCGCGCC CCGACCTCGC GGCCGGCGCC
CCCGTGGTGA CCATGGGCGG CACCGTCTAC ACCGTGCGGC TGCGCTCGGC GGCCCGGTGG
AACACCCCGA ACCAGCGCCG GATCACCGCG ACGGACGTCG CCCGCGGCCT CAAGCGGATG
TGCGCGCCGC CGTCGCCCTC CCCGCTGCGC GGGTACTACG CGGCGACGGT CGTCGGCTTC
GCCGAGTACT GCGCCGAGCT CGCCGCAGCG CCGGTGGCCG ACGCCCCCGC GCTGATCGAG
AGCGGCACCG TCCCCGGGAT CGAGGTCATC GGCGACGACA CCCTCGCGTT CCACCTGATC
AAGCCGGTCA ACGACTTCGT GGACATCCTG GCCCTGCCGG CCAGCTCGCC GGTGCCGCTG
GAGGCCCTGG CCTACCCGCC GGACTCCCAG CAGTACCTCG ACAACCTGAT CTCCGACGGG
CCGTACCGGT TCGTGTCGGA GCCCGGCGGT GGCTACCGGC TGTCCCGCAA CCCGGCCTGG
AGCGGCTCCT CGGACGGCAT CCGGCGGGCG CTGCCCGACC ACATCACCGT CACCGACGGG
CTCGACCCGG CGACCATCAC GGCGCGCATC GAGGCCGGTG ACGCGGACAT GGCCCTGAGC
GGCGACATCC CCGCCGACGA CCTGGCCCGA CTGGTCGAGA GTGCCGACAA GAAGCTCGTG
GTCGCTCCGA CCGGCCCGGT CGTCGCCCTC GTCGTCGGAC TCAACGGGCC GTCCGCGGCG
GCCCTGCGCG ACCAGCAGGC CCGGGAGGCG CTGGCCTACT GCATCGACCG AACGGCGGTG
GCCGCCGCGC TGGGTGGCCC CATGCTCGCC ACGGCGACGG CCCAGCTCCT GCAGTCGCCG
ATGACCGGCT ACGAGACGTA CAACCCCTTT CCGGCCGGGG ACGGCTCCGG GGACTCACGG
CGCTGCGCCG ACGGCCTCGC GAACAACCCG GGCGGGAAGG TGACGGCGCT GTCCCTGCTG
ACCACGGACA GCGCCACCGA CACGGCGGTG GCCGAGGCGC TGCGCGCCGC GTTCGCCCGC
GCCGGAATCC GCCTCGACCT GCGCATCCGC ACCGGCGCGC AGTACACGGC GGCCGCGTCG
AGCCCTGGCG GGCAGTTCTG GGACCTCGCC CTGACCACGA TCACCCCGGA CTGGTTCGGT
GACGCCGGTC GCACCGTCTA CGAGCCGCTG CTGGACGAGG CCTGGGTGGG CGCCCGGCCG
GCCGACGGCG GCTACCGCCG TCCGGACCTC CTCGCCCGCT ACGAGTCCGC CGTGACGGCC
TCCTCCGAGG ACGACGCCGC CACGGACTGG GCCGGGCTGG AGCGAACGGT GCTGAACGAC
GCCGCGATCG TGCCCCTCGC GGTCACCCAC ACGTTGCGGC TGCGTAGCTC GGCGGTACAG
GCGTTCACGA TCGTGCCGTC GCTGGGAACC GCCGATCCCA CAGCGGTTTC GCTCGGTCCC
TGA
 
Protein sequence
MNASPGRRLG HARGIAALLT ALLTTVALLT VAGCSGESDP TPGPMAASPT TPPTTASPLE 
KPGGTLRLLT GRMPTGDPGW ADELGERAFA RLVTRQLYSY PADADTARST IPRPDLAAGA
PVVTMGGTVY TVRLRSAARW NTPNQRRITA TDVARGLKRM CAPPSPSPLR GYYAATVVGF
AEYCAELAAA PVADAPALIE SGTVPGIEVI GDDTLAFHLI KPVNDFVDIL ALPASSPVPL
EALAYPPDSQ QYLDNLISDG PYRFVSEPGG GYRLSRNPAW SGSSDGIRRA LPDHITVTDG
LDPATITARI EAGDADMALS GDIPADDLAR LVESADKKLV VAPTGPVVAL VVGLNGPSAA
ALRDQQAREA LAYCIDRTAV AAALGGPMLA TATAQLLQSP MTGYETYNPF PAGDGSGDSR
RCADGLANNP GGKVTALSLL TTDSATDTAV AEALRAAFAR AGIRLDLRIR TGAQYTAAAS
SPGGQFWDLA LTTITPDWFG DAGRTVYEPL LDEAWVGARP ADGGYRRPDL LARYESAVTA
SSEDDAATDW AGLERTVLND AAIVPLAVTH TLRLRSSAVQ AFTIVPSLGT ADPTAVSLGP