Gene Franean1_3559 details

Gene Information

Locus tag: Franean1_3559
Symbol:
ID: 5671928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4223260
End bp: 4224840
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 73%
IMG OID: 641242445
Product: extracellular solute-binding protein
Protein accession: YP_001507865
Protein GI: 158315357
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
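
The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines of Python. This is a minimal sketch, not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes one terminal stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the Gene Information fields above.
length_bp = gene_length(4223260, 4224840)
print(length_bp)                  # 1581, matching "Gene Length"
print(protein_length(length_bp))  # 526, matching "Protein Length"
```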


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.43971
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTTG CTCTCAACCG TCGCGCGTTC GGGCGTGCCG GCCTGCTCGC CGTCGGCGCC 
CTCACGGCGC CGGCACTGCT CGCGGCCTGT GGCGACGACG CCCCCGCCGC ACCAGGCGCG
GGCACGGGCG CGGCGGGGGC GACAGCGGCC CCGCGCCGCG GCGGGAAACT GCGGGCCGCG
TTCGCGAGCA GTCCGGCCGA CACCCTGGAC ATCGTGAAAG GCCACGACGT GCTGGGCTCG
GTGCGCGGCC TCGCGGTGTA CGAACGCCTC GGCGACGCCC ACCCGGACGG CTCGGTCTCC
CCGCGCCTGT TCGAGAGCCT GACGCCCAAC GCCGACGCGA CCGTCTGGAC GCTGAAGCTG
AAGCCGGGGA TCACCTTCTC CGATGGCAGG CCGTTGACGA CGGCCGACGT GCTGGCCAGC
TTCGCGACGT TCACCGGCAC CGAGAGCGGC GCCGCGATCG CCGCGTTCGA CCCGAAGGAG
AGCAAGGCGA CGGACGCGCG CACCGCGACC ATCGCGCTGA CCGCCCCCGT GTACGACCTG
CCGGCCCGGG TGAGCGGCGT CGTCCTGGTG ATCATGCCGG AGGGCAAGCC GGCGTCTCAG
CTCGGTGACG TCGTCGGCAG CGGGCCCTAC GAGATCGCCT CGTTCGTGCC CGGCCAGCGG
ACGGTCCTGC GCAGGCGCAC CGACTACTGG GACGGCGACG CCCGGGGCTA CCTGGACGCG
ATCGAGCTGG TGGCCGCGCC GGACGCGAAG TCGCGGCTGT CCGCGCTGCG GGCGGGCCAG
GTGGACTGGG CCGACGACAT CGCCTACCTG GACGCCTCGA CCCTGCGGGA GGACCGCGCG
ATCACGATCC ACCGCGGCGC GGCCGAGCAG GGCCTGGCCT GGTTCCTGAA CATGGCGGCG
CCGCCGTTCG ACGACGAGCG CGTCCGGCAG GCCCTGCGGT ACTCGGTGGA CCGCCAGAAG
CTCGTCGACA CCACGCTGTT CGGATTCGGC TCCGTCGGCA ACGACCTGTG GGGCAAGGGC
CTGCCGAACT ACAACGGCTC GATCCCGCAG CGGCCGCACG ACCCCGCGAA GGCGAAATCG
CTCCTGCAGG AGGCCGGGGT GGCCACCCCG GCCAAGGCCA CCCTGCTGAC CTCGCCCATC
GGCCCGGGTC TCGTCGAGGC GACCCAGCTC CTCGCCGACC AGGCCCGCGA GGTCGGCTTC
GACATCAAGG TCGAGGTCAT CCCGCCGGAC GTCTACTTCG CCCGGCCCGA GGAGTGGGCG
AAGGCGTCGG GCGTGGCGTT CGCCCAGGTC GGCGCGTTCA CCGACATGGC CCCGCTGGTC
TACCTGTCCG ACGGCCCGTT CAACTTCGGC TGGCGCAAGC CCGACTGGGA CGCCGGGTTC
GCCGACGGAG TGGGCGAGCT CGACGCCGCG AAGCGCAAAG CGACGTTCGA CGGCCTGCAG
CAGCAGCTCT GGGATTCCGG CTCCGACCTC GTGTGGGGGT TCGCGCCGAG GCTCGTCGCC
GCCGCGCCGT CCGTCGGGGG CGTCGACTCC AGTCCCAACT TCGGCATCCC GGACCTGGTC
TTCATCCACC GCACGGGGTG A
 
Protein sequence
MNVALNRRAF GRAGLLAVGA LTAPALLAAC GDDAPAAPGA GTGAAGATAA PRRGGKLRAA 
FASSPADTLD IVKGHDVLGS VRGLAVYERL GDAHPDGSVS PRLFESLTPN ADATVWTLKL
KPGITFSDGR PLTTADVLAS FATFTGTESG AAIAAFDPKE SKATDARTAT IALTAPVYDL
PARVSGVVLV IMPEGKPASQ LGDVVGSGPY EIASFVPGQR TVLRRRTDYW DGDARGYLDA
IELVAAPDAK SRLSALRAGQ VDWADDIAYL DASTLREDRA ITIHRGAAEQ GLAWFLNMAA
PPFDDERVRQ ALRYSVDRQK LVDTTLFGFG SVGNDLWGKG LPNYNGSIPQ RPHDPAKAKS
LLQEAGVATP AKATLLTSPI GPGLVEATQL LADQAREVGF DIKVEVIPPD VYFARPEEWA
KASGVAFAQV GAFTDMAPLV YLSDGPFNFG WRKPDWDAGF ADGVGELDAA KRKATFDGLQ
QQLWDSGSDL VWGFAPRLVA AAPSVGGVDS SPNFGIPDLV FIHRTG
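
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal Python sketch of that cleanup, together with a GC-fraction calculation of the kind that could be compared against the "GC content" field. The helper names are hypothetical and are not part of the record or of any particular library.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence in this record, as an example.
first_blocks = "ATGAACGTTG CTCTCAACCG"
print(clean_sequence(first_blocks))
print(gc_fraction(first_blocks))
```

To apply it to the whole gene, paste the full "Gene sequence" text into one string and pass it to `gc_fraction`. Multiply the result by 100 to get a percentage.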