Gene Franean1_3464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3464 
Symbol 
ID5671835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4093947 
End bp4095536 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content72% 
IMG OID641242352 
Productextracellular solute-binding protein 
Protein accessionYP_001507772 
Protein GI158315264 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.739249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGCG GACGCCCGCC AAGACCCCTC CGATCATTCT GGTCCGCCCG GCGGTTGCGG 
ACCGCGGCGG CGGCCGCGCT GGCCGCGCTG GCGGTCAGCA CGGTCGCCGC CTGCGGCGGT
TCCACCGCCG CCGACGGGAC CGCGCCGGCG GCGCAGGGCG GGACCCTCAC CGTCCTGTGG
CCCGCCGAAC CACTCGACCT GTCCACCGAC AGCGGCTTCG GACTACAGAT GATCTCCGGA
TCGATCGAGC GCCTCGCCGT GTACGACGCC CTGGTCGCGA TCACCCCGGC CGCCGAGCTG
GACTACCGCC TGGCCACCTC CCTCGACTCC GACGACTCGC TGACCTGGAC GCTGCGGCTG
CGCGAGGGGC TGCGGTTCAG CGACGGCACC CCGCTGGACG CCGCCGCGGT CCGGGACAAC
TGGACCCTGC TGGCCGATCC GGCCCGCAAG TCACCCAGCG CCAAGATCGC ACAGCGGATC
GGGTCATTCA CCATCGTGGA CCCGACAACC CTGCGGATCA CGCTGAAGGA GGCCGACGGC
CAGTTCCCCC GGCTGGTCGC GCAGACCCCG CTGACCTTCA TCGGCTCCCC CACCGCGCTG
CGCGCCAAGG GCGACGGGTT CAAGACCGCG CCGGTCGGCG CCGGGGCCTT CACCGTCCGG
GAGTGGCTGC GCAACGACCA CCTGACTCTG GTCCGCAACC CGACCTCGTC GGTCCAGGCG
CACTACGACA CGATCGTCGT CAAGTCCGTC CCGGACGAGA CGCAGCGCTA CAACACCCTG
CTCGCGGGGG GCGCGGACAT CGCCTTCTCC GCGAACCTGC GCACCGGCAT CACCGCGGTC
GCGGCCGGGC TGGTCACCGA GAAGGCGTTC AGCGACGGCG GGCTCAACCT GCTGTTCAAC
ATCACCAAGG CCCCCTTCGA CGACATCCGG GCCCGCCGGG CGGTCTCCTA CGCCCTCGAC
GCGCAGGCGC TGAACAAGGC CCTGTTCGAC GGGACCGCCG CCGTGCCGTC CAGCTTCCTG
CGCGACGACT CGCCGCTGCA CAGCGACGTG CCGCTGCCCC GCCCGGACCG GGCGAAGGCC
CAGGCCCTGT TCGACGAGCT CGCCGCCGCG GGCAAGCCGG TGCAGTTCAC CATCATCTCG
CCGCTGAACT TCAGCAACGT CGCCGAATGG GTGCAGTCCA GCCTCGGTGG CTTCCGGAAC
GTCAGCGTGA AGGTCGACGC GATGGCCCAG ACGCTGCCCG TGCTCCAGGG CGGCTTCCAG
GCCACGCTCA CCGGCACCCC GCGGTTCGTG GACCCCTACC CGCAGCTGGC CCTGAACCTG
GGCACCGGCG GCCCGAGCAA CTACGGCAAG TTCTCCGACC CGGCCCTCGA CGCCGCGCTG
CGGGAGGGGC AGCAGTCCCG GGACACGGCC GTCCGGGTCC GGGCCTACGA GACCGCGCAG
CGGATCATCG CCGAACAGCT GCCGCTGGCC GGCCCGCTGT ACCGCCTGCC GGGCCAGTAC
CTACACGCGT CCACGGCCTT CGGTACGGGC AAGCTCCCGA TCATCAACGA CGGCGTGCTC
GACATCACCC GGCTCACCGG GGCGGGGTGA
 
Protein sequence
MRRGRPPRPL RSFWSARRLR TAAAAALAAL AVSTVAACGG STAADGTAPA AQGGTLTVLW 
PAEPLDLSTD SGFGLQMISG SIERLAVYDA LVAITPAAEL DYRLATSLDS DDSLTWTLRL
REGLRFSDGT PLDAAAVRDN WTLLADPARK SPSAKIAQRI GSFTIVDPTT LRITLKEADG
QFPRLVAQTP LTFIGSPTAL RAKGDGFKTA PVGAGAFTVR EWLRNDHLTL VRNPTSSVQA
HYDTIVVKSV PDETQRYNTL LAGGADIAFS ANLRTGITAV AAGLVTEKAF SDGGLNLLFN
ITKAPFDDIR ARRAVSYALD AQALNKALFD GTAAVPSSFL RDDSPLHSDV PLPRPDRAKA
QALFDELAAA GKPVQFTIIS PLNFSNVAEW VQSSLGGFRN VSVKVDAMAQ TLPVLQGGFQ
ATLTGTPRFV DPYPQLALNL GTGGPSNYGK FSDPALDAAL REGQQSRDTA VRVRAYETAQ
RIIAEQLPLA GPLYRLPGQY LHASTAFGTG KLPIINDGVL DITRLTGAG