Gene Franean1_3755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3755 
Symbol 
ID5672120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4447423 
End bp4449096 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content67% 
IMG OID641242636 
Productextracellular solute-binding protein 
Protein accessionYP_001508056 
Protein GI158315548 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC GCCTTCACTC GCACCGGCCT GGCCTGGCCG CCATGGCCCT TGCCGTCACG 
GCCGCGCTGG GCCTGGCTGC GTGTGGCTCC TCCGGCGATG ATGACGGTGC CGCCGGCGAC
ACCGCGGGGA CGCCGGTAGC CGGTGGGACT CTGAAGGTCG CCTTCTTCCC CGACAACCCG
ACGTTCACGT GCCTCGACCC GTTCCAGACC TACTGGATCG AGCACCGCAC GGTGATCCGC
AACGTCGCCG ACTCCCTGAC CGACCAGGAC CCCAAGACCG GCGAGATCAA GCCCTGGCTC
GCCGAGAAGT GGGAGATCAG CGCGGACGGG AAGGAATACA CCTTCCACCT GCGTGACGGC
GTCACCTTCA GCGACGGCAC CCCGCTCGAC GCCGCGGCGG TCAAGGCCAA CTTCGACGGC
GACAAGAGCG TCGTGGAGGA GAGCGGAGGC ACGGCCTACG GCGCCAGCTA CATCCTCGGC
TACGACCACA GCGAGGTCGT CGACCCGAGC ACCGTCAAGA TCTTCTTCTC GACGCCGAAC
GCCTCGTTCC TGCAGGCCAC CTCGACGACC AACCTGGCGA TCATCTCGCC GGCGTCGTAC
AAGAAGACCT CCAAGGAGCG CTGCCTCGGC GACTACGTCG CCTCCGGGGC GTTCACGCTG
GGGAGCTACA AGCCCAACGA GCTCACCACC CTCAAACGGC GGCCGGGCTA CGCATGGGGC
TCGGCGCTGT CGGAGAACAC CGGCGAGGCC CACCTCGACA CGGTCGAGTT CAGCTACGTC
GCCGAGGACA GCGTCCGCAC CGGCAACCTG CTCAGCGGCA CCGTCGACAT CGCCTGGCCG
CGTAACCCCT TCACGGTCGA GGACCGCGAG CTGATCGAGA AGTCGGGTGA CGTCGTCGAG
TCCCGGCCGC TGCCGGGCCC GGCGTCCGTG TTCTTCCCCA ACGTGAGCGC GGGGCGTCCG
CTGGCGGACC TCAACGTCCG CAAGGCGCTG TACAAGGCGT TCGACCTCGA GACCTACGCC
AAGACCGTAT TCGGAGACGA CTACCCGGTC GTCACCGGCG CCTTCAACTC GACGACGCCG
TACTTCGTGT CGCAGGCCGA CAAGCTCCGC CACGACCCGG CGGGCGCGGG CAAGCTCCTC
GACCAGGCCG GCTGGAAGCT CGGCCCCGAC GGCTATCGCT ACAAGGACAA CCAGAAGCTC
ACGCTGAAGA CGCCGACCAC CACGTTCAAC GTCGGTGCCG AGCTCATCCA GGACCAGCTC
AAGCAGGTCG GCATCGACCT CGTGCTCGAC ACCACGACGA CGGCCGAGCT TCCCGCGAAG
TACAAGAACG GCGACTACGA CCTGGCCGGC AGCTACTTCA CCCGGGCCGA CCCGGGTGCG
CTGCAGTTCA TCCTCGACCC GGCCCACGCC AACTCCAAGG CGCTCGCGAC GAACGCGACG
ACCCCGCAGA CCCTGGCGAA GCTCACCGGG CTGTTCGCCA AGGCGGCGCA GACCACCGAC
CCGGCGCAGA CCAAGCAGGC CTACACCGAC CTGCAGAACC TGCTCATCGA CGAGGGCGTG
TCGTTCCCGC AGTTCGAGCG GGTGCAGTAC GCGGGGGTCA GCAGCCAGGT CCACGGCTTC
GCGTTCACGT CGGAGAGCTT CCTGAAGCTC AACGACGTGT GGAAGCAGCA GTAG
 
Protein sequence
MKRRLHSHRP GLAAMALAVT AALGLAACGS SGDDDGAAGD TAGTPVAGGT LKVAFFPDNP 
TFTCLDPFQT YWIEHRTVIR NVADSLTDQD PKTGEIKPWL AEKWEISADG KEYTFHLRDG
VTFSDGTPLD AAAVKANFDG DKSVVEESGG TAYGASYILG YDHSEVVDPS TVKIFFSTPN
ASFLQATSTT NLAIISPASY KKTSKERCLG DYVASGAFTL GSYKPNELTT LKRRPGYAWG
SALSENTGEA HLDTVEFSYV AEDSVRTGNL LSGTVDIAWP RNPFTVEDRE LIEKSGDVVE
SRPLPGPASV FFPNVSAGRP LADLNVRKAL YKAFDLETYA KTVFGDDYPV VTGAFNSTTP
YFVSQADKLR HDPAGAGKLL DQAGWKLGPD GYRYKDNQKL TLKTPTTTFN VGAELIQDQL
KQVGIDLVLD TTTTAELPAK YKNGDYDLAG SYFTRADPGA LQFILDPAHA NSKALATNAT
TPQTLAKLTG LFAKAAQTTD PAQTKQAYTD LQNLLIDEGV SFPQFERVQY AGVSSQVHGF
AFTSESFLKL NDVWKQQ