Gene Franean1_4820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4820 
Symbol 
ID5673161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5754820 
End bp5756490 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content71% 
IMG OID641243676 
Productextracellular solute-binding protein 
Protein accessionYP_001509092 
Protein GI158316584 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.161411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGAA CCATCCGGCC GCGAAGGCCG GCCCACCTGG CGCGCACATC GCGCCTGCTC 
AGCGCATTCG CGCTCGTACC CGCGCTCGTG CTGGGACTCG CGGCGTGCGG CGACGACGGG
GACGACGCCG GCGGCGAGGC CGCCTCGCCA ACGTCCGGCG GAACGCTGAC CATCGCGCTG
AGCAGCGACC CGGTGAGCAT CTACCCGCGG GCGGTGAGCC CGATCGTGCG CGACGTGGCC
CGGCCGGTCG TCGACTCGCT GCTCGCGCTC GACCCGAAGA CCCGCCAGCC GGTGCCGTGG
CTGGCGGAGA AGTACGAGGC CAACGCCGAC GCCACCGAGT ACACCTTCCA CCTGCGCGCG
GGCGTGACCT TCTCCGACGG CACGCCGCTG ACCGCCGAGG TGGTGAAGCG GAACTTCGAC
GACATCCTCG CCAACGCGGC GAAGAGCACT GGTTCCGCGG CGCTGTCGAC GCTGAAGACC
AGCGGCTATA CCGGCACCGA CGCCGTGGAC GAGCTCACCG TCGTCCAGCA TTTCGGCGCG
GCGTTCCCGG CGTGGCCGGT CGCGCTGGCG AACACCGGAT TCGGCATCCT CGCCCCGGCC
ACCCTGGATC TGCCCTACGA GCAGCGCTTC AACAAGGTCG TCGGCTCCGG TCCGTTCGTC
CTCACGTCGT ACACGAAGAA CAGCGAGGTC GTGCTCAGCC GGCGCGACGA CTACCGGTGG
GCGCCGCGGT ACCGCTCACG TACCGGTGAC GCCTACCTCG ACAAGGTCGT CTACCGGATC
ATCGAGGAGC CGAGCGTCCG CGCCGGTGCC CTGCAGAACG GCCAGGTCCA GGTGTCGACG
TCACTCAACC CGGCCGACAT CGAGGCCGCC GGGTCCGCGG GCGCCACGGT CATCACGCAG
CCGCTGCCCC GCAACACCGA GGCCCTCGTC GTGACCGGCG CCGACCGGAC GCCGCTCAAC
GAACTACCCG TCCGCCAGGC GATCGTGCGC GCCGTCGACA CCGGCGCGAT CCGCGACTCG
CTGCTGCACC CGTCGTTCAA GCTGCCCACG AGCGTCCTCA CCTCGACCGT CATCGGGTGG
GCGGACCAGT CAGCGCATCT GAAGACCGAC GTCGACGAGG CGAACCGCCT GCTGGACGGG
GCCGGCTGGC GGCGCGGCGG CGACAGCGGC ATCCGGGAGA AGGACGGCCG GAAGCTCACC
GTCGTCTTCG GCTGGATCGA TCGCGGCAAC CCCTGGGACC AGGGCCTGGT GGAACTACTC
AAGGCGCAGC TCGCCGAGGT CGGCATCGAC CTGCAGCCCC GGCTGGACAC CGCCGCCGCC
GCCGTCGAGG CCCTCGGCAA GCACGACCAG TACGACCTGT TCCTGGCGGG TGTCGCCGGG
GGCGTCGACC CGGACAACGG GCTGCGTGGC AGCTTCGCCA ACGCCGCGCC GAACATCTAC
AACGTCCAGG ACACGGCCCT GCAGCCACTG CTGCAGCAAC AGGCGGTGAC GACCGACCCG
GACAAGCGGG CCGGCATCCT CGCCGACGTC CAGGAGCGGA TCGCCGAGCA GGGCCTCGCC
GTGCCGCTCG TCGAGGACAC CGCGGTCGTC GGCGTGGGGG CGAACGTGCA CGACTTCGCC
CTCGACTTCG ACTCCCGCAT CCCCCCGCTG TTCGACGTCT GGGTCTCCTG A
 
Protein sequence
MSRTIRPRRP AHLARTSRLL SAFALVPALV LGLAACGDDG DDAGGEAASP TSGGTLTIAL 
SSDPVSIYPR AVSPIVRDVA RPVVDSLLAL DPKTRQPVPW LAEKYEANAD ATEYTFHLRA
GVTFSDGTPL TAEVVKRNFD DILANAAKST GSAALSTLKT SGYTGTDAVD ELTVVQHFGA
AFPAWPVALA NTGFGILAPA TLDLPYEQRF NKVVGSGPFV LTSYTKNSEV VLSRRDDYRW
APRYRSRTGD AYLDKVVYRI IEEPSVRAGA LQNGQVQVST SLNPADIEAA GSAGATVITQ
PLPRNTEALV VTGADRTPLN ELPVRQAIVR AVDTGAIRDS LLHPSFKLPT SVLTSTVIGW
ADQSAHLKTD VDEANRLLDG AGWRRGGDSG IREKDGRKLT VVFGWIDRGN PWDQGLVELL
KAQLAEVGID LQPRLDTAAA AVEALGKHDQ YDLFLAGVAG GVDPDNGLRG SFANAAPNIY
NVQDTALQPL LQQQAVTTDP DKRAGILADV QERIAEQGLA VPLVEDTAVV GVGANVHDFA
LDFDSRIPPL FDVWVS