Gene Franean1_3479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3479 
Symbol 
ID5671850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4136531 
End bp4138171 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content69% 
IMG OID641242367 
Productextracellular solute-binding protein 
Protein accessionYP_001507787 
Protein GI158315279 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.116614 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0951056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCGC GCCGAACGGC ACGAAGAGCA GGGCTGTTCG CGGCCCTCGC CGTCACCCTG 
GCCACCGCGG CCTGCGGATC CGACGGCGGC GGGAGCGGCA CCGCGGACGC CCAGCCACGC
GCTGGCGGAA GCGTCACCTA CGCCGCCCGG CAGGAGCCGG ACTGCTGGGA CCCCCATGCC
AGTGCCCAGG ACGTCACCGC GTTCGCACAG CGCTCGGTGT TCGACTCGCT CGTCTACCAG
ACGCCCGACG GCGCGTTCGA GCCGTGGCTG GCGAAGTCCT GGAAGATCAG CGACGACGGC
CGCACCTACA CCTTCGAGCT GCGCGACGAC GTCACCTTCC ACGACGGCGC CAAGCTCGAC
GCGGAGGCGG TCAAGGCGAA CTTCGACCAC ATCATGGCCA AGGACACCGA GTCGCAGTTC
GCCGCCGGGC TGCTCGGGCC GTACGAGGGC GTGAAGGTCA CCGGCCCGCA GGAGATCCAG
GTCTCGTTCA GCCGTCCCTA TGCGCCGTTG CTGCAGGTCG TCAGCACCAC CTTCCTCGGC
ATCGCCTCAC CGGCGTCGCT GAAGGCCGGC TCCGAGAAGC TGTGCTCGGG CACCGACTCG
ATCGGGTCGG GGCCGTTCAA GGCCGACGCC TACACCCGCG GCCAGCAGCG CTCCTACACC
CGGTACGCCG ACTACGACTG GGCACCGAAG AGCGCCGGGC ACAGCGGCCC CGCCCGGCTG
GACTCGGTCA CGATCCGGTT CATCACCGAG GAGGCCACCC GGGTCGGCGC GCTCAGCTCC
GGCCAGGTGG ACGGCGCCGC CGACATCCCG GCCAACCAGA TCGCCTCGGT CAGCAAGAAC
CCGCGGCTGA CCACGATCAG CAAGCAGGTG CCGGGCGCCG TCGACGCCTT CTACCTCAAC
ACCAAGAGTG AGCTGTTCTC CGACGTCCGG GTGCGCAAGG CGTTCCAGCG CAGCCTCGAC
CTGGGCACCA TCGTGAAGTC GGTGTTCCAG GGCACCACCG AGCGGGCGTG GAGCCCGCTG
TCCCCGACCA CGCCGAACAG CTACGACCCG TCGCTGGAGA AGACCTGGCC GTACGACCCG
AAGCTGGCCG GGCAGCTGCT CGACGAGGCC GGCTGGACTG GGCGCGACGC CGAGGGCTAC
CGCACCAAGG ACGGCAGGCG GCTGACCGTC TTCGCGCCGA TCTACGGCGA GGCGACCGTC
TTCTCCCAGG CGGCCCAGGC CGAGCTGAAG AAGATCGGCT TCTTCCTCGA CCTGCACGCC
TCGACGGACG CGGCCGAGAT CTCCGGCCTG CTGGACGGGG GGAAGTACGA CACCGTCGAG
CTGCAGTGGG CCCGCCCGGA CGGTGACATC CTGAGCTCGT TCTTCCTGTC CACAGAGACC
TCCGTGGGCG GCGGCCACAA CTTCGCCCTC GTCGCCGACC CGCAGGTCGA CGAGTGGCTG
AAGGCGGCCC AGGCCGAGCA GGACCCGAAG GAGCGGGCGA AGTACTACTC CCAGGTTCAG
AAGTGGACAA TCGACCAGGC CGTGGTCGTC CCGGCGTACA TCAAGAACGC GACCGTCGGG
GTCAACAAGA AGGTGCATGG CCTGCGGCTG AGCATCGCCA CCTGGCCCGA GTTCTACCCC
GCCTGGGTGC AGGCCGACTG A
 
Protein sequence
MRSRRTARRA GLFAALAVTL ATAACGSDGG GSGTADAQPR AGGSVTYAAR QEPDCWDPHA 
SAQDVTAFAQ RSVFDSLVYQ TPDGAFEPWL AKSWKISDDG RTYTFELRDD VTFHDGAKLD
AEAVKANFDH IMAKDTESQF AAGLLGPYEG VKVTGPQEIQ VSFSRPYAPL LQVVSTTFLG
IASPASLKAG SEKLCSGTDS IGSGPFKADA YTRGQQRSYT RYADYDWAPK SAGHSGPARL
DSVTIRFITE EATRVGALSS GQVDGAADIP ANQIASVSKN PRLTTISKQV PGAVDAFYLN
TKSELFSDVR VRKAFQRSLD LGTIVKSVFQ GTTERAWSPL SPTTPNSYDP SLEKTWPYDP
KLAGQLLDEA GWTGRDAEGY RTKDGRRLTV FAPIYGEATV FSQAAQAELK KIGFFLDLHA
STDAAEISGL LDGGKYDTVE LQWARPDGDI LSSFFLSTET SVGGGHNFAL VADPQVDEWL
KAAQAEQDPK ERAKYYSQVQ KWTIDQAVVV PAYIKNATVG VNKKVHGLRL SIATWPEFYP
AWVQAD