Gene Franean1_2440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2440 
Symbol 
ID5670836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2901455 
End bp2902552 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content76% 
IMG OID641241357 
Productcation diffusion facilitator family transporter 
Protein accessionYP_001506778 
Protein GI158314270 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1230] Co/Zn/Cd efflux system component 
TIGRFAM ID[TIGR01297] cation diffusion facilitator family transporter 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.516557 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.641383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACCC CCGAAGGCCG CCACGGTGAC CACGGGCACG ACAACGGTCA GGGTCTCGTC 
CCGCCGCAGA CCCGGCCCGC CCACGGCCAC GACTCCGGCC ACACCCACGC CCACGGCGGG
CCCTGCGCGC CGGGCGGGCA CGAGCACGCG GCGGGTGGGT CCGCTCCGCG CGGCTTCGAC
TCCCAGCACC GCAGGCTGGC GTTCGCGACG GGCCTGAACG TCGCGATCGT CGTCGGCCAG
GCCGCGGCCG GGCTGCTCGT CGGGTCGGTC GCCCTGCTCG CCGACGCCGC GCACAACCTC
GCGGACGCGG CGGGCGTCGC GTTCGCGCTG ATGGCGATCC GGCTCGCGAG GCAGGCGCCG
TCCGCCACCC GGACCTTCGG CGGGCTGCGC TGGCCGGTGC TCGCCGCCCA GGCGAACGCG
GCGAGCGTGC TGGTCGTGAC CACCCTGGTC TGCGTCGAGG CCGCCGGGCG GCTCGCCCAC
CCCGAGCCGG TCGACGGCTT GGTCGTCCTG ATCGTGGCGA TCGCCGCGGC CGTCGGTAAC
GGGGTCAGCG CGCTCTTCGT CCACGAACGG CACGGTGATC TCAACACCAG GGCGGCGGTC
ACCCACCTGG CCGGCGACGC GCTGGTGTCG GTCGCGGTGG CCGGCGCCGG GCTCGTCATC
TGGCTCACCG GCGGCTGGTA CTGGCTCGAC CCGGCGCTCT CCCTGGTCGT GGCGGCGCTG
ATCGGGATCC AGGGCGTGCG CCTGCTGGCC GAGTCGTCCC GGGTGCTGCT CGAGGCGACC
CCCGTCGGGC TGGACCTGGC GGCAGTCCAG GCGGACGTCC TTGCCGTGGA GGGCGTGACC
GGGGTGCACG ACGTGCACGT GTGGGGCCTG TCCGACCGGG TCGCCGCGGC GAGCGCCCAT
GTCGAGGTGG CCGGCCATCC GACACTCGAG GAGGCGCGGG CGGTCTCAGA CCGGGTCAAG
GCGGTGCTGG CGGAGAAGCA CGGCGTCGTG CACGCCACCG TCGAGACGGA GTGCGAGCCG
TGCTCGCCCG CCGGCGGCGA CCCGTGCGAC GTGCGCAGGG TGACCGTGCA CCAGCTGGCC
CCGGCGCACC GCCACTGA
 
Protein sequence
MNTPEGRHGD HGHDNGQGLV PPQTRPAHGH DSGHTHAHGG PCAPGGHEHA AGGSAPRGFD 
SQHRRLAFAT GLNVAIVVGQ AAAGLLVGSV ALLADAAHNL ADAAGVAFAL MAIRLARQAP
SATRTFGGLR WPVLAAQANA ASVLVVTTLV CVEAAGRLAH PEPVDGLVVL IVAIAAAVGN
GVSALFVHER HGDLNTRAAV THLAGDALVS VAVAGAGLVI WLTGGWYWLD PALSLVVAAL
IGIQGVRLLA ESSRVLLEAT PVGLDLAAVQ ADVLAVEGVT GVHDVHVWGL SDRVAAASAH
VEVAGHPTLE EARAVSDRVK AVLAEKHGVV HATVETECEP CSPAGGDPCD VRRVTVHQLA
PAHRH