Gene Francci3_1898 details


Gene Information

Locus tag: Francci3_1898
Symbol:
ID: 3906847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2231546
End bp: 2232661
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 73%
IMG OID: 637879236
Product: cation diffusion facilitator family transporter
Protein accession: YP_481003
Protein GI: 86740603
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0053] Predicted Co/Zn/Cd cation transporters
TIGRFAM ID: [TIGR01297] cation diffusion facilitator family transporter
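
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(2231546, 2232661)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the computed values agree with the reported Gene Length (1116 bp) and Protein Length (371 aa).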


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.338439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATGGC AGCTGCCACC ATGCCGCATT GATCGTTTTC GTCTATCGTC GCGGAACATG 
AACAGCGGCC ACGAGCATGA GCTCCCCGCC CAAGGGCACG GGCACGGCGG GCACCCGCGC
GGCGGACGGC ATGACCACCA CGCGAGCGGG CGGTGGACGC GGCTTCGTCA CGGCCTGTCA
GACCTGGCCG GAGGGCACAG CCACGACCCG GCGGACCAGA TCGACGACGC GTTGGAAGCC
GACACCGCCG GCCGCCGCGC CCTCCTGATC AGCCTGGCCG GTCTCGGGCT GACCGCCGCC
CCGCAAGCCG CCGTCGTGGC ACTGTCCGGA TCGGTCGCGC TCCTCGGCGA CACCCTGCAC
AACGTCGCCG ACGCGCTCAC CGCGGTCCCC CTGCTCATCG CCTTCACCGT GGCACGCCGC
CCGGCCACCG CCCGGTTTAC CTACGGCTAC GGCCGCGCCG AGGACCTCGC CGGCCTCGCC
GTCCTCGCGA TGATCGCCCT GTCGAGTGCG CTCACCGCCT GGGCCGCGAT CGACCGCCTC
CTGCACCCCC AGCGCGTCGG CCATCTGGGA GCGGTCGCCG TGGCCGGGCT CGTCGGCTTC
CTCGGCAACG AGATCGTCGC CCGCTACCGC ATCAGGATCG GCCATCAGAT CGGCTCCGCC
GCCCTCGTCG CCGACGGCCT ACACGCCCGC ACCGACAGTC TCACCAGCCT CGCGGTGCTC
CTCGGCGCGG CCGGTGTCGC GGTGGGCTGG CACTGGGCCG ACCCCGCCAT CGGCCTGGCG
ATCACCCTGG CGATCCTCGG AGTCCTGCGC TCCGCCGCTC GCGTCGTCGG GGCCCGGCTC
ATGGACGCCG TCGACCCCGC CGTGGTCGCC GAAGCCACCA GGGCGCTCCT GCACACCGAG
GGCATCGAGG CCGTCCGCGA ACTGCGGCTG CGCTGGATCG GCCACACCCT GCGCGCCGAA
GCCGACGTCA CCGTCGATGC GAACCTGACC CTGACCGCCG CTCACGACCT CGCCCACGCC
GCCGAAGCCC ACCTGCTGCG CCGCATCCGC CGCCTGTCCG CCGCTACCAT CCACACCAGC
CCCACCCACC ACCACGCCGC CACGACGGTC CCCTAA
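
The reported GC content can be recomputed from the gene sequence itself. Below is a minimal sketch, assuming the sequence is pasted as shown here (blocks of ten bases separated by spaces and line breaks); `gc_percent` is an illustrative helper, not part of any database tool.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above; paste the full sequence
# to compare against the GC content reported in the record.
first_line = ("ATGGAATGGC AGCTGCCACC ATGCCGCATT "
              "GATCGTTTTC GTCTATCGTC GCGGAACATG")
print(round(gc_percent(first_line), 1))
```

A single line is only a sample; the value for the whole gene is what should be compared with the record.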
 
Protein sequence
MEWQLPPCRI DRFRLSSRNM NSGHEHELPA QGHGHGGHPR GGRHDHHASG RWTRLRHGLS 
DLAGGHSHDP ADQIDDALEA DTAGRRALLI SLAGLGLTAA PQAAVVALSG SVALLGDTLH
NVADALTAVP LLIAFTVARR PATARFTYGY GRAEDLAGLA VLAMIALSSA LTAWAAIDRL
LHPQRVGHLG AVAVAGLVGF LGNEIVARYR IRIGHQIGSA ALVADGLHAR TDSLTSLAVL
LGAAGVAVGW HWADPAIGLA ITLAILGVLR SAARVVGARL MDAVDPAVVA EATRALLHTE
GIEAVRELRL RWIGHTLRAE ADVTVDANLT LTAAHDLAHA AEAHLLRRIR RLSAATIHTS
PTHHHAATTV P
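
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation follows, using only the Python standard library. It assumes the sequence is read in frame from its first base and that translation stops at the first stop codon. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino-acid
# assignments shared by the standard code and table 11).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence up to the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGAATGGC AGCTGCCACC ATGCCGCATT"))  # MEWQLPPCRI
```

The ten residues printed match the first block of the protein sequence above.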