Gene Francci3_0425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0425 
Symbol 
ID3903614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp506306 
End bp508054 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content74% 
IMG OID637877757 
Productsulphate transporter 
Protein accessionYP_479541 
Protein GI86739141 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGG GGCAGGCCGA GGACCCCCGG CCCCACTGCC CCGCCCCCAT AAGGACGGCC 
CCGTCACGCT GGCCCAGGCC ACTCGGCCGG CTGGAACTCG GCCGGCTCCT GCCCGGCCGG
GCCGATATCG AGAGCATGCG CCGCCAGCCA GGCCGTGACC TGCTCGCGGG GATGACCGTC
GCGGTGGTCG CGCTGCCGCT CGCGCTGGGA TTCGGGGTCT CCTCCGGCCT CGGCGCGGCC
GCCGGACTGG TGACGGCGAT CGTGGCGGGT GCTCTCGCGG CGGTCTTCGG CGGGTCGAAC
CTGCAGGTGT CCGGGCCGAC CGGCGCGATG ACCGTCGTGC TCGTGCCGAT CGTGCACCGG
CACGGGCCGG CTGGTGTGCT GACCGTCGGG CTGCTCGCCG GGATGATCCT GCTCGCGCTC
GCGGTCCTGC GGGTCGGCCG CTCCGTGCGA TATGTGCCGG CGCCCGTGGT GGAGGGATTC
ACCGTCGGCA TCGCCGCGGT GATCTTCCTT CAGCAGGTTC CCTCGGCCCT GGGGGTGCAC
GCCCGAAACG GTGACCGGGT GACCGGCGTG GCCTGGGCGG CCGTGGCCGA CTTCGTCCAC
CGGCCGCACT GGCTCGCGCT GGCGCTCGCC GTCGGCGTCG CGGCGACGAT GCTGGTCGGC
GCCCGGCTGC GGCCCACGGT CCCGTTCTCG CTGCTCGCCG TGGCCCTCGC GACGGCCCTC
GTCACGGCCC GCGAGTGGGA CGTCACGACC CTCGGCGCAA TCCCCGCCAC CCTGCCCGGT
CCCTCGCTCG GCTTCCTCGA CATCACCGCG ATCCCGACGC TCGCCACGGC CGCCGTCGCC
GTGGCCGCGC TCGCAGCCTT GGAGAGCCTG CTCTCGGCCA GTGTGGCCGA CGGGATGACC
GTCGGCCAGA ACCACGACCC CGACCGGGAG CTATTCGGCC AAGGGCTGGC CAACCTCGCC
GTCCCGCTGT TCGGCGGGGT GCCCGCCACC GGCGCGATCG CCCGCACCGC GGTCAACGTC
CGCTCGGGTG CCTCGTCCCG GCTCGCCGCC CTCGCGCACG CCGTGGTCCT CGCCGGGGTG
ATGCTCGCAG CCGCTCCCCT GGTCGCGCGC ATCCCGCTCG CCGCGCTCGC CGGTGTTCTG
CTCGCCACGG CGGTGCGGAT GGTCGAGGTC TCCTCGTTGC GCACCCTCGG CCGTGCGTCA
CACTCCGATG CCCTGGTCAT GGCGCTGACC GCCGGGGCGA CCCTGGCGCT CGACCTCGTC
ATCGCCGTCG TCCTCGGCCT CGCGGTCGCC GGCGCCCTCG CCCTGCGCGC AATCGCCCGA
ACCGCGCGGG TCGAGGAGGT GCCGCTGGAC CCCGGCGATC ATCACGATTC CGAACACGCG
CTACTCGCCG AGCACATCGT CGCCTACCGT ATCGACGGCC CCCTGTTCTT CGCCGCCGCC
CACCGCTTCC TGCTGGAGCT CACCGAGGTG GCCGACGTCG AGATCCTCAT CCTCCGGCTG
TCCCGGGTCA GCGCCGTCGA CGGCACCGGC GCCCTCGTCC TGCGCGACGT CATCGACCGG
CTCGAACACC GCGGCGTCAA GGTGTATGTG TCGGGGCTGC GACCCGAGCA TGCCAAAGCG
GTGGCCGCCG TCGGTCTGGT GGACCGGCTG CGCGGCCAGG GACGCCTGTT CGACTCCACC
CCGTCGGCCA TCCAGGCTGC CCGGGACCAT CTCCACCACA CCGGCGCCCT GGCTGCCGTC
CCCACCTGA
 
Protein sequence
MTTGQAEDPR PHCPAPIRTA PSRWPRPLGR LELGRLLPGR ADIESMRRQP GRDLLAGMTV 
AVVALPLALG FGVSSGLGAA AGLVTAIVAG ALAAVFGGSN LQVSGPTGAM TVVLVPIVHR
HGPAGVLTVG LLAGMILLAL AVLRVGRSVR YVPAPVVEGF TVGIAAVIFL QQVPSALGVH
ARNGDRVTGV AWAAVADFVH RPHWLALALA VGVAATMLVG ARLRPTVPFS LLAVALATAL
VTAREWDVTT LGAIPATLPG PSLGFLDITA IPTLATAAVA VAALAALESL LSASVADGMT
VGQNHDPDRE LFGQGLANLA VPLFGGVPAT GAIARTAVNV RSGASSRLAA LAHAVVLAGV
MLAAAPLVAR IPLAALAGVL LATAVRMVEV SSLRTLGRAS HSDALVMALT AGATLALDLV
IAVVLGLAVA GALALRAIAR TARVEEVPLD PGDHHDSEHA LLAEHIVAYR IDGPLFFAAA
HRFLLELTEV ADVEILILRL SRVSAVDGTG ALVLRDVIDR LEHRGVKVYV SGLRPEHAKA
VAAVGLVDRL RGQGRLFDST PSAIQAARDH LHHTGALAAV PT