Gene Francci3_3975 details


Gene Information

Locus tag: Francci3_3975
Symbol:
ID: 3906935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4756544
End bp: 4758124
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 70%
IMG OID: 637881303
Product: sulphate transporter
Protein accession: YP_483054
Protein GI: 86742654
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
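
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, that the gene is unspliced, and that the coding sequence includes its stop codon. The function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive coordinate span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from the record above
print(cds_lengths(4756544, 4758124))
```

Under these assumptions the result agrees with the Gene Length (1581 bp) and Protein Length (526 aa) fields of this record.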


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTGATG TAACCGCCCG AGAGGCCGCC GACGCCGGGG GCTCCGTTCG CCGGACGGCC 
CCCAGCGGCC CCTGGTACTC CCGCCGCCGC ACCGGCCGCT GGCCGGGAGT TCGTCACATC
CGGGTGGAAC TGCTCGCCGG CCTGGTGACC GCGCTCGCGC TGATCCCGGA GACGATCTCC
TTCTCTGTGG TGGCGGGGGT CGATCCGAAG GTCGGTCTGT TCGCGTCGTT CACCATCTCG
GTGGTCATCG CGTTCACCGG CGGACGCCCG GCGATGATCT CGGCGGCGGC AGGCTCCATG
GCCCTCGTCG CCGCCCCCCT GGTCCGCGAT CACGGCCTGA ACTACCTGCT GGCGACGACC
ATCGGGGTGG GCCTGCTGAT GTTTGTGCTC GGCCGGCTCG GGGTGGCACG GTTGATGCGG
TTCGTCCCGC AGAGCGCGAT GATCGGCTTC GTCAACGCGC TGGCCATCCT CATCTTCACC
GCGCAGATGC CGCACGTGAT CGGCAAGTCG TGGCAGGTCT ACGCCCTCGT CGCCGCGGGC
CTCGCCATCC TGCTGGCTCT CCCCCGCCTG ACCAGAGCGG TCCCGCCGCC GCTCGTGGCG
GTGGCGGTCC TCACCCTCGT CACCGCCGGT TTCGACATCT CCGTCCCTAC CGTCGGCGAC
GAGGGCCGGC TGCCGAGCTC ACTGCCCGTC CCCGGTGTGC CCGACGTGCC GCTGACCTGG
GAGACACTGC GGATCATCGC CCCGTACGTG GTGGCGCTCA CCGCCGTCGG CTTGGTGGAG
ACGCTGCTGA CCGCGCAGAT CGTGGACCGG TTGACCGACA CCACGCATGA CCCGAACCGC
GAATCCTGGG GCCTGGGTGT CGCCAACATC GTCAACGGGT TCTTCGGCGG GATGGGCGGC
TGCGCCATGA TCGGCCAGAC GATGGTGAAC GTCAACTCGG GCGGGCGGGG CCGGCTGTCG
ACCTTCGCCG CCGGCGGTTT CCTGCTCATG CTGGTCGTCC CGCTGCGCGA CCTCGTGCGG
GTCATCCCGA TGTCGGCGCT GGTCGCGGTG ATGATCCTGG TGTCGGTCAT GACCTTCGAG
TGGCGCAGCA TCCAGCCCTC GACGCTGCGC CGGATGCCCC GCGGCGAGAC CGCGGTCATG
GTGGCGACCG TCGCCGTCGT GGTGCCCACC CACAACCTCG CCTACGGCGT CGGGGTGGGC
GTCATGCTGG CCGCGTTGCT GTTCACCCGC CGGGCCGCGA ACCTGGCGCT GGTGACCAGC
GTGCTCGACC CGGAGGGCCG GGAGAGGATC TACGTCGTCC AGGGCACGCT GTTCTTCGCA
TCGACCAGTG ATCTGGTGAA CGCCTTCGAC TACGCGTGCG ACCCCGAACG GGTGGTCATC
GACCTCTCGG AGGCCCACCT TCTCGACTCG GCGGCGGTCA CCGCCCTCGA CGACGTGCGC
GCGAAGTACC GCGAACGCGG CACGCAGGTG AACCTGGTCG GGGTGAACGC CCGCAGTGCC
GCTCTGCTGC ACAAGCTGTC CGCCGAGCCG GCGCAGGTGG AACCGGCACC GACCGAACCG
GCACCGACCC GCGTCCGGTG A
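
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch for stripping that layout and computing the G+C fraction, so it can be compared with the GC content field, might look like this. The function name `gc_fraction` is hypothetical, and the sketch assumes the input contains only unambiguous bases (A, C, G, T).

```python
def gc_fraction(blocked_sequence):
    """Return the fraction of G and C bases in a whitespace-blocked DNA string."""
    # Remove the spaces and line breaks used for display, normalise case
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input, not the full gene
print(gc_fraction("GGCC AATT"))  # 0.5
```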
 
Protein sequence
MPDVTAREAA DAGGSVRRTA PSGPWYSRRR TGRWPGVRHI RVELLAGLVT ALALIPETIS 
FSVVAGVDPK VGLFASFTIS VVIAFTGGRP AMISAAAGSM ALVAAPLVRD HGLNYLLATT
IGVGLLMFVL GRLGVARLMR FVPQSAMIGF VNALAILIFT AQMPHVIGKS WQVYALVAAG
LAILLALPRL TRAVPPPLVA VAVLTLVTAG FDISVPTVGD EGRLPSSLPV PGVPDVPLTW
ETLRIIAPYV VALTAVGLVE TLLTAQIVDR LTDTTHDPNR ESWGLGVANI VNGFFGGMGG
CAMIGQTMVN VNSGGRGRLS TFAAGGFLLM LVVPLRDLVR VIPMSALVAV MILVSVMTFE
WRSIQPSTLR RMPRGETAVM VATVAVVVPT HNLAYGVGVG VMLAALLFTR RAANLALVTS
VLDPEGRERI YVVQGTLFFA STSDLVNAFD YACDPERVVI DLSEAHLLDS AAVTALDDVR
AKYRERGTQV NLVGVNARSA ALLHKLSAEP AQVEPAPTEP APTRVR
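
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where alternative initiation codons such as GTG are read as methionine at the start of a coding sequence. The sketch below shows one way to reproduce the protein from the gene sequence under that assumption. It is not the database's own procedure: the names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and `START_CODONS` lists only a common subset of the initiation codons permitted by table 11.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 assigns the same amino acids to sense codons as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a commonly used subset of table 11 initiation codons
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(blocked_sequence):
    """Translate a complete CDS, reading an initiation codon as Met."""
    seq = "".join(blocked_sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")   # initiator codon is translated as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":             # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above
print(translate_cds("GTGCCTGATG TAACCGCCCG A"))  # MPDVTAR
```

The printed prefix matches the first seven residues of the protein sequence above.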