Gene Francci3_2471 details


Gene Information

Locus tag: Francci3_2471
Symbol:
ID: 3904849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2915728
End bp: 2917275
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 70%
IMG OID: 637879801
Product: Na+/solute symporter
Protein accession: YP_481567
Protein GI: 86741167
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
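
The start and end coordinates, gene length, and protein length in this record agree with one another if the coordinates are 1-based and inclusive and the coding sequence ends in a single untranslated stop codon. Neither convention is stated in the record, so both are assumptions here. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 2915728
end_bp = 2917275

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in exactly one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the computed values match the Gene Length and Protein Length fields.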


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.108302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACG GGACGACCGT GGCGGGATTC ACCGTCGCGA TAGCAGGCAC TTCGTTACTA 
TCGGTAAGGG CGCGCGCGTT CCATCGGCGG GACGAGCTCC CCTCGCCGGA AGGGTGGGCG
CTCGCAGGGC GCCCGTTCGG TGGCGTTCTG ACCTGGTTCC TGCTCGGCGG TGCGATCTAC
ACCGCATATA CCTTCGCCGC GGTGCCCGGC CTCGTCTACG GCGTGGGCGC TCTCGGCTTC
TTCGCGCTGC CCTACACGAT CATCGTCTAT CCGCTGGCCT TCGTGCTCCT CCCACGTCTG
GCGGAGACGG CTCGGCGCCA TGATTACGTC ACGGTGGGCG ACTACGTGCG GGAACGCCAC
CGGTCGCCGC TGCTCGCGCT CGCCGTGGCG CTGACCGGAA TCGTGGCCAC CACGCCGTAC
ATCGCGCTGC AGTTGGTCGG CGTCCGCGCG GTGCTCGTTG CCGGCGGCCT CACCCCGCCT
GGGCTGCCCG GCGACCTGGT GCTGGCCTCG GTGTTCGGGG TGCTCGCCGT CGCGACCTAC
CGCAGCGGCA TCCGCGCTCC GGCACTCGTC TCGGTCGCCA AAGGAATTCT GATCTTTTTG
GCGGTCTTCG CCGTGGTCGG ACTCGTCCTA GCCAGGCTCG GCGGACCCGG CGAGATCTTT
GCCGCAGCAC GGCGGAACAT GGCCACGGAC GGCTTGCCGG ATCCGTCGCT CACGCTGGAG
CCGTCGACGT TCGCCTCGTT CGCGAGCCTC GTGCTCGGCT CGGCGATGGC CCTGCTGGTG
TACCCGCACG TGCTCACCGC CGCATTTGCG GCGAAGGACC CCGACGTCCT GCGCCGCTCG
GCGGTCACCC TGCCCGCATG GACGGGCCTG CTCGGACTCT TCGGGCTGCT CGGCGTTGCC
GCCTTGGCCA CGGGGGTACG CACACCCCGC GGGCAGGCCG AAGCAGCTGT CCCTCTGCTC
GTCCAGCAGC TGGCCCCACC GGTTGTCACC GGCGTCGTAT TCGGCGCGCT CGTTGTCGCG
GCGCTCGTCC CCGCGGCGGT GATGTCGGTG GCGGTCGCCA TGCTCTTCGT CCGCAACGTA
TACGTCGAGT ATTTTCACCC CACGGCCACC CCTAAGCATC AGGTCCGGGT GGCTCGCGGT
GTCTCGCTGG TGACAAAGAT CGGGGCATTC GCCTTCGTGC TCGGTCTGCG CGACCAGGAT
GCCATCGATC TGCAGCTGCT GGGCGGGGTC TGGATACTCC AGACCTTCCC GGCGGTCGGG
ATCGGCCTGT TCTCCACCTG GCTGCATCGC GGTGCGCTGC TTGCCGGATG GGCGGTAGGA
ATGGCGGCGG GAACCTGCCT GGTAACCTCC GGCGAATTCT CCGCGGTGGT GAACCTGCAC
CTGTTCGGGG CGTCCGTGCC GCTGTACACG GCGCTCCTCG CGCTCGGGCT GAACCTGGCG
GTCGCCACCG CGCTCACCCC GGTCCTTCAC CGCGTCGGCG TGGCCCGGAG CAGCCGGACA
TCCCAGGTGG ACCGACTCCC CCGGCCGGAC GCGCCGGGAT GGCGGTGA
 
Protein sequence
MADGTTVAGF TVAIAGTSLL SVRARAFHRR DELPSPEGWA LAGRPFGGVL TWFLLGGAIY 
TAYTFAAVPG LVYGVGALGF FALPYTIIVY PLAFVLLPRL AETARRHDYV TVGDYVRERH
RSPLLALAVA LTGIVATTPY IALQLVGVRA VLVAGGLTPP GLPGDLVLAS VFGVLAVATY
RSGIRAPALV SVAKGILIFL AVFAVVGLVL ARLGGPGEIF AAARRNMATD GLPDPSLTLE
PSTFASFASL VLGSAMALLV YPHVLTAAFA AKDPDVLRRS AVTLPAWTGL LGLFGLLGVA
ALATGVRTPR GQAEAAVPLL VQQLAPPVVT GVVFGALVVA ALVPAAVMSV AVAMLFVRNV
YVEYFHPTAT PKHQVRVARG VSLVTKIGAF AFVLGLRDQD AIDLQLLGGV WILQTFPAVG
IGLFSTWLHR GALLAGWAVG MAAGTCLVTS GEFSAVVNLH LFGASVPLYT ALLALGLNLA
VATALTPVLH RVGVARSSRT SQVDRLPRPD APGWR
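
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a rough Python sketch, not the method used to produce this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It relies on the fact that translation table 11 uses the same amino-acid assignments as the standard code and differs only in which codons may serve as starts.

```python
# Amino-acid assignments for codons enumerated in TCAG order.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCCGACG GGACGACCGT GGCGGGATTC"
print(translate(first_blocks))  # MADGTTVAGF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.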