Gene Franean1_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3871 
Symbol 
ID5672234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4601008 
End bp4602132 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content71% 
IMG OID641242749 
Productputative sulfonate binding protein precursor 
Protein accessionYP_001508169 
Protein GI158315661 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.080113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.159849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGGATG AGGGCAGACT GCCCGCAGGC CGTGACGCTA CGCCCGCGCC GTCCGTTCCG 
CTACGAGTCC TCATGATCGG CCGTCCGGCC GGACGCCGGT CCCGACGGCT GCTGGTGGTT
GTTCCCCTCG CCCTGGTCGG ACTGCTCGCG GCGGCCTGCG GCGGGAGCGA CCCGGCGCCG
GGTACGGCCG CGGCGGCGGC GAACGACCTC TCCGGCGTCA CACTGCGTTT CGGTGACCAG
ATCAACGGGG TCCGCTCCGT GCTGGAGGCC TCCGGCGAGC TGGACGACGC ACCGTACAAG
ATCGAGTGGA GCCAGTTCCA GGGCGGTGGC CCCACGGTGA TCGCGGCGCA GACCGGCGGG
GACGTCGACC TCGGGACGAT GGGGGAGACA CCGGTGGTGT TCGCGCAGGC CGCGCACAGC
CCGGTGACGG TCGTCGGGGC CGCCCGGATC GTCGACCCGG CGAAGTCGAA CTTCGCGCTG
GTCGTGAAGA AGAACTCGCC GATCCGGTCC GTCGCGGACC TGCGCGGCGC GACGATCCTC
AACAGCCAGG GCACGGTGTC GCAGTACCTG GTCGCGAAGG CGCTGGAGAA GGGCGGCCTC
ACGACCGACG ACGTGAAGCT GGTGAACCTC CAGCAGGGTG CGCAGGCCGC CTACGACCGC
GGCGACATCG ACGTCATCGC CAGCGGGGGA CCGCCCCTGG CGATGATGCT GGCGAAAGGC
ACCGACCGGG TCCTCATGAC CGGCGCGGGT GTGCTGCCCG GAGTCAACTA CCTGGTCGCG
CGCAACGGCG CTCTCTCCGA CGCCGGGCTC AGCGACGCCA TCGGCGACTT CCTCGGCCGG
CTCGCGCAGG CGCAGGACTG GTACAACGCC CACCCGGACG CCGCCATCGC GATCGTGAAG
CCCACCTACA AGGTGGACGA CACCGTCGCC CGCGCCATCA TCGACCTCGC GCCCCTGCAC
TACGTGCCGA TCGACAGGAC GGTCACCGAC GCCCACCAGC GCGAGGCGGA CTTCTTCGCC
GACCAGGGCG TGCTGAAGGC GAAGATCGAC ACGTCGACGG TCTTCGACGA CCGTTACAAC
CCGATCATCA ACGCGGTGGC CCAGGGCCGG CCTGGTCCGT CATGA
 
Protein sequence
MGDEGRLPAG RDATPAPSVP LRVLMIGRPA GRRSRRLLVV VPLALVGLLA AACGGSDPAP 
GTAAAAANDL SGVTLRFGDQ INGVRSVLEA SGELDDAPYK IEWSQFQGGG PTVIAAQTGG
DVDLGTMGET PVVFAQAAHS PVTVVGAARI VDPAKSNFAL VVKKNSPIRS VADLRGATIL
NSQGTVSQYL VAKALEKGGL TTDDVKLVNL QQGAQAAYDR GDIDVIASGG PPLAMMLAKG
TDRVLMTGAG VLPGVNYLVA RNGALSDAGL SDAIGDFLGR LAQAQDWYNA HPDAAIAIVK
PTYKVDDTVA RAIIDLAPLH YVPIDRTVTD AHQREADFFA DQGVLKAKID TSTVFDDRYN
PIINAVAQGR PGPS