Gene Franean1_3245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3245 
Symbol 
ID5671620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3834717 
End bp3835916 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content67% 
IMG OID641242138 
Productputative branched-chain amino acid ABC transport system, solute-binding protein 
Protein accessionYP_001507558 
Protein GI158315050 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCAG TCCGCTACTT AGGGGCGGCA GTCGCCGTCA TGACGATTGC CGTGGTGGGT 
TGTTCCCCGC CAGGATCCGT CAGTAATGTG TCGGGTAGCG AGTGCAATAG CCCCGGCATC
ACCCTCGACC AGGTGAAACT CGGTCTCGTG ATCTCGGAGT CCGGTGGGTC GGGTGCGTTG
ACCTCTGCCC GGTCCGGGAT CGATGCCCGG TTGAGCGAGG CGAACGCGGC CGGTGGGGTA
CATGGCCGTC GAATATCCTA CAGCTGGCGT GACGACCATT CGTCCGTGGC CGAGGACGCT
CGCGCGACCG AAGATCTGGT CCACCGCGAC TCGGTGTTCG GCCTGCTTGC GGCCACCTCG
TCGCTGGGCG GTTCGCTGGA CAGTCTGGTG GCGGAACAGG TTCCGGTCGT CGGGCTCGCC
GCCGACGCGA ACTGGGTGAA ACAGTCGAAC TTCTTCTCGT ACATGTACCA GGGGTCTGTC
CCGGTGCTTG CCGGCTATAT CCGGTCGGCT GGTGGCACGA AGGTCGCGGT TGTCACGCCC
GGCGCGTCAC CCTTCACCGC CGGGGCGGCC AAAACAGTCG GCGACGAGAT GAGCCTGAAC
GGCCTCACCT ACGTCGGGTC GTTTCCCTAC TCCCGCGGAG CGGACAGTCC GGAGCAGGTG
GCCCGGAACA TCGCCGCCAG TGGTGCGAAC GCCCTCGTCG CCCTCACCAT CCCAGAGGAC
CTGGCGGCCC TGATGCAGGC CCTGCGGGCC GCCGTGCCCA ACCTCGCGGT CACGGTCGCC
ATGAGTGGCT ACGACCGCAG CATCCTTCCC GCCTTCGGAC GGGCGCTGGC CGGAGTCTCC
ATCCCGGTCT ACTTCCGCCC GTTCGAGGCC GGCGGCCCGG CGATCGAACG CTACCGTCAA
TCCATGGCCA CATTCGCGCC GCAGGCCCCG TTCCCCGAGC AGCAGTTCGC CATGAACGCC
TACATCTACG CCGACATCTT CCTGCGCGGG CTCGAACTGG CTGGTCCGTG CCCGACGCGG
GAAGGATTCA TCAACGCCCT GCGTGGCGTC GACAACTACG ACGCCGGCGG ACTGATCGCA
CCGGTCGATC TGGCTGACAA TCTGGCCCAG CCACCGATGT GCCACTCGAT TGTCCAGGTC
AGTCCGGCAG GGGACGCCTT CCAGGTCGTC CGCGAGCGCA TCTGTGCGGA CGGCAGTTAG
 
Protein sequence
MRSVRYLGAA VAVMTIAVVG CSPPGSVSNV SGSECNSPGI TLDQVKLGLV ISESGGSGAL 
TSARSGIDAR LSEANAAGGV HGRRISYSWR DDHSSVAEDA RATEDLVHRD SVFGLLAATS
SLGGSLDSLV AEQVPVVGLA ADANWVKQSN FFSYMYQGSV PVLAGYIRSA GGTKVAVVTP
GASPFTAGAA KTVGDEMSLN GLTYVGSFPY SRGADSPEQV ARNIAASGAN ALVALTIPED
LAALMQALRA AVPNLAVTVA MSGYDRSILP AFGRALAGVS IPVYFRPFEA GGPAIERYRQ
SMATFAPQAP FPEQQFAMNA YIYADIFLRG LELAGPCPTR EGFINALRGV DNYDAGGLIA
PVDLADNLAQ PPMCHSIVQV SPAGDAFQVV RERICADGS