Gene Franean1_4617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4617 
Symbol 
ID5672962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5502658 
End bp5503959 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content72% 
IMG OID641243478 
Productextracellular solute-binding protein 
Protein accessionYP_001508894 
Protein GI158316386 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0699008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGTC GCGGCACTCC GAGAGGCCGG GGCGGCCGGT CCACGGCGGC CGCCGTGGGA 
GCGCTGGTAC TGGCCGCGGT GCTGGCCCTG GGCGCCTGCG GCGGTTCCGG CGACGGAGCT
GGCCGCGGCC CGGTGCGGCT GACCTGGTAC GTCTACAACG AGTCGTCCGG TTCGTTCGCG
AAGGCGGCCG CGGACTGCTC GGCCGCGTCG AACGGCCGGT ACACGATCGG CATCAACATG
CTGCCGAACG ATTCGGACGG GCAGCGCCAG CAGCTGGTGC GGCGGCTCGC CGCCGAGGAC
TCCTCGATGG ACATCCTCGC GCTGGACGTG ACCTGGACCG CCGAGTTCGC CGAAGCCGGC
TGGATCGTGC CGTTCCCGGC GGCGGAGGCC AGGCGGCTCA CCGACGGGAT GCTGCCGGCC
GCGGTGCGGA CCGGCACCTG GGAGAACCAG CTCCACGCCG TGCCGCTGAA CACCAACGTC
CAGCTCCTGT GGTACCGCAA GGATCTGGTG CCGCGGCCGC CGCGGACCTG GGACGAGATG
CTGGCCGACG CCCGGCGGCT GGCCGAGCAG GGCAGGCCGC ACTACGTCGA GGTACAGGGT
GCCCAGTACG AGGGCTACAC CGTGCTGTTC AACTCGTTGG TGGCCTCGGC CGGCGGCCAG
ATCCTCGACG AGGACGGCAC CCAGGTGGTG CTCGGCGCAC CCGCGCAGAA GGCCGTCGAG
GCGATCCGGG CGCTGGCGCA CTCACCGGCC GCCGACCCGT CCTGGTCGAA CCAGCGGGAG
GACGACAACA GGCTGGCGTT CGAGACAGGT TCGGCCGCCT TCCAGCTGAA CTACCCGTTC
ATCTATCCGT CGGCCCGGCA GAACAACCCG CGGCTGGCCG AGCAGATCGG CTGGGCGCAG
TGGCCGACGC TGGTGCCCGG TCAGCCGTCG CACAGCACGA TCGGCGGGTA CAACCTCGCG
ATCGGCGCCT ACAGCCCGCA CCGGGCCGAG GCGGCCGCCG CGATCGAGTG CCTGACCGGC
CGGGACAACC AGATCCGCGA CGCGATCGAC GGCGGGCTCC CGCCGACCAT CGAGGATCTC
TACACCGACC AGAAGTTCAT CGCCGGCGGC TACCCGTTCG CGTCGGCCAT CTACACGGCG
CTGCAGAATG CCAGCGTGCG GCCGCGGACG CCGGCCTACC AGAGCGTGTC GCTGCAGATC
GCGCACACCC TCTCACCGCC GTCCTCGGCG AGTCTCGGCA GGCTCGGGCA GCTGCGCGGG
GCGATCGCCG ACGCCATCGA GTCGAAGGGA CTGGTGCCGT GA
 
Protein sequence
MLSRGTPRGR GGRSTAAAVG ALVLAAVLAL GACGGSGDGA GRGPVRLTWY VYNESSGSFA 
KAAADCSAAS NGRYTIGINM LPNDSDGQRQ QLVRRLAAED SSMDILALDV TWTAEFAEAG
WIVPFPAAEA RRLTDGMLPA AVRTGTWENQ LHAVPLNTNV QLLWYRKDLV PRPPRTWDEM
LADARRLAEQ GRPHYVEVQG AQYEGYTVLF NSLVASAGGQ ILDEDGTQVV LGAPAQKAVE
AIRALAHSPA ADPSWSNQRE DDNRLAFETG SAAFQLNYPF IYPSARQNNP RLAEQIGWAQ
WPTLVPGQPS HSTIGGYNLA IGAYSPHRAE AAAAIECLTG RDNQIRDAID GGLPPTIEDL
YTDQKFIAGG YPFASAIYTA LQNASVRPRT PAYQSVSLQI AHTLSPPSSA SLGRLGQLRG
AIADAIESKG LVP