Gene Franean1_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3943 
Symbol 
ID5672304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4712443 
End bp4713654 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content75% 
IMG OID641242822 
ProductABC transporter related 
Protein accessionYP_001508239 
Protein GI158315731 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.39994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0172807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAGGA TCGAGCTCGA CGGGCTGACG AAGAGATACG GCGACGTCGT CGCCGTGGAC 
GGCGTCAGCC TCGACATCGC CGACGGCGAG TTCCTGGTGC TGCTCGGCCC GAGCGGCTGC
GGGAAGTCCA CCCTGCTGCG GCTCGTCGCC GGCCTGATCA CACCGTCCGG CGGCCGGGTG
CTGCTCGACG GCCGGGACAT CACCCACGCC CCACCCGCCC GGCGCGACCT GGCGATGGTC
TTCCAGAGCT ACGCCCTGTA CCCGCACCTG ACCGTGGCCC GCAACATCGG CTTCCCGCTG
CGCGCGGCCC GCACGCCCCG CGCCGAGGTC CGCCGCCGGG TCGAGGAGGT CGCCGCGCTG
CTCGAGCTCG GTCCGCTGCT CGACCGCCGG CCGCGGGAGC TCTCCGGCGG CCAGCGCCAG
CGGGTCGCCG TCGGCCGGGC GATCATCCGC GACCCGCGGG CGTTCCTGAT GGACGAGCCG
CTGTCGAACC TGGACGCCAA GCTGCGCCAG GCGACCCGGG CCCAGTTCCG GATGCTGCAC
GAGACCCTCG GCAGCACCGT CCTCTACGTC ACCCACGACC AAGTCGAGGC GCTCAGCCTC
GCGACCCGGA TCGCGATGCT CGACGGCGGG CGCCTCGAGC AGCTCGGCAC GCCGACCGAG
GTCTACGACA GCCCGGCCTC GGTGTTCGTC GCTGGCTTCC TCGGCTCCCC GCCGATGAAC
CTGCTGCCCG CCCGAGTGGA GTGCCACGAC GGCCGGGTGC GGGTGCTCGC CGACGACGTC
GAGGTCGACC TGTGGCCCGG CGAGGACGTC GAGGCCCGTG ACGTGATCCT CGGGATCCGG
CCCGAACACC TCCACCTGAT GGCCGCCGAG GGCCCGGCGG GCCCGGGCGG GATGCCGCGG
CTGCGCGGTG TCGTGCGGGC GGTGGAGAAC CTCGGCGCCG AGGAGTCCGC GCAGTGCGCG
GTCGGCGGCG CCCTCGTGCA CTTCCGGGGC GCCCGCCCCC TCGGGCTGGC CGCCGGCCAG
CCCGTCGCGC TCACCACGGC GCGCGACCAC ATCCACCTGT TCGACCGGCA CAGCGGCCGG
CGGCTCGCCT GGCGCCCGCT GGCCGACCGC CCGCTGGCCG ACCGCCAGTC GGCCGACCAC
GAGCCGGCCG ACCACGAGCC GGCCGCGGCA CTCCACGACG GCGTGACCAC TCATCAAAGG
AGCTCGACGT GA
 
Protein sequence
MGRIELDGLT KRYGDVVAVD GVSLDIADGE FLVLLGPSGC GKSTLLRLVA GLITPSGGRV 
LLDGRDITHA PPARRDLAMV FQSYALYPHL TVARNIGFPL RAARTPRAEV RRRVEEVAAL
LELGPLLDRR PRELSGGQRQ RVAVGRAIIR DPRAFLMDEP LSNLDAKLRQ ATRAQFRMLH
ETLGSTVLYV THDQVEALSL ATRIAMLDGG RLEQLGTPTE VYDSPASVFV AGFLGSPPMN
LLPARVECHD GRVRVLADDV EVDLWPGEDV EARDVILGIR PEHLHLMAAE GPAGPGGMPR
LRGVVRAVEN LGAEESAQCA VGGALVHFRG ARPLGLAAGQ PVALTTARDH IHLFDRHSGR
RLAWRPLADR PLADRQSADH EPADHEPAAA LHDGVTTHQR SST