Gene Franean1_4978 details


Gene Information

Locus tag: Franean1_4978
Symbol:
ID: 5673317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5972967
End bp: 5974178
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 75%
IMG OID: 641243832
Product: putative ABC transporter, permease protein
Protein accession: YP_001509248
Protein GI: 158316740
COG category:
COG ID:
TIGRFAM ID:
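
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5972967
END_BP = 5974178
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1212
print(protein_length(GENE_LENGTH_BP))  # 403
```

Both results match the Gene Length and Protein Length fields of the record.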


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.80843
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAATCG CATATCACCG CGCCCTGGCC ACGCCGGGTG CCGCCCGCTT CGTCCCCGCC 
GCGTTCCTGG GCCGGCTCCC GATCGCCATG GTCTCCGTGG GGACGGTGCT GCTCGTCCAG
GCCGAGACCG GTTCCTACGG GGTGGGCGGG GCGGTCGCGG CGGCCGGCGC CGTCGGTGAG
GCGCTGCTCG CCCCGCGCCT CGGGCGGGCC CTCGACCGGT TCGGGCAGGC CCGGGTGCTG
TCCGGCTGCC TGGCCGGTCA TCTGGCGGCG ATGACCACGC TGACCGTGGC GGTGACCGCC
GGCGCGCCGC GTCCGGTGTG GTTCACCGCG TCGGCGGTCG CCGGCGGTCT GCTGCCCCCG
GTGGGCGCCT GCGTCCGCGC GCGGTGGAGT GCCCGGCTCG GCGGCGGGGA GCTGCTCGGC
ACCGCGCTGG CGCTCGAGTC GGCGCTCGAC GAGGTGGTGT TCGTCCTCGG CCCCACCCTG
GTGACGCTGC TGGCGGTGTT GATCGCGCCG CCCGCCGGCC TGGTCGCGTC GATGGTCTTC
CTGGCGACGG GCACCGGTGC GCTGATCGCG CTGCGGGAGA GCGATCCGGG GCCGCGCGGC
GGTGGCGCGG CCGCCAGCGC CCGCATGCTG CGCGACGGGG GCACCCGTAC ACTGCTGCTG
ATCTTCCTGT GCATCGGCAT CGCGTTCGGC GGGGTGGACG TGTCGATGGT CGCCTTCGCC
CGGGAGGAGG GGCTCGCCGC GGTGGGCGGG GTGCTGCTCG GGCTGTTCGC GGCCGGGTCG
GCCGTCTCCG GCCTCGTGTA CGGCGCGCGG GCGCACACCC GGCCCCTGTC CGGCCGGTTC
CTGCTCGCCG CGGCCGTGAT GGCGGTCGGG ATGGCCCTGC CACTGGCCGG GGTGACCCTC
GAGCTGATGA TCCCGCTCGC GCTGCTGGCC GGGGCGACCG TCTCCCCCAC TCTGATCAGC
GGCAACGCGG TGGTCGAGCG CCTGGTCGGC GCTGGGGCGC GTACCGAGGG GTTCGCCTGG
CTCACCATGG CGGTCGTCAG CGGGATAGCC GTCGGGGCGC CGGTCGCCGG CAGTCTGGTG
GACGGCGGGG GCGCGCACCG CGGATTACTG GTGACCGCGG GGGCCGGTGT TCTCATCGGT
TCAGCCGCGC TGACCGGCCG CCGCCCGCTC TCCTCAGGTC GACTGGTGGA TCATCGTCCA
CACAACGATT GA
 
Protein sequence
MIIAYHRALA TPGAARFVPA AFLGRLPIAM VSVGTVLLVQ AETGSYGVGG AVAAAGAVGE 
ALLAPRLGRA LDRFGQARVL SGCLAGHLAA MTTLTVAVTA GAPRPVWFTA SAVAGGLLPP
VGACVRARWS ARLGGGELLG TALALESALD EVVFVLGPTL VTLLAVLIAP PAGLVASMVF
LATGTGALIA LRESDPGPRG GGAAASARML RDGGTRTLLL IFLCIGIAFG GVDVSMVAFA
REEGLAAVGG VLLGLFAAGS AVSGLVYGAR AHTRPLSGRF LLAAAVMAVG MALPLAGVTL
ELMIPLALLA GATVSPTLIS GNAVVERLVG AGARTEGFAW LTMAVVSGIA VGAPVAGSLV
DGGGAHRGLL VTAGAGVLIG SAALTGRRPL SSGRLVDHRP HND
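
The two sequence blocks above are printed in space-separated groups of ten characters. The following sketch shows one way to strip that layout, estimate GC content, and translate the gene sequence for comparison with the protein sequence. It is a minimal illustration with hypothetical helper names, not the method used to produce this record. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table in the usual TCAG ordering (amino acid assignments shared by
# the standard code and bacterial translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(block)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(block: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(block)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 and last 12 bases of the gene sequence above.
print(translate("ATGATAATCG CATATCACCG C"))  # MIIAYHR
print(translate("CACAACGATT GA"))            # HND
```

The two fragments reproduce the first seven and last three residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.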