Gene Franean1_4479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4479 
Symbol 
ID5672829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5344346 
End bp5345605 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content72% 
IMG OID641243346 
Productmajor facilitator transporter 
Protein accessionYP_001508762 
Protein GI158316254 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGGC GAAAGCAGGT CAGGGTGAGA TCCGAAGGCT CCGCACCGAG GGGCAGATTC 
CTGGTGGTCG TGGGGGCGAG CGCGCTGGGC GTCGTCACGG TCGTGATGCC CTCGCTGCTG
CTGGGTGCCC TGGCGCCGCA GATCAAGATG GATCTCGGCA TCGACGACGT GGCCATCGGG
GTGGTGAGTA CCGTCTACGG GGGCGCGGGC GCCGCCGTAG CGCTGTTCGC TGGCGGGCTC
GCGGATCGCA TCGGGTGGTC CCGGGCCATG ATGGCGGCGA ATTCCGCCGC CGCCGTCAGC
TGCGTCGGGG TCGGGCTGCT GGCCGGGTCG TACGGAATTC TGCTCGGTTG CATGTTCCTC
GCCGGCTGCG CGATGGGCCT GGGCATGCCC GCGTGCAGCC TTCTCCTGGC CAGCGAGGTC
GGCGCGCATC AGCATGGGTT CGTCTTCGGC ATCAAGCAGG CGGCGACGCC CGCGGCGGCT
CTGCTGGCTG GTCTCGCGGT GCCGCTCGTG GGCCTGACCC TCGGCTGGCG CTGGGCCTAC
GGAGCCGTGA GCGTGCTGGC GGTCGCTGCG GCGGTGCTGA GCCGCCGCAT CCGGACGCTG
CAGGCTCCCG GTCCGGCCGC GGGCGGCACA GCGGCGACGC ACCGGGCCGG CGGCGTCGCG
ATGCCGAAGG GACGCCCCAG CGTCGCTGTG ATCCTCCCGC CGACGTCGGC TGCCGCTGGC
GCGTCGATCG CGATGGGGGC GGTGCTGGCG TTCGTTGTGC TGTCCGCCGT CGAGGCCGGC
CTCTCGCAGA GTAGTGCCGG GATTCTGCTG GCTGTCGGGA GCACCGCCGG TATCGCCGTC
CGTATCGGTG GCGGGTGGTG TCTGGACCGT GGTGACGTGT CGGCGTTCCA GTTGTGCGGA
TGGATGATTC TTGCTTGCGC GCTCGGTGCG GGAGCGATGG GGACCCGCGA CACCGTGCTG
GTGGTCGCGG GTGCGATCGT GGCGCTGAGC GTGGGGTGTG GCTGGGCCGG CCTCTTTGAC
GTCGGCCTTG TGCGGGGCAG TCCGCTCGCT CCCGCGCGGG CGTCAGGCCT GGGCCAGGTC
GGCACGGCGG GCGGGGCGGC GGTGGGCCCG CTGCTGTTCG GACTCACGGT CGGTTGGTGG
GGCTACCAGG TCGGCTGGTA CAGCATCGCG GTGCTGTCGA TGACCTCGGC CGGCGTGATG
CTGTGGGCTG CCCGGCGCGC CCGGAGATCG GTACTGGCCA CAGTCACTGC TGCGCCGTGA
 
Protein sequence
MPRRKQVRVR SEGSAPRGRF LVVVGASALG VVTVVMPSLL LGALAPQIKM DLGIDDVAIG 
VVSTVYGGAG AAVALFAGGL ADRIGWSRAM MAANSAAAVS CVGVGLLAGS YGILLGCMFL
AGCAMGLGMP ACSLLLASEV GAHQHGFVFG IKQAATPAAA LLAGLAVPLV GLTLGWRWAY
GAVSVLAVAA AVLSRRIRTL QAPGPAAGGT AATHRAGGVA MPKGRPSVAV ILPPTSAAAG
ASIAMGAVLA FVVLSAVEAG LSQSSAGILL AVGSTAGIAV RIGGGWCLDR GDVSAFQLCG
WMILACALGA GAMGTRDTVL VVAGAIVALS VGCGWAGLFD VGLVRGSPLA PARASGLGQV
GTAGGAAVGP LLFGLTVGWW GYQVGWYSIA VLSMTSAGVM LWAARRARRS VLATVTAAP