Gene Franean1_5456 details

Gene Information

Locus tag: Franean1_5456
Symbol:
ID: 5673787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6597582
End bp: 6599099
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 72%
IMG OID: 641244311
Product: major facilitator transporter
Protein accession: YP_001509717
Protein GI: 158317209
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
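
The start, end, gene length and protein length values above are consistent with each other. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative only, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 6597582
end_bp = 6599099

# Assumption: coordinates are inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1518 bp
print(protein_length, "aa")   # 505 aa
```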


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.247953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.756284
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGAAG CAACACACAC CTCTCGGACG CGGGCACCAC ATCCCGCCGC TGAGCGCCCG 
GCGGCCGGCG GAGCCCTGCT GGCCGTCATC CTCCTCGGCC AGTTCATGGC CATCCTGGAC
GTCAGCATCG TCAACGTGGC GCTGCCGACG TTGCGGACGG ATCTGGACGT CTCCGGCGCC
GGGCTACAGC TGATCGTCGC CGGCTACGTC CTCTCCTACG CGGTTCTGCT GATCACCGGT
GCGCGGCTGG GCGGCATGCT CGGGCACCGG CGCGTGTTCA ACACCGGGCT CGCCGGGTTC
ACCGTGGCCT CGCTGGCCTG CGGCCTGGCA CCCGGCACGT CCTCACTGAT CGCCTTTCGG
TTCCTGCAGG GGGTCAGCGC CGCGTTGATG ACCCCCCAGG TCATGAGCCT GATCCAGCGC
AATTTCGCCG GTGCCGCGCG GATGCGGGCG CTCGGCTACT TCTCCGCGGT GATCGCGGGA
GGCGTCGTCG TCGGCCAGGC CGCCGGCGGG CTGCTCGTCA GCGCGAACCT GTTCGACGCC
GGCTGGCGGA CCGTCTTCCT GGTGAACGTG CCGATCGGGG TGCTGCTGCT CGTCGTCGGC
CCGCGGGTCA TGCCCTCGGA CGAGGGGCGC GCGGGGGTCG ACCTCGACCT TCCCGGCCTG
CTCGTTCTCA GCGCGGCGGT GCTGCTCTTC GTGATGCCGC TGATCCTCGG TCACGAGCTG
GGCTGGCCGG CCTGGACGGC CGTGTCGCTG GCGGCGAGTG TCGTCCTGTT CGTGGTCTTC
GTCCTGGTGG AGCGGGGCGT CGCGGCCCGC GGGCGACGTC CGCTGATCTC GGGACGGGTG
CTGCGCGCGC CCGGGCTGCT ACCCGCCGCC GGCACGCTGC TGCTCGGGCC GGCGTCGTGG
GCCGGGTTCC TGTTCACCAC CACCCTGCAC CTGCAGGGCG ACCTGGGGAT GAGTCCGCTG
CGCTCCGGGC TGGCGTTCGT CCCGTGCGTC GTCGCGTTCG CCGCGGTCGG GCTGAGCTGG
CAGCGCCTGC CGTCCAGCTG GCACGCGAAC CTGATCCCGG TCGGGCTGGC GTTCGCCGCC
GTCTCGTATC TGTTCCTCGG GCCGCTGGCC GGCGGCGGCG CGCGCTACGA GATCCTCACC
GCGTTGATCG GCCTGGGACT CGGCGTGATG TCGATCATCA TCTCGGTGAC GCTCGAACAC
GTCCCGGTCG AGGACGCGGC CGACGCCAGC GGCCTGCTGC TGACCCTGAT GCAGCTCGGC
CAGGTCATCG GCATCGCCAC GGTCGGCACG GTGTTCCTGA CCACCGCCGC CGACGGCGGC
TCGACCCGCG ACGCGGAGTA CAGCACCGGC TGGGCGCTCG CGGCGGTCGC GTTCGTCGCG
GCGGCGAGTG CCCTCCTGCT CGCCCGGCGT CGCGGGCGGC AGCTGAGAGT GGTCGTTCCC
GTCCAGGCAC ACGACGTCGA TCCGGACCGT GCGGATCAGG TGATCGTGGC CGCTCCCACA
GTGCCGACCG GCGTCTAA
 
Protein sequence
MVEATHTSRT RAPHPAAERP AAGGALLAVI LLGQFMAILD VSIVNVALPT LRTDLDVSGA 
GLQLIVAGYV LSYAVLLITG ARLGGMLGHR RVFNTGLAGF TVASLACGLA PGTSSLIAFR
FLQGVSAALM TPQVMSLIQR NFAGAARMRA LGYFSAVIAG GVVVGQAAGG LLVSANLFDA
GWRTVFLVNV PIGVLLLVVG PRVMPSDEGR AGVDLDLPGL LVLSAAVLLF VMPLILGHEL
GWPAWTAVSL AASVVLFVVF VLVERGVAAR GRRPLISGRV LRAPGLLPAA GTLLLGPASW
AGFLFTTTLH LQGDLGMSPL RSGLAFVPCV VAFAAVGLSW QRLPSSWHAN LIPVGLAFAA
VSYLFLGPLA GGGARYEILT ALIGLGLGVM SIIISVTLEH VPVEDAADAS GLLLTLMQLG
QVIGIATVGT VFLTTAADGG STRDAEYSTG WALAAVAFVA AASALLLARR RGRQLRVVVP
VQAHDVDPDR ADQVIVAAPT VPTGV
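
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one possible way to check this on short stretches of the record. It assumes the NCBI base order TCAG for indexing codons; the amino-acid assignments of table 11 are the same as those of the standard code. The function name `translate` is hypothetical, and alternative start codons are not handled.

```python
# Codon lookup in NCBI order: first, second and third base each run T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()   # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":            # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGGTCGAAG CAACACACAC CTCTCGGACG"))  # MVEATHTSRT
print(translate("GTGCCGACCG GCGTCTAA"))               # VPTGV
```

Both outputs match the first ten and last five residues of the protein sequence listed above.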