Gene Franean1_2521 details

Gene Information

Locus tag: Franean1_2521
Symbol:
ID: 5670917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3000770
End bp: 3002263
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 66%
IMG OID: 641241438
Product: major facilitator transporter
Protein accession: YP_001506859
Protein GI: 158314351
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
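
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3000770, 3002263)
print(length, protein_length_aa(length))  # prints "1494 497"
```

Both results agree with the Gene Length and Protein Length fields in the record.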


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGCG AAGACTCCGG ACGCGTCGAC GCGCGGAAGA CGGATGCAGG GCCGCGGCGC 
TGGTGGATCC TGGGAGCTCT CTGCCTCGGT CTGGTCGTCG TCGAGGTCGA CGCGAGCATT
CTCAACGTCG CGATCCCGTC CATCGCGGCT GACCTTGACG CCGATCCGGC CGACATGGCG
TGGATCGTCG ACTCCTTCGT CCTCTCCTTC GCAAGCTTCA TGCTCGTCGC CGGACGCCTG
GGTGACCGTT TTGGGCACAG GAAGATACTC GAGGCGGGAC TCCTGCTGTT CTCCCTGGCC
TCCTTCGCCG CCACCTTCGC GGACTCACCC GCGGGCCTGG TCGCATGTCG CGCGGTCCTC
GGTATCGGCG CCGCGGCTAT CCTGCCGACA TCGAGAGCGC TGGTCATGGC GGCGTTCCCC
TCCGCCGAGC ACCCCAGAGC CCTGGGGTAC TGGACCGCGG CAATAGGAGT GAGCTTACCG
CTGGGACCTT TGCTCGGCGG GCTGCTCCTC GACCATTTCT GGTGGGGTTC CGTCTTCCTC
GTGGGCGGGC CCACCTCGCT GCTGGCCGGT GTCATCAACG CGGTGGTGCT CGGGCGTCAG
AGGGCTGGTG GGGGCAAAGG CGGATGGGAT CCGCCTGGGA TTGTCCTGTC CGCGTTCGGA
ACAGCAGCAT TGATATTCGG CATCATCGAG GGCCCTCGCC TGGGGTGGCT CTCCGCCACG
ATTCTGGCAT GCCATGGCTC CGCCCTGGTG GCACTGGCCG CCTTCTTCGT CTGGGAACGT
CATGCGACGC ACCCGGTGGT CGAACCGGAA CTGCTGCGAG TGCGATCGTT CACCGTGGGC
TCGATGGTCG CCGCACTCTC GCTGTTCGTG TTCAACGGGC TCCTCTTCGT GCTCACCCAG
TATCTCCAGA TCCTCGAGGG ATACAGCCCT CTCCAGTCCG GTCTCCGAGT GATCCCGCTG
GGAGCGGGAT TCATGCTTGG CAGCGTCTTG TCACGCCGTG CGGCCCTGCG AATCGGTCAA
CGTGGAACGC TGGCCATCGG GTTCGCGGCG GTGGGATCGG CCCTCCTGGT ACTCCTGCTC
GGTACCTGGT CCGACGGTTA TCTCGTCACG GGTGTCGGTG TTCTGATCGT CGGCTGGGGA
ACCGGGCTGA CAATGGCCTG CGCCGTCCAT CTGGCACTGA GCGAGGTTCC GGCGTCCACA
GCGGGCGCCG CCGGAGCCTT CAGTAACTCG GTAAGGCAGT TGGGGGCGGC ACTCGGCGTC
GCAGTCCTCG GCGCCGCACT CGGGACTTCC GCAGGTAGAA GTGGCGCGCA GAACCCTTCT
TCCTCTGATC CTGTCAGCAA TACCGCTGGA AGCGTCGTGG CGGACATTGC GACCGCTCGT
CCGGGCCTTG GCATGGAAAA CGGGTTCAGG CTCGCCTTCG CCCTGGCGGC GATCGTGAGT
TGCCTGTGCA TCGTGCTTGT CGCCGTGGCT TTCAGGAAAG TTCGATCAGC CTGA
 
Protein sequence
MEREDSGRVD ARKTDAGPRR WWILGALCLG LVVVEVDASI LNVAIPSIAA DLDADPADMA 
WIVDSFVLSF ASFMLVAGRL GDRFGHRKIL EAGLLLFSLA SFAATFADSP AGLVACRAVL
GIGAAAILPT SRALVMAAFP SAEHPRALGY WTAAIGVSLP LGPLLGGLLL DHFWWGSVFL
VGGPTSLLAG VINAVVLGRQ RAGGGKGGWD PPGIVLSAFG TAALIFGIIE GPRLGWLSAT
ILACHGSALV ALAAFFVWER HATHPVVEPE LLRVRSFTVG SMVAALSLFV FNGLLFVLTQ
YLQILEGYSP LQSGLRVIPL GAGFMLGSVL SRRAALRIGQ RGTLAIGFAA VGSALLVLLL
GTWSDGYLVT GVGVLIVGWG TGLTMACAVH LALSEVPAST AGAAGAFSNS VRQLGAALGV
AVLGAALGTS AGRSGAQNPS SSDPVSNTAG SVVADIATAR PGLGMENGFR LAFALAAIVS
CLCIVLVAVA FRKVRSA
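
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the Python below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch ignores), and all function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard assignments, assumed for table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGGAGCGCG AAGACTCCGG ACGCGTCGAC")
print(translate(prefix))  # prints "MEREDSGRVD"
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked the same way.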