Gene Franean1_5286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5286 
Symbol 
ID5673620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6358613 
End bp6361414 
Gene Length2802 bp 
Protein Length933 aa 
Translation table11 
GC content73% 
IMG OID641244143 
ProductFHA modulated ABC efflux pump with fused ATPase and integral membrane subunits 
Protein accessionYP_001509550 
Protein GI158317042 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.135201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.531019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCGGA CCACGCTGAG CGTGATCAGC CCGGCGGGCC ACAACACGCT GACGGGCGAC 
GCTCCGTGCG TGGTCGGTCG GGCACGCGGC GCGGACATCG TCATCAACGA CGCACGGGTG
TCCCGGCGTC ATCTCGTGCT TGAGCCGGTC GCCGGCGGCT GGCATGTGCG GGACACCAGC
GCCAACGGCC TCTGGCAGGA CGGGCGCCGG ATGGGGGACA CCACGGTCCG CTCGACCGAG
GTCCGCTTCC GGCTCGGCGC CGCGGACGGC CCTGAGGTCG TGCTGGTGCC CTCGTCCGTC
GGGCTGGCCT CCGGTGGTGC CGGAGCGGGC TCCGCGCCCG CCACCTCGGT ACCCGCTCCC
GCGGTCCCGG CCGGGCCGCC GCCGCGCGTG GCGGGTGCGC CGGGATGGCC GTCGTCCAAC
CCCAACCCCA ACCCCAACCC CGGTCCCGCG CCCGCCGGCC CAGGTCACAC GCCTACTCCC
GGCCACGTGC CCACCCCCGG TCAGGCGCCT GCCCCCGGTC GCGCGCCCGC CTCCGGCCGG
GCCCGGGTAG CGGCACCGCC GACGGCCCCG GCATCGGCCC GCCCGCCGGT CCCGGGGGGC
CACCAGCCAC CGCACGCCGG GGGTCCGGGC GCCGCCGGGC CGGACACCGC CTCCGCGGCG
CTCGACCTGG AGACGGCGCA GAGCGCCATC GTGCACGCCC GCCGCAAGAT GCACCCGCTG
CGGCCCGGCA CGATCCGGCT CGGCCGCTCG CGCGACAACG ACATCTCCGT CGCGGACCTG
CTCGCCTCGC GGCACCACGC CGAGCTGCTG GTCGCCCCCG GCAGGGTCGA GGTCGTCGAC
CTCGGGTCGG CGAACGGCAC GTTCCTCAAT GGCCAGCGCA TCGGGCGGGC CCAGGTCGGG
CAGCGCGACG TCATCGCGAT CGGGCACCAC CTCTACCAGC TCGAGGGCGA CTCGCTGGTG
GAGTACGTCG ACTCGGGTGA CGTCGCCTTC GAGGTGCAGG CGCTGTCGGT GTTCGCCGGC
ACCAAGCAGC TCATGCACGA CATGACCTTC CGGCTCCCCG GCCGCTCGCT GCTCGGGGTG
GTCGGCCCCA GTGGTGCGGG CAAGTCGACC CTGCTCAACG CGCTGACCGG CTTCCGCCCG
GCGGACGTGG GCTCTGTCCG CTACGCCGGG CGGGATCTCT ACGCCGAGTA CGACGAGCTG
CGCCGGCGCA TCGGGTACGT CCCGCAGGCC GATCCGCTGC ACGCCCAGCT CACCGTGCGT
GAGGCGCTCG AGTACGGCGC CGAGCTGCGG TTCCCCGCGG ACACCACGGC GCAGGAACGG
CGCGCCCGCG TCCAGGAGGT CATCGGCGAG CTCGGCCTGA CCGCGCACGC CGGCACGCCG
GTGAGCCGGC TCTCCGGTGG GCAGAAGAAG CGGACGTCGG TGGCCCTCGA GCTGCTCACC
CGGCCGTCGT TGCTGTTCCT GGACGAGCCG ACGTCGGGGC TGGACCCGGC GAACGACCAG
TCGGTGATGG AGACGTTGAA GGGCCTGGCC AAGGGCGGCG GGGTCGGTTC GGCGGACGAG
AGCGGCCGCA CGGTCATCGT GGTCACGCAC AGCGTGCTGT TCCTGGACCT GTGCGACTAC
ATCCTGGTGC TCGCCCCGGG TGGGCACGTG GCCTACTTCG GCCCGTCGGA CGGGGCGCTG
AGCTTCTTCG GGAAGGAGGA CTTCCGGGAG TTCGCGGCGG TGTTCCGGGA GCTGGAAAGC
ACGCCGGGTG CGGAGATGGC GGCTCGGTTC CGCGCCTCGG AGTACTTCGT GCCCTCCGCG
GTCGTGGCGC CGATCGTGCG CAAGGCGCCG GCGGAGCTGC CGAGCGTGCG CCAGCAGCCG
GTCACCTCCC AGCTCGCGAC GCTGACCCGG CGATATCTGC GGGTCGTGCT GGCGGACCGT
TCCTACCTGC GGCTCATCGC CGCCTTCCCG TTCCTGCTCG GGATCATCCC GCGGGTGATC
CCGGCCCCCG ACGGGCTGAA ACCCCTGCCG GACGCGCCTA ATCCGGACGC GATGAAGGTC
CTCGTGGTGC TGGTGTTGTG TGCGTGTTTC ATGGGGATGG CGAACTCGGT CCGCGAGATC
GTCAAGGAAC GCGACATCTA CCGGCGGGAA CGAACGATCG GCCTGTCCCG GACGGCCTAC
CTCGGCTCGA AGATCATCGT CCTCACCGGT ATCACGACCC TGCAGTGCGT GATCTTCACG
CTGATCGGGC TGGTCGGCCG CACGCCGCCG GAGGCGTCCC TGCTGGGCTC GCCGCTGGTG
GAATGCCTGG CCGCGGTGAT CGTCGCTGCG CTGGCGTCCA TGATGATCGG CCTGCTCGTG
TCGACACTGG TCGACAACGC GGACAAGACG ATGCCGATCC TGGTGCTGGT GACCATGGCG
CAGCTCGTGC TCTCCGGCGG GCTGGTGTCG CTGTCCGGCC GTCTGGTGAT GGAGCAGGTC
GCGTGGATCG CCCCCGCCCG GTGGGGCTTC GCGGCGCTGG CGTCCACCGA CGACCTCAAC
GAGGTGTCCA AGCTGGGCAA CGAGGTGCTG CGCACGGACC CGGCCGACGG GCTGTGGGAA
CACAGCGCGG GCATCTGGGT GCTCGACGTC GTCCTCGGCC TGGTCCTCGG CGCGGCGGCG
CTCGCGCTGA CCTCGATGAT GCTGCGCCGG ATCGACCCCA AGGTGACCCG GCCCGCCGCC
CCGCCGCCGG GAGTCGGCCG CCCGCCGCAG GCCGGGCAGG CGCCGCCGGC CGGCCCCGGT
GGGCCCGGCG GCTACGGTGC CGCGCCCGGC GCGCGCCGCT AG
 
Protein sequence
MDRTTLSVIS PAGHNTLTGD APCVVGRARG ADIVINDARV SRRHLVLEPV AGGWHVRDTS 
ANGLWQDGRR MGDTTVRSTE VRFRLGAADG PEVVLVPSSV GLASGGAGAG SAPATSVPAP
AVPAGPPPRV AGAPGWPSSN PNPNPNPGPA PAGPGHTPTP GHVPTPGQAP APGRAPASGR
ARVAAPPTAP ASARPPVPGG HQPPHAGGPG AAGPDTASAA LDLETAQSAI VHARRKMHPL
RPGTIRLGRS RDNDISVADL LASRHHAELL VAPGRVEVVD LGSANGTFLN GQRIGRAQVG
QRDVIAIGHH LYQLEGDSLV EYVDSGDVAF EVQALSVFAG TKQLMHDMTF RLPGRSLLGV
VGPSGAGKST LLNALTGFRP ADVGSVRYAG RDLYAEYDEL RRRIGYVPQA DPLHAQLTVR
EALEYGAELR FPADTTAQER RARVQEVIGE LGLTAHAGTP VSRLSGGQKK RTSVALELLT
RPSLLFLDEP TSGLDPANDQ SVMETLKGLA KGGGVGSADE SGRTVIVVTH SVLFLDLCDY
ILVLAPGGHV AYFGPSDGAL SFFGKEDFRE FAAVFRELES TPGAEMAARF RASEYFVPSA
VVAPIVRKAP AELPSVRQQP VTSQLATLTR RYLRVVLADR SYLRLIAAFP FLLGIIPRVI
PAPDGLKPLP DAPNPDAMKV LVVLVLCACF MGMANSVREI VKERDIYRRE RTIGLSRTAY
LGSKIIVLTG ITTLQCVIFT LIGLVGRTPP EASLLGSPLV ECLAAVIVAA LASMMIGLLV
STLVDNADKT MPILVLVTMA QLVLSGGLVS LSGRLVMEQV AWIAPARWGF AALASTDDLN
EVSKLGNEVL RTDPADGLWE HSAGIWVLDV VLGLVLGAAA LALTSMMLRR IDPKVTRPAA
PPPGVGRPPQ AGQAPPAGPG GPGGYGAAPG ARR