Gene Franean1_3190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3190 
Symbol 
ID5671566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3760982 
End bp3763852 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content71% 
IMG OID641242084 
ProductABC transporter related 
Protein accessionYP_001507504 
Protein GI158314996 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATCC CCACCACCCA GCTGCTCTTC GACGGCGCGG TCACCGGCCT GGTGATCGGC 
CTGCTCGCGG TCGGCATCGT CCTCGTCCAC CGCTCGACCC GCGTCATCAA CTTCGCGGTG
GCGAACATGG GCCTGGTCGG CTCGGCCCTC TTCGCACTGC TCACGGTGCG TTACAACGTT
CCGTACTGGA TCTCGCTGGC CATCGCGCTG CTCGTAGGTG TGCTGTTCGG CGCCCTGGTC
GACCTCACGG TGATCCGCCG GCTGTTCGCC GCGCCACGGG TGATCCTGCT GGTCGCGACC
ATCGGCGTCG CCCAGCTCGC GCTGACCGTC GTCACCTCCC TGCCCGACCT CGACGACTAC
CCGAGCGAGT CCTACCCGGT GCCCTGGTCG GGGAGCTGGT CACCGGCCAG CGGCCTCACG
ATCACCGGGG CGCAGCTGAG CGTCCTGGTC ACGGTGCCTC TCGTCGCGCT CGGGCTCAGC
CTGTTCCTGG GCCGCACCGT GCTCGGCAGG ACGGTCAAGG CCGCCGCGGA CAACCCCGAG
CTCGCCCGCC TTCAGGGCAT CAGCCCCAAG ACCGTGTCGA CCGCGGTCTG GGCGGTGGCG
GGGCTGATCG GAACTCTGTG CCTGATCCTG GTCTCCGCGC AGAACCAGTC CCTGACCCAG
ATCACCACCC TCGGCCCCAC CACCCTGCTG CGAGCCCTGG CCGCCGCGGT GATCGCCCGG
ATGACGTCGT TCCGCGTCGC GCTGCTCGCC GGGATCGCCC TGGGCCTGGC CCAGTCGTTC
ATCCAGTTCA ACTGGCTCGA CCAGCCCGGC CTCACCGACC TGACGATCCT GGTCGTGGTG
CTGGTCGCGG TGTTCTTCGT CAGCCGCGGC CGGGACACCG AAACCTCGTC GTTCTCCTTC
GCCCCGAAGA TCAAGCCCGT GCCCGACCGG CTGCGCGGGC TCTGGTGGGT CCGCCACCTC
GAACGCGCAC CCCTGGTCCT GCTCGGTCTG GTCGCCGTCG TCCTCCCCCT GCTCGTCGCC
CAGCCGTCAC GGCACCTGCT CTACACGGTG ATCCTCGGCT ACGCGATCTG CGCCGCCTCG
ATCACGATCC TGACCGGCTG GGCCGGGCAG CTCTCCCTCG GCCAGATGGC CTTCGCGGGA
TTGGGCGCGC TGCTCGCCGC CGCACTCAAC CGTGGCCTGC GCCTGCAGGT CGGCGGCTCG
GTCATGGCGC TGAACGCCCT GCCGTTCCCA GCGGCGGTCG CGATCGCGAC GGCGTTCACC
GCGGCCCTGG CGGCCGTGGT CGGAGCGGGC GCGCTGCGGG TGCGCGGGCT GCTCCTGGCT
GTCAGCACCT TCGCGTTCGG GATCGCGGCC GAGCAGTACC TCTACCGCCG CGACGTGTTC
CACGACGAGG GCGGCAACAC CGCCTCGTTC CCGCGCAGCA CCGTCTTCGG GATCGACGTC
ACCACCCAAC GGGCCTACTA CTACCTGGTC CTGATCGTCC TGGTTCTGGT CCTGCTCGTG
GTGGCCCGGC TGCGCAGGTC CGGGGTCGGG CGGACCACGA TCGGCGTCCG CGACAATCCC
GCCACCGCGG CCGCGTACAC CGTCGCCCCC ACCCGGGTGA AACTGCGCGC GTTCGCTCTC
GCCGGCGGGA TCGCCGGCCT CGGCGGCAGC CTGCTCGCCG GCGCCGTCCA GAACGTCCCC
TACGCCGACA GGTACTTCCT CAGCCCCGAC TCGCTGATCC TGGTCTCCAT CGTGGTGATC
GGCGGCCTCG GCTCGGTCAC CGGCCCCCTG CTCGGCTCCC TGTGGGTCAT CGGCCTGCCG
TCCTTCTTCC CCGACAACGA CATCGTCCCG CTCCTGACCT CCAGCCTGGG CCTGCTGATC
CTATTGCTCT ACTTCCCCGG CGGCCTGGTC CAGATCGGGT ACAGCGCCCG CGACGCCCTC
CTCGCCTGGG CGGACAGAAG GCTCGGCACC ACTCCAGCGG TGAAGAGCGT CTCGACCGTG
CCCCCGGCTC TCACCCGCAC CACCCGCCCG CCGCTCCCGG AGAACACCCC CGTCCTCGAG
ACCAGCGACA TCCGGGTGCG ATTCGGTGGC CGGGCCGCCG TCGACGGCGT CTCCCTCACC
GTCATGCCCG GCGAGATCGT CGGCCTCATC GGCACCAACG GCGCCGGCAA GTCAACCCTC
ATGAACGCCA TCGGCGGCTT CGTCCCCGCC ACCGGCACCG TCACCCTGCT CGGCCGGGAC
GTCTCCAACG CCAGCCCGGC GGCCCGGGCC CGCGGCGGTG TCGGCCGAAC CTTTCAGGCC
GCCGCGCTGT TCCCCGAGCT CACCGTCCGC GAAACCGTCC AGATAGCGCT GGAGGCCCGC
GGCCGCACCG GGCTTCCCTC CACCGCCCTG CACCTGCCGC ACACCTTCCG CGCCGAACGC
GCGAAACGCT CTGCCGCCGA CGACCTCATC GACTTCCTCG GCCTCGGCCG CTACGCCGAC
GCCTTCATCG CCGACCTGTC CACCGGCACC CGCCGCATCG TCGAGCTCAC CGGCCTGCTG
GCCCTCGACG CCCAGGTCCT CTGCCTCGAC GAACCCACCG CCGGCGTCGC CCAACGCGAA
ACCGAGGCGT TCGGCCCGCT CATCCAGGAA ATCCGCCGCG AGCTCGGCGC CGCCATGCTC
GTCATCGAAC ACGACATGCC ACTGATCATG AGCATCAGCG ACCGCGTCTA CTGTCTCGAA
ACCGGCCAGA TCATCGCCAG CGGAACGCCG GACGCCGTCC GCAACGACCC CAGGGTCATC
GCCAGCTATC TCGGCACCGA CGAGCGCGCC ATCGAACGCA GTGGAAAAGC CCCGGCGGCC
CCGGACCCCG AGGACCACCA CACGCCGACC GACGCGGTCG TGCCCTCGTA A
 
Protein sequence
MEIPTTQLLF DGAVTGLVIG LLAVGIVLVH RSTRVINFAV ANMGLVGSAL FALLTVRYNV 
PYWISLAIAL LVGVLFGALV DLTVIRRLFA APRVILLVAT IGVAQLALTV VTSLPDLDDY
PSESYPVPWS GSWSPASGLT ITGAQLSVLV TVPLVALGLS LFLGRTVLGR TVKAAADNPE
LARLQGISPK TVSTAVWAVA GLIGTLCLIL VSAQNQSLTQ ITTLGPTTLL RALAAAVIAR
MTSFRVALLA GIALGLAQSF IQFNWLDQPG LTDLTILVVV LVAVFFVSRG RDTETSSFSF
APKIKPVPDR LRGLWWVRHL ERAPLVLLGL VAVVLPLLVA QPSRHLLYTV ILGYAICAAS
ITILTGWAGQ LSLGQMAFAG LGALLAAALN RGLRLQVGGS VMALNALPFP AAVAIATAFT
AALAAVVGAG ALRVRGLLLA VSTFAFGIAA EQYLYRRDVF HDEGGNTASF PRSTVFGIDV
TTQRAYYYLV LIVLVLVLLV VARLRRSGVG RTTIGVRDNP ATAAAYTVAP TRVKLRAFAL
AGGIAGLGGS LLAGAVQNVP YADRYFLSPD SLILVSIVVI GGLGSVTGPL LGSLWVIGLP
SFFPDNDIVP LLTSSLGLLI LLLYFPGGLV QIGYSARDAL LAWADRRLGT TPAVKSVSTV
PPALTRTTRP PLPENTPVLE TSDIRVRFGG RAAVDGVSLT VMPGEIVGLI GTNGAGKSTL
MNAIGGFVPA TGTVTLLGRD VSNASPAARA RGGVGRTFQA AALFPELTVR ETVQIALEAR
GRTGLPSTAL HLPHTFRAER AKRSAADDLI DFLGLGRYAD AFIADLSTGT RRIVELTGLL
ALDAQVLCLD EPTAGVAQRE TEAFGPLIQE IRRELGAAML VIEHDMPLIM SISDRVYCLE
TGQIIASGTP DAVRNDPRVI ASYLGTDERA IERSGKAPAA PDPEDHHTPT DAVVPS