Gene Franean1_4737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4737 
Symbol 
ID5673079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5656225 
End bp5659047 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content72% 
IMG OID641243594 
ProductABC transporter related 
Protein accessionYP_001509010 
Protein GI158316502 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0410] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGAGT TCCTCAACCT CATCGTCGGC GGCGCCGTCG CCGGCGGTCT CTACGCCATC 
CTGGCGGCCG GCGTGGTCTT GACCTATCAG ACGTCGGGCA TCTTCAACTT CGCGCACGGT
GCGGTCGCCT TCGCCACCGC CTATCTGTTC GTGCAACTCA ATGTCACCGC CGGCGTGCCG
GTCGTCCCCG CGGCGATCAT TTCCATTCTT GTCTTCGCGC CGCTGCTGGG CCTGTTACTG
GACAGGCTGG TCTACCGGCG GCTCGCACAC GCCCCGTTGG CCGTGAAGAT CGTGGTACCG
ATCGGGCTGC TGATCGCGGT GCCCGGGCTC TGCCTGTTCA TCTCCAGCCG GCTGAACCTG
TGGTTCCACC TGGACATCGC CGGCATCGAG GATCTGTTCC TGATCCCCGG TCTGGGGCCG
ACCCCGAAAA AGACGTGGAC GCTCGGCGGG GTGCTGCTGG ACAGCAACCA GATCGCCGTC
TTCGGCGCCG CTGTGCTCAC GGCCGTCGGT CTGTGGGTCC TGCTGCAGCG CACACGGCTC
GGGCTCCAGA TGCGGGCCGT GGTGGACCGG CGGAGCCTGG CCGCCCTGCG CGGCGTCGAT
CCCGACCGGA CCTCCGCCGT GTCGTGGATG CTCGGCAGCT TCCTCGCCGG GCTCGCCGGC
GTGCTGGTCG CCCCGATCTT CACGCTCAAC ACCCCGGTGT TCACCACGGT CGTGCTGATC
TCGACGCCGG CCGTCGTGTT CGCCCGGTTC CGCTCGATGC CCCTCGCGCT CGCCGGCGGC
CTGCTGATCG GCGTCCTGCA GAACCTCATC GTCGGCTACG CGGACTTCGC CCAGAACATC
TCCGGGTTCA GCACGTCGGT GCCGTTCATC CTGCTACTCG GGCTGCTGTT CGTGTTCGCC
GTGGACCGCA GCCGGCGGGC CGGCGCGCAC GCCGACGACG ACCCGCCCCC GCCGTCCTAC
GGGACGGAGT CGAAACGCCG CAAGGTCGTC ACGTGGACGG TGTGGAGCGT CGTCGTCCTC
GTCTTCGCGC TGTTCGTGGC GGACGAGTAC TGGCAGTCAC TCATCATCCG CGGGCTGGCG
CTGTCGCTGG TGCTGCTCTC GTTCACGGTG GTGACCGGAG TCGGCGGGAT GGTCAGCCTC
GCGCAGGCTG CGTTCGTGAC CGCCGCCGGC ATCACGGCCG GCTGGGCGGT CAGCCACCAC
TGGCCGTTCG CGGTCGCGCT GCTCCTCGGC ACCGCGGTCG CCACCGCGAT GGGCGTGCTG
GTGTCGCTGC CCGCCCGGCG GCTCGGCGGA CTGCCCCTGG CGCTGGCGAC CCTGGCCCTG
GCGTACCTCT GCCAGAACCT GTTCTTCCAG CTCAGCGGCG TGAGCCGGAA CGACTTCGGT
GGGTGGGTGC TCAGCCCGCC CGCCCTCGGG CCGGTGGACC TCGCCGACCC CCGCTCAATG
ATCATTTTCC TGTCGGTCGT GCTGGGTCTC GCCCTTCTCC TGGTCAGCAA CCTGATCCGG
TCGAGCTCCG GCCGGGCGAT GGCCGCGCTG CGCTCCACCG AGCCGGGCGC GGTGACCATC
GGCATCTCGG CGGGCCGCAC GAAGACCGCG GTGTTCGCCC TGTCCGCGGC CATCGCCGGG
TTCGGCGGTG TGCTGCTCGC GAGTTCGTCC GGGCGCATCA CCCACCTCGA CTATCCGGTC
GAGACGGGCC TGTTCTGGCT CGCCACGGTC GTGCTGTTCG GGGTGCGCCG GCCGGCCGCG
GCCGTGATCG CCGGGCTGAG CGCCGCCGTC AGCCCCGAGA TCCTCAGCCA CGTCGCGGAG
ACGTCCTACC TGCCGCAGGT GCTCTCCGGC CTGGCCGCGA TCAACCTGGC GCAGAACTCC
GACGGCATCC TGGCGCTCAC CGCGCAGCAG CGGTTCGAGC GTCGCCGCAG GCGGCAGCAG
CGGGCGCTGC GCCGGGAGGC CGCCGCCACG GCGGCGACCC CGGTCAGCGC GACCCCGAAC
AGCGCGGCTG CGGTCACCGC GCCGGGCACC GAGGCCTCGG AGGGTGCCTC GACCGTGCTC
GAGCTGCGTG ACGTGCACGC CGCCTACGGC GCCGTGGAGG TGCTGCACGG GGTCAGCCTG
ACGCTGCGGG CGGGCGAGGT GCTCGCCCTC ATCGGGGCCA ACGGGGCCGG CAAGTCGACG
ATCTGCGGGG TGGTCACCGG CGGCGTCCCG GTGACGCACG GCACCGTGCG GCTCGACGGC
CAGGACGTCA CGGGGCTGCC ACCGCACCGC AGAGTCCGGC ACGGCGCCTT CCTCATCCCC
GAGGGCCGCG GCATCTTCCC CGCTCTGACC GTGGACGAGA ACCTCTCCCT GTGGCTCCCG
TCCGCCGATG ACCGGGACTC CGCCTACCGC CGCTTCGCGG TGCTCGGCCG GCGCCGCGGC
CAGCTCGCCG GATCCCTGTC CGGCGGCGAG CAGCAGATGC TGGCCCTGGC GCCGGCGCTG
GTGAGGCCCC CCGCCGTCCT GATCGTGGAC GAGCCCTCGC TCGGCCTGGC CCCCCTCGTC
GTCGCCGAGG TCTACGCCGC CCTGGAGGAA CTGCGGGCCA CCGGGACCGC GATCCTGCTG
GTCGAGGAGA AGGCCCACGA CGTGGTCGCG CTCGCCGACA CCATCGCCTT CATGGCCGTC
GGCCGGGTGG CCTGGGCGCA GCGGACCGCG GACGTCGACG CGGACCTGCT GGTGCAGTCC
TACCTCGGGA TCAGCGACAC GCCGGCCGGG CCCTCCGGGT CACCGGCAGC CCTGGCCGGC
AGCACGACCG GCCCCAGCAC GACCGGCCCC GCCGTCCCGC AACCCGCGAA GGAGCGTCCA
TGA
 
Protein sequence
MEEFLNLIVG GAVAGGLYAI LAAGVVLTYQ TSGIFNFAHG AVAFATAYLF VQLNVTAGVP 
VVPAAIISIL VFAPLLGLLL DRLVYRRLAH APLAVKIVVP IGLLIAVPGL CLFISSRLNL
WFHLDIAGIE DLFLIPGLGP TPKKTWTLGG VLLDSNQIAV FGAAVLTAVG LWVLLQRTRL
GLQMRAVVDR RSLAALRGVD PDRTSAVSWM LGSFLAGLAG VLVAPIFTLN TPVFTTVVLI
STPAVVFARF RSMPLALAGG LLIGVLQNLI VGYADFAQNI SGFSTSVPFI LLLGLLFVFA
VDRSRRAGAH ADDDPPPPSY GTESKRRKVV TWTVWSVVVL VFALFVADEY WQSLIIRGLA
LSLVLLSFTV VTGVGGMVSL AQAAFVTAAG ITAGWAVSHH WPFAVALLLG TAVATAMGVL
VSLPARRLGG LPLALATLAL AYLCQNLFFQ LSGVSRNDFG GWVLSPPALG PVDLADPRSM
IIFLSVVLGL ALLLVSNLIR SSSGRAMAAL RSTEPGAVTI GISAGRTKTA VFALSAAIAG
FGGVLLASSS GRITHLDYPV ETGLFWLATV VLFGVRRPAA AVIAGLSAAV SPEILSHVAE
TSYLPQVLSG LAAINLAQNS DGILALTAQQ RFERRRRRQQ RALRREAAAT AATPVSATPN
SAAAVTAPGT EASEGASTVL ELRDVHAAYG AVEVLHGVSL TLRAGEVLAL IGANGAGKST
ICGVVTGGVP VTHGTVRLDG QDVTGLPPHR RVRHGAFLIP EGRGIFPALT VDENLSLWLP
SADDRDSAYR RFAVLGRRRG QLAGSLSGGE QQMLALAPAL VRPPAVLIVD EPSLGLAPLV
VAEVYAALEE LRATGTAILL VEEKAHDVVA LADTIAFMAV GRVAWAQRTA DVDADLLVQS
YLGISDTPAG PSGSPAALAG STTGPSTTGP AVPQPAKERP