Gene Franean1_5562 details

Gene Information

Locus tag: Franean1_5562
Symbol:
ID: 5673892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6741806
End bp: 6744481
Gene Length: 2676 bp
Protein Length: 891 aa
Translation table: 11
GC content: 73%
IMG OID: 641244418
Product: ABC transporter related
Protein accession: YP_001509822
Protein GI: 158317314
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
TIGRFAM ID:
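
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(6741806, 6744481)
print(length)                     # 2676
print(protein_length_aa(length))  # 891
```

Both values match the Gene Length and Protein Length fields listed in the record.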


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.489183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGC GGGCCGTCTT ACAGTTCGCG TTGTTAGGGC TCGGTGTCGG CGGTGTCTAC 
GCGGTAAGCG CGGTCGGCAT CGTGGCGATC CACCGCGGGT CGCGGACAAT CAACTTCGCG
CACGGCGGCA TCGCGCTGTG GGGCGCGCTG GTGTTCTACT GGCTGCGCGA CGACCAGGGG
ATGGCGGCCG GCCTGGCCGC GGTGCTGACG CTGCTGAGCG CCGCCGCCTT CGGCGCCGCC
GTCTACCTGC TGGTCATCCG GATCATCCGG CACCGCACCC AGCTCTCGCG GATGGTGGCG
ACCCTGGGAA TCCTCGGCAC CCTGATCGGC CTGGCGCACT CGGTGTTCGG GAGCGCCCAG
CGGGTGCCCG ACACGTCCTT CCCGTCCTCG GGCGTGGACG TGTTCGGCCA GACGATCCCC
TCCGAGCGGA TCATCATCTT CGCGCTCGCG GTCGTGGTCG TGCTGGCCCT GTCGGCCTGG
TCGGCGTGGG CGCGGCTGCC GCTGATGACG AGGGCGATGG CGGAGCACGA GTCCGCCGCC
CAGGCCCTCG GCGCGTCACC GCACCTGCTC GGCAGCCTGA ACTGGGCACT CGGATCCGTG
CTGGCCGCCG CCTCCGGCAT CCTCGTCGCC CCCATAGTGG GGCAGTTCGA CACCGCGATG
ATGACGGTGA TGGCCTTCGC CCTGGCCGCC GCCCTGCTCG GCCGGTTCGA CAGCTACCTG
CTCGCGCTGG CCGGCGGTAT CGCCCTCGGC GTTGGAGAGA GCGTGGTCAC CCACCTGGTG
GCCGAGCACG TCCCGAGCCG CTTCCAGCTC GGCTGGCCGC AGACGGTTCC CTTCGTGGCC
GTCATGGCCA TCCTCACCCT GCGCCGGGAC AGGGCCAGCA ACCGGCTGCC CGCGGTCCCC
GCCGCGCCGG TCGCGGCCGG GCTGTTCCGG CCGGTTCCCG TCGGCGTCGC GCTGGTCGGG
ATCATCGTCG TCCTGCTCGT CGGTGACGCC CGGTGGCGCG ACGCCGCGAC CCTCTCGATC
GTCTTCGGCA TCCTGGTGCT GTCGCTGGTG GTCCTCATCG GGTACGCCAA CCAGATCAGC
CTGGCGCAGA TGACGGTGGC CGGGCTCGGG GCCTACGCCG CCGTGCGACT GGACATCGAC
CACTCGCTGC CGTTCGTGCT GGCGCCGGTG GCCGGAGCGG CCGTCGGCGC GATCGCCGGG
CTGCTCGTCG GCCTGCCCGC GCTGCGGGTG CGCGGCATCA ACCTCGGCAT CCTCACCATG
GGGATGGCCG TCGCCGTCTC CGGGGTCCTC TTCGAGAGCA CCCACTACAC CGGCGGCATC
TCCGGCTCCC AGCCGCACCT GCCGACGGTG TTCGGCCTGG ACGTCGACGC CGCGCGCCAT
CCCGACAGGT ACGGGTTCGT CGCACTGTTC TGGCTGGTGG TCGCCGGTGC GGTGGTGGCC
GTCGTGCGGC GGTCCCGGCT GGGGCGCCGG CTGCTCGTGG TGCGGACGAA CGAGCGGGCC
GCCGCGTCGG TCGGCGTCAG CGTCGCGCGG GCGAAGCTGT CCGCGTTCGT GATCTCGTCG
TCCCTGGCGG GCGCGGCCGG CGTGCTGCTG GGCTTCCGCA GCTCCTCGGT CACCTTCACC
CAGTTCTCGT TCATGGAGTC GATCAACCTG GTCAGCCTCG CCGTGATCGC CGGCGTCACA
TCGATCACGG GCGGCCTGCT CGGCGGGGTG ATGGCCTTCG GAGGCCTCGT CTACCTGCTG
ATCTCCCGGC TCCACATCGG CTTCGTCACG GACAACTACG CCACCATCTT CGGCGCCGCG
CTGGTCGTGA CCGTTCTCGT GCACGAGAAC GGCGTCGCGT GGCACCGGCG GTTCCAGAAC
GACCCGGCCC CGCTCCCGCG GGGGACCCAG CCGGCGCGCG AGGGCGGCGC GCTGGTCGCG
AACGACGTGA CCGTGCGGTT CGGCGGGGTG ACCGCGGTGT CCGCCGCCAC GCTGCGCGCG
CTGCCCGGGG TCGTCACCGG GCTGGTCGGT CCGAACGGAG CCGGCAAGAC CACCCTGCTC
GACGCGATCG GCGGTTTCGC GCCGACTACC AGCGGGAGCG TGCACCTCGG CGATAGATCA
CTCAACGGCG ACGGGCCGGA CGTCCGCGCC CGCTCCGGGT TGGGCCGGGT CTTCCAGGCC
GGCGAGCTGT TCGAGGACCT GACGGTCGCC CAGAACCTGC GCGTCGCCGC GGAGAACGCC
GGCCACACCG ACGGACGGCT GCCCGCCCCG GCCCACACGG CGGTCGAGCG CTTCCGGCTG
GTTGAGGACC TCCCCCGGCT GCCCACCGAG CTGCCGATGG CCAAGCGGCG GATGGTGGGC
ATCGTCCGCG CGCTGGCCGC CAACCCCGCC GTCCTGCTGC TGGACGAGCC CGGCGCCGGC
CTGTCGATCA CGGAGATCGG CGAGCTCTCC GCGAACCTGC GCGATCTCGC GCACGAGGAG
GGCCTCGCCG TCCTGGTGGT GGACCACGAC ATGGCCCTGG TGATGTCCGC CTGCGACCGC
ATCGTCGTCC TGCACCAGGG GCAGGTCCTC GCCGACGGGA CACCGGAGGA GGTCCGGGCC
GATCCCGCCG TCCGGGAGGC CTACCTCGGC GAGGCCACAG AGGCCGTCGT GGCCTCGCCG
GACGCCGTGC TGAGCGAGGC GGCGGTGCCG GACTGA
 
Protein sequence
MDLRAVLQFA LLGLGVGGVY AVSAVGIVAI HRGSRTINFA HGGIALWGAL VFYWLRDDQG 
MAAGLAAVLT LLSAAAFGAA VYLLVIRIIR HRTQLSRMVA TLGILGTLIG LAHSVFGSAQ
RVPDTSFPSS GVDVFGQTIP SERIIIFALA VVVVLALSAW SAWARLPLMT RAMAEHESAA
QALGASPHLL GSLNWALGSV LAAASGILVA PIVGQFDTAM MTVMAFALAA ALLGRFDSYL
LALAGGIALG VGESVVTHLV AEHVPSRFQL GWPQTVPFVA VMAILTLRRD RASNRLPAVP
AAPVAAGLFR PVPVGVALVG IIVVLLVGDA RWRDAATLSI VFGILVLSLV VLIGYANQIS
LAQMTVAGLG AYAAVRLDID HSLPFVLAPV AGAAVGAIAG LLVGLPALRV RGINLGILTM
GMAVAVSGVL FESTHYTGGI SGSQPHLPTV FGLDVDAARH PDRYGFVALF WLVVAGAVVA
VVRRSRLGRR LLVVRTNERA AASVGVSVAR AKLSAFVISS SLAGAAGVLL GFRSSSVTFT
QFSFMESINL VSLAVIAGVT SITGGLLGGV MAFGGLVYLL ISRLHIGFVT DNYATIFGAA
LVVTVLVHEN GVAWHRRFQN DPAPLPRGTQ PAREGGALVA NDVTVRFGGV TAVSAATLRA
LPGVVTGLVG PNGAGKTTLL DAIGGFAPTT SGSVHLGDRS LNGDGPDVRA RSGLGRVFQA
GELFEDLTVA QNLRVAAENA GHTDGRLPAP AHTAVERFRL VEDLPRLPTE LPMAKRRMVG
IVRALAANPA VLLLDEPGAG LSITEIGELS ANLRDLAHEE GLAVLVVDHD MALVMSACDR
IVVLHQGQVL ADGTPEEVRA DPAVREAYLG EATEAVVASP DAVLSEAAVP D
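
The sequence blocks above are printed in groups of ten characters, so the spaces and line breaks need to be stripped before any analysis. The following is a rough sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. All function names are hypothetical. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA), and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, first 30 bases only.
first_bases = clean_sequence("ATGGATCTGC GGGCCGTCTT ACAGTTCGCG")
print(translate(first_bases))  # MDLRAVLQFA
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give a value to compare with the GC content field.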