Gene Franean1_3248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3248 
Symbol 
ID5671622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3838494 
End bp3841373 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content71% 
IMG OID641242140 
ProductABC transporter related 
Protein accessionYP_001507560 
Protein GI158315052 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0411] ABC-type branched-chain amino acid transport systems, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTCC CCACCGCGCA ACTCCTGTTC GACGGGGCGA CCACCGGACT GGTCATCGGC 
CTGCTCGCCG TCGGCATCGT CCTGGTCAAC CGGGCGACCC GGATCATCAA CTTCGCTGTC
GCGAACATGG GCCTCGTCGG CTCCGCGCTC TGCGCGCTGC TGGTCGTGCG CTACAACGTG
CCCTACTGGA TCGGGCTGGC CGCCGCGCTC GCCGCCGGGG CGCTGTTCGG CGCCATCATC
CATGTGGGCG TCATCCGCCG GCTGTTCACC GCCCCCCGCG TCATCGTCCT GGTCGCCACG
ATCGGAGTCT CCCAGCTCGC GCTGACCATC GTGAACGCCT ATCCGGACCT GAAGGACCAC
GCCGACCAGT CCTACCCGGT GCCCTGGTCC GGGACCTGGT CACCGGTGGA CGGCGTGCAG
GTCACCGGCG CGCAACTCAG CATTCTCGTG GTCGTCCCCG TCGTGACCGC CGGGCTCTCC
CTGTTCCTGA ACCGCACCGT CCTCGGCAAG ACGGTGAAGG CCTCCGCGGA CAACCCCGAG
CTCGCCCGGC TGCAGGGCAT CAACCCGAAG ACCATTTCGC TGGCCGTCTG GACGGCCGCC
GGCTTCCTCG GCACGCTGTC CATGATTCTG GTCGCCGCGC AGGAGAAGTC CCTGGCGCAG
GTCACCACCC TCGGGCCGAC GACGCTGCTG CGGGCGTTGG CCGCCGCCGT GATCGCCCGG
ATGGTCTCGG TCCGCATCGC GCTGGTCGCC GGCATCGGCC TCGGGCTGTT GCAGTCGTTC
GTGCAGTTCA ACTGGCTCGA CCAGCCGGGC CTCACCGACA CCGTCATCCT CGTGATCGTC
CTGGTTGCCG TGTTCTTCAC CAGCCGGGGC AGGAGCACCG AGGTCTCGAC GTTCTCGTTC
GCGCCCAGGG CCCGCCCGGT CCCCGAGCGG CTGCGCGAGC TGTGGTGGGT CCGGCATCTC
GAGCAGGCTC CACTGCTGCT TCTCGGGCTG TTCGCGCTGC TGCTGCCGGT CGTGGTCACC
CAGCCCTCAC GCCACCTGCT CTACACGATC ATCCTCGGCT ACGCGATCTG CGCGGCTTCG
GTCACCGTCC TCACCGGCTG GGCCGGCCAG CTCTCGCTGG GCCAGATGGC CTTCGCGGGC
CTCGGCGCGC TCACCGCCGC CGCCCTGTTC CGCGGCCTCC GCCTGGACAT CGGCGACACG
TCACTGGCGA TCAACGCCCT GCCGTTCCCG GCGGCGGTCC TGATCGCCAC GGTTCTCACC
GCGGCCATCG CCGCGGTCAT CGGCCTGGGC GCGCTCCGGG TACACGGACT CCTGCTCGCG
GTGAGCACCT TCGCGTTCGC CGTCGCCGCC GAGCAGTTCC TCTACCGGCG GGAGGTCTTC
CACGACGAAG GCAGCAGCGC CGCGTCCTTC ACCCGCGGGA CCCTCTTCGG GATCGACATC
GCCAGTCAGC GCACCTACTA CTACGTGGTG CTGGTGACCC TGGCCATCGT CATGGCCGTC
GTGTCCCGGC TGCGCAAGTC CGGTATCGCC CGCACCACCA TCGGTGTCCG GGACAACCCC
ACCACGGCCG CGGCCTACAC CGTCAGCGCC ACCGGCGTGA AACTGCGCGC CTTCGCTCTC
GCGGGCGCGC TGGCAGGCCT CGGCGGCGCG CTGCTCGCCG GCGCGCTGCA GACAGTCCCC
TACAACGACA TGTACTTCCG CAGCCCGGAC TCCCTCGTCC TCGTCTCCAT CGTGGTCATC
GGCGGGCTCG GCTCCGTCTA CGGTCCGCTG CTCGGCTCGC TGTGGGTGAT CGGCCTGCCC
TCCTTCATGC CCGACAACGA CATCGTGCCG CTGCTCTGTT CGAGCATGGG CCTCCTCGTC
CTGCTGCTCT ACTTTCCGGG CGGCCTGGTC CAGGTGGGCT ACAGCACCCG CGACGCCATC
CTGGGCTGGG CCGAGCGGCG CCGCGGGGAC GCGCCCACCA GCAAGACCGC CACCGCCCCG
CCCGCCGCGC TGACCCGCAC CGACCGGGAG CCCCTGCCCG CCGGGCGGCC CGCGCTCACG
ACGAGCGGAA TCCGGGTGCG TTTCGGCGGG CGGACCGCCG TCGACGGAGT CTCGATCGAG
GTGATGCCCG ACGAGATCGT CGGTCTCATC GGGACCAACG GCGCCGGCAA GTCCACCTTC
ATGAACGCCG TCGGCGGCTT CGTCCCCGCC GCCGGCGCCG TCACCATCCT GGGCCACGAC
GTCTCCACAG CAAGCCCCGC GGCGCGGGCC AAGGTCGGCC TCGGCCGTAC CTTCCAGGCC
GCGACGCTGT TCCCCGAGCT CACCGTCCGC CAGACCGTGC AGATCGCGCT GGAGGCTCGG
GGTCGCACCG CGTTCCTGTC CACCGCCCTG CACCTGCCGC AGACCTTCGC CCGCGAGCGC
GCCAAGCGGT CCGAGGCCGG CGACCTCATC GACTTCCTCG GCCTGGGCCG CTACGCCGAC
GCCTTCGTCG CGGAACTCTC GACCGGAACC CGCCGCATCG TCGAACTCGC CTGCCTGCTC
GCGCTCGACG CGAAGATGCT CTGCCTCGAC GAGCCCACGG CAGGCGTCGC CCAACGCGAG
ACCGAGGCCT TCGGGCCGCT CATCCAGGAG ATCCGCCGCG AACTCGGCGC CGCGATGCTC
ATCATCGAGC ACGACATGCC GCTGATCATG GGAATCAGCG ACCGTGTCTA CTGTCTCGAA
GCCGGGAAGG TCATCGCTGC CGGGGTACCC GGCGCCGTCC GCAACGACCC CAGGGTCATC
GCGAGCTACC TCGGCACCGA CGAACGCGCC ATCCAGCGCA GCGGCGCCAC CGTCGACATG
CCCGGTCCGG CGGCCGTCGA CGACCGCGCC CCGACCGGCA ACCCGACAGC CGCGGTGTGA
 
Protein sequence
MQLPTAQLLF DGATTGLVIG LLAVGIVLVN RATRIINFAV ANMGLVGSAL CALLVVRYNV 
PYWIGLAAAL AAGALFGAII HVGVIRRLFT APRVIVLVAT IGVSQLALTI VNAYPDLKDH
ADQSYPVPWS GTWSPVDGVQ VTGAQLSILV VVPVVTAGLS LFLNRTVLGK TVKASADNPE
LARLQGINPK TISLAVWTAA GFLGTLSMIL VAAQEKSLAQ VTTLGPTTLL RALAAAVIAR
MVSVRIALVA GIGLGLLQSF VQFNWLDQPG LTDTVILVIV LVAVFFTSRG RSTEVSTFSF
APRARPVPER LRELWWVRHL EQAPLLLLGL FALLLPVVVT QPSRHLLYTI ILGYAICAAS
VTVLTGWAGQ LSLGQMAFAG LGALTAAALF RGLRLDIGDT SLAINALPFP AAVLIATVLT
AAIAAVIGLG ALRVHGLLLA VSTFAFAVAA EQFLYRREVF HDEGSSAASF TRGTLFGIDI
ASQRTYYYVV LVTLAIVMAV VSRLRKSGIA RTTIGVRDNP TTAAAYTVSA TGVKLRAFAL
AGALAGLGGA LLAGALQTVP YNDMYFRSPD SLVLVSIVVI GGLGSVYGPL LGSLWVIGLP
SFMPDNDIVP LLCSSMGLLV LLLYFPGGLV QVGYSTRDAI LGWAERRRGD APTSKTATAP
PAALTRTDRE PLPAGRPALT TSGIRVRFGG RTAVDGVSIE VMPDEIVGLI GTNGAGKSTF
MNAVGGFVPA AGAVTILGHD VSTASPAARA KVGLGRTFQA ATLFPELTVR QTVQIALEAR
GRTAFLSTAL HLPQTFARER AKRSEAGDLI DFLGLGRYAD AFVAELSTGT RRIVELACLL
ALDAKMLCLD EPTAGVAQRE TEAFGPLIQE IRRELGAAML IIEHDMPLIM GISDRVYCLE
AGKVIAAGVP GAVRNDPRVI ASYLGTDERA IQRSGATVDM PGPAAVDDRA PTGNPTAAV