Gene Franean1_3152 details

Gene Information

Locus tag: Franean1_3152
Symbol:
ID: 5671529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3707966
End bp: 3710797
Gene Length: 2832 bp
Protein Length: 943 aa
Translation table: 11
GC content: 67%
IMG OID: 641242047
Product: ABC transporter related
Protein accession: YP_001507467
Protein GI: 158314959
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0411] ABC-type branched-chain amino acid transport systems, ATPase component
TIGRFAM ID:
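
The start, end, gene length and protein length fields above are mutually consistent. A minimal Python check, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 3707966
end_bp = 3710797

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the final codon is assumed to be a stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2832 0 943
```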


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTGC CTCCTACGCA GCTGCTGTTC GACGGTGCGG TGAGCGGTCT GGTGATAGGA 
CTGCTGGCGG CCGGGATCGT GCTGGTGCAC CGGGCGACAC GTGTCATCAA CTTCGCTGTC
GCGAACATGG GCCTGGTCGG CTCCGCGCTA CTCGCGCTGC TGGTGGTCCG CTACAACGTC
CCCTACTGGA TCGCGCTCGT CTTGGTCCTC GCTGTGGGCG CGCTGTTCGG AGCGATCATC
GACCTGGCGG TGGTCCGCCG GCTGTTCACC GCACCACGGG TGATCGTGCT GGTCGCCACG
ACCGGCGTCG CGCAGCTCGC CCTGACGATC GTGACGGCTT ACCCGAAACT CGACGACTAC
CCGACCAGCT CCTACCCGCT GCCCTGGACG GGCACCTGGA CGCCGTTCGA CGACCTGCAG
ATCACCGCCG CGCAGCTCAG CATCCTGGTC ACGGTGCCCG TCGTCACACT CCTGCTCGGA
CTGTTCCTCA GCCGCACCGT GCTGGGCCGG ACGGTACAGG CCTGCGCGGA CAACCCCGAA
CTCGCCCGGC TCCAGGGCAT CAGCCCGAAA ACCATGTCCA CCGTGGTGTG GGCAGTCGCC
GGCCTGCTCG CCACACTCTG TCTCATCCTC ATCTCCGCGC AGAACCGCTC GCTGACGCAG
ATCACGACGC TCGGCCCGAC CACGCTGCTG CGGGCGCTGG CCGCGGCGGC CATCGCCCGG
ATGATCTCGT TCCGGATCGC CCTGCTCGCT GGTGTCGTCC TCGGGCTGCT GCAGTCCTTC
GTCCAGTTCA ACTGGCTCGA CCAGGCCGGC CTGACCGACA TGGTCATCCT GATCGTGGTT
CTTGCGGCAG TTTTCCTCGC CAGCAGAAGC CACGACACCG AAACCTCGTC GTTCTCCTTC
GCCCCCAAGG TCCGCCCGAT CCCCGAGCGG CTGCGCAGGC TGTGGTGGAT TCGCAACCTC
GAGCGGGCAC CCCTGCTCCT CCTCGGCCTG ATCGCCGTCA TCCTGCCGCT GGTGATAGGT
CAGCCGTCAC GCCAACTGCT CTACACCGTC ATCCTCGGCT ATGCGATCTG CGGCGCCTCC
GTCACGATCC TCACCGGCTG GGCTGGGCAG CTCTCCCTCG GTCAGATGGC CTTCGCCGGC
CTCGGAGCCC TTATCGCCGC GACCCTCAAC CGCGGACTGC ACCTGACGGT CGGCGGTACC
ACGATCAGTG CGAGTCCCCA ACCCTTCGCC GTCGCCGTCG CACTGGCCAT GCTGGTCACC
GCCGTTCTGG CGACCCTGGT CGGTATCGGC GCGCTACGGG TACGCGGCCT CCTGCTGGCG
GTCAGCACCT TCGCGTTCGG CATCGCGGCC GAGCAGTACC TCTACCGCCG CGAATTCCTG
CACGATGAGG GCACCACTAC CGCGTCGTTT CCCCGCGCCA GCGTATTTGG AATTGACATC
GGCACTCAGC GCGCCTATTA CTATCTGGTC CTGGTGGTAC TGGTCATCGT CATGGTCGTC
ATCGCGCGTC TACGCCGCTC GGGCGTGGGA CGGACCACGA TCGGGGTGCG AGACAATTCC
GTCGCCGCCG CGGCCTACAC GGTCGGTCCC GTCAGGGTGA AGCTCCGCTC GTTCGCGCTC
GCGGGCGGAC TGGCGGGACT TGGCGGAGCG TTGCTGGCAG GTGCAGTGCA GGAGGTGCCC
TACACCGACC GGTACTTCCT CAGCACGGAC TCACTGGTGC TGGTGTCCGT CGTCATCATC
GGCGGCGTGG GATCAGTGAT TGGCCCTGTG CTCGGCTCAC TGTGGGTGGT CGGCTTACCA
GCCATCTTTC CCGACAACAC CATCGTACCG TTGCTAACCT CCAGCCTGGG CCTGCTCCTC
CTGCTGCTGT ACTTTCCCGG CGGACTGGTC CAGATCGGCT ACAGCGCCCG CGATGCCCTC
CTCGCCTGGG CCGACCGGCG CCTCGGCCCC GTCACAGCAC CGGTGAAAGT AGCCACGGCC
AGGCCGGTCG CGCTCACACG TACTCCCAGC ACGCCGCTTC CGGCAGGCAA GCCAGTGCTC
GCCACCACCG ACGCGCGGGT ACGTTTCGGT GGACGGGTCG CGGTCGACGG CGTCTCCGTC
CAGATCATGC CCGGCGAGAT CGTCGGGCTG ATCGGTACCA ACGGCGCCGG CAAATCCACC
CTGATGAACG TCGTCGGCGG GTTCGTGCCC GCCACCGGGA CCGTGACGTT GCTGGGCCAA
GACGTCTCCC GTACCAGCCC CGAGAAGCGT GCGAGACTCG GGCTCGGCCG CACATTCCAG
GCAGCATCCC TGTTCCCGGA GCTCACCGTT CGGGAGACCG TACAGATCGC GCTGGAGGCC
CGCGGTCGCA CCACGTTGCT GTCCACCGCC CTGCACCTAC CGCACACCTT CGCCCGCGAA
CGCGCCAAGC GTTCCGAGGC AGAGGACCTG ATCGACTTCC TCGGCCTCGG CCTCTACGCC
GACGCCTTCA TCGCCGACCT GTCCACCGGC ACCCGTCGCA TCGTCGAACT CGCGGGCCTC
CTCGCCCTCG ACGCCCGCGT GCTCTGCCTC GACGAACCCA CCGCCGGCGT CGCCCAGCGC
GAGACCGAGG CGTTCGCTCC GCTCATCCAG GAGATCCGCC GCGAACTCGG CGCGGCCATG
CTCGTCATCG AACACGACAT GCCACTGATC ATGAGCATCA GCGACCGCGT CTACTGCCTC
GAGACAGGCA GGGTCATCGC CGCCGGTCCT CCTGGCACCG TCCGCAACGA CCCCAAAGTC
ATCGCCAGCT ACCTCGGCAC CGACGAACGC GCCATCGAAC GCAGCGGAGT CTCCACCACC
GCGACGGCGT AG
 
Protein sequence
MELPPTQLLF DGAVSGLVIG LLAAGIVLVH RATRVINFAV ANMGLVGSAL LALLVVRYNV 
PYWIALVLVL AVGALFGAII DLAVVRRLFT APRVIVLVAT TGVAQLALTI VTAYPKLDDY
PTSSYPLPWT GTWTPFDDLQ ITAAQLSILV TVPVVTLLLG LFLSRTVLGR TVQACADNPE
LARLQGISPK TMSTVVWAVA GLLATLCLIL ISAQNRSLTQ ITTLGPTTLL RALAAAAIAR
MISFRIALLA GVVLGLLQSF VQFNWLDQAG LTDMVILIVV LAAVFLASRS HDTETSSFSF
APKVRPIPER LRRLWWIRNL ERAPLLLLGL IAVILPLVIG QPSRQLLYTV ILGYAICGAS
VTILTGWAGQ LSLGQMAFAG LGALIAATLN RGLHLTVGGT TISASPQPFA VAVALAMLVT
AVLATLVGIG ALRVRGLLLA VSTFAFGIAA EQYLYRREFL HDEGTTTASF PRASVFGIDI
GTQRAYYYLV LVVLVIVMVV IARLRRSGVG RTTIGVRDNS VAAAAYTVGP VRVKLRSFAL
AGGLAGLGGA LLAGAVQEVP YTDRYFLSTD SLVLVSVVII GGVGSVIGPV LGSLWVVGLP
AIFPDNTIVP LLTSSLGLLL LLLYFPGGLV QIGYSARDAL LAWADRRLGP VTAPVKVATA
RPVALTRTPS TPLPAGKPVL ATTDARVRFG GRVAVDGVSV QIMPGEIVGL IGTNGAGKST
LMNVVGGFVP ATGTVTLLGQ DVSRTSPEKR ARLGLGRTFQ AASLFPELTV RETVQIALEA
RGRTTLLSTA LHLPHTFARE RAKRSEAEDL IDFLGLGLYA DAFIADLSTG TRRIVELAGL
LALDARVLCL DEPTAGVAQR ETEAFAPLIQ EIRRELGAAM LVIEHDMPLI MSISDRVYCL
ETGRVIAAGP PGTVRNDPKV IASYLGTDER AIERSGVSTT ATA
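
The relationship between the gene sequence and the protein sequence can be sketched in Python using only the standard library. This is an illustrative sketch, not the database's own pipeline: `translate`, `gc_fraction`, `clean` and `CODON_TABLE` are hypothetical names. It assumes the bacterial code (translation table 11) assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGGAACTGC CTCCTACGCA GCTGCTGTTC GACGGTGCGG TGAGCGGTCT GGTGATAGGA"
print(translate(first_line))  # MELPPTQLLFDGAVSGLVIG
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the protein sequence and the GC content listed in the record.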