Gene Franean1_4668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4668 
Symbol 
ID5673010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5572543 
End bp5574324 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content73% 
IMG OID641243525 
ProductABC transporter related 
Protein accessionYP_001508941 
Protein GI158316433 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.206506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.270738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGTA GGCCCGCGCA GAACCGCGGG TCTGCGGGGG GCAGAGCGCA CGCCCTGCGC 
ACGCTGCTCG GCCACGCCGG CGACGGCCGA CGGCTGCTCG TCGGCGGCTA CGCGCTCATC
CTCGTCGACG CGGTCGCGCA GAGTCTCACG CCCGCGGTGT TCCGGGTCGT GCTGGACCGC
ATCCAGCAGG ACCCGCACCG GTTCCTGCAC GACGGCTGGC AGGGGCCCGT GGTCGCCGCG
GTCGCCGTCT CCGCGACGTT CCTCGTGGCG GCCTACTTCG CCCACACCTG GTGCCGTCGG
GGTGCGACGC GATGGGCGAA CAACCTGCGA CGGGCCCTGT ACGAGCACGT CCAACGACTG
TCGATGGACT TCTTCCACCG GTCGCGGGTC GGCGATGTCG CCGCCGCGAT CAACCAGGAC
ATCGAACGCC TCGAGGTGAC GGTCTGGCAG GGCCTGTCGG TGTGGTGGGC GCTGGCGGTC
CTGCTCATCT CGGTGGGCCT CGTCGCCTGG GTCGACGCCT GGATGGCCCT GATCTCGCTC
GCCCTGCTCG CGGTGGCCGT CGGGTGGACG CTGCTGGTGC TCCCACGGCT TCGCCGCCAC
AGCCGCGACA TCCGCGACGA GCTCGGCCGC ACCTCCGGAA CCCTCGCCGA GATGCTCGGC
GTGAACACCC TTCTCAAGGC GTTCAACGCG GAGGACGATG CCCTGCGCCA GGTCCGGGGC
GGTACCGACC GGGTGCGGGC CGGGTCCGAG GCCCTGGCGC GGCTCCAGCA CCGCTACGCC
GACCCGCTCG GCTTCCACCT CTCGTTCGTC GCCCCGTTCC TGCTCCTGTT CGTCGGCGCG
TGGCGGGCCG CGTCGGGGAC GCTGTCCATC GGCGACGTCG TCGCCGTCTG GGGCTTCTGG
CTGCGCGGGT CGAGCTCGCT CACCGTCGTC ATGACCAGCC TCCCCGAGGT GCTGGCCGGC
CTGGCCGCGA GCGAACGCGC CGCGAAGCTG TTCGAGGAGC GACCCGCGGT GAGTGATCTC
CCGCGGGCCC CGGCGCTGGT TGTCGCCCGC GGCGGCATCG TCTTCGAGCG GGTCTCGTTC
GCCTACCCCG GACGGCGGTC GCACCTGGTG CTCGACGGCT TCGACCTGAC GATCGCCCCG
GGGCGCAGCG TCGCGCTCGT CGGCTCCTCC GGCGCCGGCA AGTCCACCGT CGCGCAGCTG
CTGCTCCGGC TGTTCGACCC GGCCGCGGGC CGGGTGACGA TCGACGGCCA GGACCTCCGT
GCGGTCACGC AGGCCTCGGT GCGTGCGGCG GTGGGGGTGG TCTTCCAGGA GTCGGTACTG
ATCAGCGGCT CACTGGCACG CAATCTGCGG CTCGCCCAGC CGACCGCCAC CGACCAGGAC
ATCGAGGCGG CGCTGCGGGC GGCGAACGCG TGGGAGTTCG TCCGCGCCTG GGAGGAGGGC
ATTCACACGG AGCTCGGCGA ACGCGGCGCC ATGCTCTCCG GGGGGCAGCG CCAGCGCCTC
GCCATCGCCC GGGTGATGCT GAAGGATCCC CCCATCGTCG TGCTCGACGA GGCGACCAGC
GCGCTCGACG CCGGCAGCGA ACGGCTCGTG CTCGACGCCC TGGACCGGCT CCTGTCCGGG
CGGACGTCCC TGATCATCGC CCACCGGATC GCCACCGTCC GCCACTGCGA CCAGATCGTC
GTGATGGAGC GCGGCCGGGT CGCTGACGCC GGCACGCACT CCTCACTGCT CCAGTCCTCG
GCGACGTACC GCTCGTACTG CCGCGAGCAG TCGGTGGCCT GA
 
Protein sequence
MTGRPAQNRG SAGGRAHALR TLLGHAGDGR RLLVGGYALI LVDAVAQSLT PAVFRVVLDR 
IQQDPHRFLH DGWQGPVVAA VAVSATFLVA AYFAHTWCRR GATRWANNLR RALYEHVQRL
SMDFFHRSRV GDVAAAINQD IERLEVTVWQ GLSVWWALAV LLISVGLVAW VDAWMALISL
ALLAVAVGWT LLVLPRLRRH SRDIRDELGR TSGTLAEMLG VNTLLKAFNA EDDALRQVRG
GTDRVRAGSE ALARLQHRYA DPLGFHLSFV APFLLLFVGA WRAASGTLSI GDVVAVWGFW
LRGSSSLTVV MTSLPEVLAG LAASERAAKL FEERPAVSDL PRAPALVVAR GGIVFERVSF
AYPGRRSHLV LDGFDLTIAP GRSVALVGSS GAGKSTVAQL LLRLFDPAAG RVTIDGQDLR
AVTQASVRAA VGVVFQESVL ISGSLARNLR LAQPTATDQD IEAALRAANA WEFVRAWEEG
IHTELGERGA MLSGGQRQRL AIARVMLKDP PIVVLDEATS ALDAGSERLV LDALDRLLSG
RTSLIIAHRI ATVRHCDQIV VMERGRVADA GTHSSLLQSS ATYRSYCREQ SVA