Gene Franean1_5318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5318 
Symbol 
ID5673652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6405505 
End bp6407187 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content70% 
IMG OID641244175 
Productputative ABC transporter ATP-binding protein 
Protein accessionYP_001509582 
Protein GI158317074 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.268079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGT ACGTCTTCCA GATGCGCAAG GCCCGCAAGG CCCACGGCGA CAAGGTGATC 
CTCGACGACG TCACCCTGGC GTTCCTGCCC GGCGCGAAGA TCGGCGTCGT CGGGCCGAAC
GGTGCCGGGA AGTCGTCGCT ACTCAAGATC ATGGCCGGCC TCGACCACCC GAGCAACGGC
GACGCGATCC TGAGCCCCGG CTACACGGTC GGCATGCTCG CGCAGGAGCC GCGGCTCGAC
GAGGCCAAGG ACGTCCGCGG CAACGTCGAG GACGGCGTCC GCGAGATCCG CGCGGTGCTT
GCCCGGTACG AGGAGATCAA CGAGAAGATG GCCGCGCCCG ACGCGGACTT CGACACGCTC
CTCGCCGACC AGGCGGCGCT GATCGACAAG ATCGAGGCGG CGAACGCCTG GGAGCTCGAC
AGCCAGATCG ACCAGGCGAT GGACGCCCTG CGCCTGCCGC CCGGCGACGC GGACGTCACC
GCGCTCTCCG GTGGTGAGCG CCGCCGGGTG GCGCTGTGCA AGCTGCTGCT CGAGGCGCCC
GACCTGCTGC TGCTGGACGA GCCGACCAAC CACCTCGACG CGGAGAGCGT CGCCTGGCTG
GAGCAGCACC TGGCCCGCTA CGCCGGGGCG GTGCTGGCCG TCACGCACGA CCGGTACTTC
CTCGACAACG TCGCCGGCTG GATCCTCGAG CTCGACCGCG GCCGCGCCCA CCCCTACGAG
GGCAACTACT CCACCTACCT GGAGAACAAG GCGTCCCGGC TCAAGGTCGA GGGCCAGAAG
GACGCCAAGC GCCGCCGGGC GCTCGCCCAG GAGCTCGAGT GGGTCCGCTC GAACCCGAAG
GCCCGCCAGG CCAAGAGCAA GTCCCGCCTC GCCCGTTACG AGGAGCTGGC CGCCGAGGCG
GACAAGGCCA GGCCGCGCGA CTTCGAGGAG ATCCAGATCC CGCCCGGCCC GCGGCTGGGC
AGCCTGGTGA TCGAGACGAA GAAGCTCACC AAGGGCTTCG GTGAGCGGGT GCTCATCGAC
GACCTGTCGT TCAGCCTGCC GCGCGGTGGC ATCGTCGGCG TGATCGGCCC GAACGGCGTC
GGCAAGACCA CGCTGTTCAC GATGCTTGTC GGCCAGGCGT CGCCTGATTC CGGCGAGCTG
CAGATCGGCG AGACGGTCGA CATCGCCTAC GTGGACCAGT CCCGCGGCGG TCTCGACGCG
AAGAAGAACG TGTGGGAGAT CGTCTCCGAC GGGCTGGACC ACATCGTCGT CGGGAAGACC
GACATCCCGA GCCGGGCGTA CGTGTCGTCG TTCGGGTTCA AGGGGCCTGA CCAGCAGAAG
CCGGTCGGCG TGCTCTCCGG CGGAGAGCGC AACCGGCTGA ACCTGGCGCT GACCCTCAAG
CGCGGCGGCA ACGTGCTGCT GCTCGACGAG CCCACGAACG ACCTGGACGT CGAGACCCTG
CGGTCGCTGG AGGAGGCGCT GCTGGAGTTC CCGGGCTGCG CCGTGGTCGT CTCCCACGAC
CGGTGGTTCC TGGACCGGGT CGCGACGCAC ATCCTGGCCT GGGAGGGCAC CGAGGCCGAC
CCGGCCCGCT GGTTCTGGTA CGAGGGCAAC TTCGCCGACT ACGAGACCAA CAAGGTCGAG
CGGCTCGGTG CGGACGCGGC CCGCCCGCAC CGGGTGACGT ACCGCAAGCT CACCCGCGAC
TAG
 
Protein sequence
MAQYVFQMRK ARKAHGDKVI LDDVTLAFLP GAKIGVVGPN GAGKSSLLKI MAGLDHPSNG 
DAILSPGYTV GMLAQEPRLD EAKDVRGNVE DGVREIRAVL ARYEEINEKM AAPDADFDTL
LADQAALIDK IEAANAWELD SQIDQAMDAL RLPPGDADVT ALSGGERRRV ALCKLLLEAP
DLLLLDEPTN HLDAESVAWL EQHLARYAGA VLAVTHDRYF LDNVAGWILE LDRGRAHPYE
GNYSTYLENK ASRLKVEGQK DAKRRRALAQ ELEWVRSNPK ARQAKSKSRL ARYEELAAEA
DKARPRDFEE IQIPPGPRLG SLVIETKKLT KGFGERVLID DLSFSLPRGG IVGVIGPNGV
GKTTLFTMLV GQASPDSGEL QIGETVDIAY VDQSRGGLDA KKNVWEIVSD GLDHIVVGKT
DIPSRAYVSS FGFKGPDQQK PVGVLSGGER NRLNLALTLK RGGNVLLLDE PTNDLDVETL
RSLEEALLEF PGCAVVVSHD RWFLDRVATH ILAWEGTEAD PARWFWYEGN FADYETNKVE
RLGADAARPH RVTYRKLTRD