Gene Franean1_3918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3918 
Symbol 
ID5672279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4684239 
End bp4686020 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content74% 
IMG OID641242797 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001508214 
Protein GI158315706 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.432636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0575637 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCGGTA GCGGAACCGC CCATCTGCGC GGCGAGGACG CGCTGCTCAG CGTGCGCGAC 
CTGGTGGTCG AATACCCGAC CAAGGGCGGG GTGGTCCAGG CCGTCTCCAA GGTCAGCTTC
GACGTCCTGC CCGGCGAGAC GCTCGGGATC GTCGGCGAGT CCGGCTGTGG GAAGTCCACG
ACCGGGCGCG CCGTGCTCCG GCTGGACAGG CTGACCTCCG GCCAGATCAC GTTCGCCGGC
GAGCGCATCG AGGGAGTCGG CGAACGCCGG ATGCGCGAGC TCCGCCGCTC CATCCAGATG
ATCTTCCAGG ACCCGGTGGC GTCGCTGAAC CCCCGCCGCA GCGTCAAGGA CATCGTCGTC
GAGGGTCTGG CGGTTGCCCG TGCCCCGGCG TCCGAACGGG CCACCGTGTC GGCGTCCGTC
CTCGGACAGG TCGGGCTGGA CGGCGACCGC TTCGCCGACA TGCTGCCCCG CCAGCTCTCC
GGCGGCCAGG CACAGCGGGT GGCGATCGGC CGCGCGCTCG CGCTGCACCC GCGGCTGCTG
ATCTGCGACG AGCCGGTCTC CGCGCTGGAC GTCTCCGTGC AGGCGCAGAT CCTCAACCTG
ATCGAGGAGC TGAAGGCCGA GTTCGACCTG ACCGTCGTGT TCATCGCGCA CGACCTCGGG
GTCGTGCGGG CGGTCAGCGA CGACGTCCTG GTCATGTACC TCGGCAAGGT GTGCGAGTTC
GGCGACTCCG ACCTGGTGTA CGACCAGCCC GCCCACCCCT ACACCCGGGC GCTGCTGGAC
TCCGTCCCGC TGACCGACCC GGAGCGCGGC TTCACCGGTC CCGCGTTGGA GGGCGACCTG
CCCTCCCCGC TGTCCCCGCC GACGGGATGC CGTTTCCGCA CCCGGTGCCC GTTGGCCGAG
GAGCGCTGCG CCGCCGAGGA GCCGGAGATC CGCGAGGTCC GCCCGGGCCA GTACGTCGCC
TGCCACTTCC CGCTGACCTC ACCGCTGGCC GAATCGGCGG CCGCGGCAGC GGCCACGGAG
GTGGCCACGC CGGTCACCGC CCCCCTGACC GAGACCACGC CCGACGCCAC CGAGCCCTCG
GCGAGCGAGC TCACAGCCAC CACACCCGAG GCCACCAAGC CCCCGGTCGC CGAAGAGCCC
CCGGTCACCG AGCCCGCGGC CACCAAGTCG GTGGCCGCCG AACCTGTGGC CGCCGAACCG
GTCGCAGGCA AGCCCGCGGC CAGCGAGCCC GCGGCCGCCA AGCTCACGCC CGGCGAACCG
GCGACTGCGA CTGCGACAGC GGCAGAGCCC ACGGCTGTCG AGTCCAAGGT CGCCGAGCCC
GTGGTGGCGG GGGCCGAGCC CGCGCCCGCC ACGTCGGGGA CCGCCGAGCC CGCGCCCGCC
AAGGCCGCCG CGGCACCCGC CACCACGGCC CCTGCCACCA CGGGCGACGA GGCGGCGGCC
GCGGACGACC AGGCGGCGGG CGACCCGCCG ACGGTCGAAT CGGCCGCCGG TGAGCAGACC
ACGACCGAGG CCTCGACGAA GCCGACCACC GACGAGTCAG TGGCGGCCGA GACCGCCGCC
GTGCCAACCC CGGCCACCGA CGACCCGGCC ACGGTCGAGA CCGCGACGAC GCCGAGTGGC
ACCGCGGCGC CCGCGAGCAC CGAGCCTCCC ACGGCTACAG CGACCACGGA GCAGGCCACG
GAGGCTGGTG ACCCGCCCAC CGTCGAGACG ACCGTGAGCG ACGACGCGGA TGGCAGCGCA
GCCGGGGAGA CGAAGAAGAC CTCTGGGGGC ACCACCGCCT GA
 
Protein sequence
MAGSGTAHLR GEDALLSVRD LVVEYPTKGG VVQAVSKVSF DVLPGETLGI VGESGCGKST 
TGRAVLRLDR LTSGQITFAG ERIEGVGERR MRELRRSIQM IFQDPVASLN PRRSVKDIVV
EGLAVARAPA SERATVSASV LGQVGLDGDR FADMLPRQLS GGQAQRVAIG RALALHPRLL
ICDEPVSALD VSVQAQILNL IEELKAEFDL TVVFIAHDLG VVRAVSDDVL VMYLGKVCEF
GDSDLVYDQP AHPYTRALLD SVPLTDPERG FTGPALEGDL PSPLSPPTGC RFRTRCPLAE
ERCAAEEPEI REVRPGQYVA CHFPLTSPLA ESAAAAAATE VATPVTAPLT ETTPDATEPS
ASELTATTPE ATKPPVAEEP PVTEPAATKS VAAEPVAAEP VAGKPAASEP AAAKLTPGEP
ATATATAAEP TAVESKVAEP VVAGAEPAPA TSGTAEPAPA KAAAAPATTA PATTGDEAAA
ADDQAAGDPP TVESAAGEQT TTEASTKPTT DESVAAETAA VPTPATDDPA TVETATTPSG
TAAPASTEPP TATATTEQAT EAGDPPTVET TVSDDADGSA AGETKKTSGG TTA