Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3918 |
Symbol | |
ID | 5672279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4684239 |
End bp | 4686020 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242797 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_001508214 |
Protein GI | 158315706 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | [TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.432636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0575637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCGGTA GCGGAACCGC CCATCTGCGC GGCGAGGACG CGCTGCTCAG CGTGCGCGAC CTGGTGGTCG AATACCCGAC CAAGGGCGGG GTGGTCCAGG CCGTCTCCAA GGTCAGCTTC GACGTCCTGC CCGGCGAGAC GCTCGGGATC GTCGGCGAGT CCGGCTGTGG GAAGTCCACG ACCGGGCGCG CCGTGCTCCG GCTGGACAGG CTGACCTCCG GCCAGATCAC GTTCGCCGGC GAGCGCATCG AGGGAGTCGG CGAACGCCGG ATGCGCGAGC TCCGCCGCTC CATCCAGATG ATCTTCCAGG ACCCGGTGGC GTCGCTGAAC CCCCGCCGCA GCGTCAAGGA CATCGTCGTC GAGGGTCTGG CGGTTGCCCG TGCCCCGGCG TCCGAACGGG CCACCGTGTC GGCGTCCGTC CTCGGACAGG TCGGGCTGGA CGGCGACCGC TTCGCCGACA TGCTGCCCCG CCAGCTCTCC GGCGGCCAGG CACAGCGGGT GGCGATCGGC CGCGCGCTCG CGCTGCACCC GCGGCTGCTG ATCTGCGACG AGCCGGTCTC CGCGCTGGAC GTCTCCGTGC AGGCGCAGAT CCTCAACCTG ATCGAGGAGC TGAAGGCCGA GTTCGACCTG ACCGTCGTGT TCATCGCGCA CGACCTCGGG GTCGTGCGGG CGGTCAGCGA CGACGTCCTG GTCATGTACC TCGGCAAGGT GTGCGAGTTC GGCGACTCCG ACCTGGTGTA CGACCAGCCC GCCCACCCCT ACACCCGGGC GCTGCTGGAC TCCGTCCCGC TGACCGACCC GGAGCGCGGC TTCACCGGTC CCGCGTTGGA GGGCGACCTG CCCTCCCCGC TGTCCCCGCC GACGGGATGC CGTTTCCGCA CCCGGTGCCC GTTGGCCGAG GAGCGCTGCG CCGCCGAGGA GCCGGAGATC CGCGAGGTCC GCCCGGGCCA GTACGTCGCC TGCCACTTCC CGCTGACCTC ACCGCTGGCC GAATCGGCGG CCGCGGCAGC GGCCACGGAG GTGGCCACGC CGGTCACCGC CCCCCTGACC GAGACCACGC CCGACGCCAC CGAGCCCTCG GCGAGCGAGC TCACAGCCAC CACACCCGAG GCCACCAAGC CCCCGGTCGC CGAAGAGCCC CCGGTCACCG AGCCCGCGGC CACCAAGTCG GTGGCCGCCG AACCTGTGGC CGCCGAACCG GTCGCAGGCA AGCCCGCGGC CAGCGAGCCC GCGGCCGCCA AGCTCACGCC CGGCGAACCG GCGACTGCGA CTGCGACAGC GGCAGAGCCC ACGGCTGTCG AGTCCAAGGT CGCCGAGCCC GTGGTGGCGG GGGCCGAGCC CGCGCCCGCC ACGTCGGGGA CCGCCGAGCC CGCGCCCGCC AAGGCCGCCG CGGCACCCGC CACCACGGCC CCTGCCACCA CGGGCGACGA GGCGGCGGCC GCGGACGACC AGGCGGCGGG CGACCCGCCG ACGGTCGAAT CGGCCGCCGG TGAGCAGACC ACGACCGAGG CCTCGACGAA GCCGACCACC GACGAGTCAG TGGCGGCCGA GACCGCCGCC GTGCCAACCC CGGCCACCGA CGACCCGGCC ACGGTCGAGA CCGCGACGAC GCCGAGTGGC ACCGCGGCGC CCGCGAGCAC CGAGCCTCCC ACGGCTACAG CGACCACGGA GCAGGCCACG GAGGCTGGTG ACCCGCCCAC CGTCGAGACG ACCGTGAGCG ACGACGCGGA TGGCAGCGCA GCCGGGGAGA CGAAGAAGAC CTCTGGGGGC ACCACCGCCT GA
|
Protein sequence | MAGSGTAHLR GEDALLSVRD LVVEYPTKGG VVQAVSKVSF DVLPGETLGI VGESGCGKST TGRAVLRLDR LTSGQITFAG ERIEGVGERR MRELRRSIQM IFQDPVASLN PRRSVKDIVV EGLAVARAPA SERATVSASV LGQVGLDGDR FADMLPRQLS GGQAQRVAIG RALALHPRLL ICDEPVSALD VSVQAQILNL IEELKAEFDL TVVFIAHDLG VVRAVSDDVL VMYLGKVCEF GDSDLVYDQP AHPYTRALLD SVPLTDPERG FTGPALEGDL PSPLSPPTGC RFRTRCPLAE ERCAAEEPEI REVRPGQYVA CHFPLTSPLA ESAAAAAATE VATPVTAPLT ETTPDATEPS ASELTATTPE ATKPPVAEEP PVTEPAATKS VAAEPVAAEP VAGKPAASEP AAAKLTPGEP ATATATAAEP TAVESKVAEP VVAGAEPAPA TSGTAEPAPA KAAAAPATTA PATTGDEAAA ADDQAAGDPP TVESAAGEQT TTEASTKPTT DESVAAETAA VPTPATDDPA TVETATTPSG TAAPASTEPP TATATTEQAT EAGDPPTVET TVSDDADGSA AGETKKTSGG TTA
|
| |