Gene Information |
Locus tag | Franean1_0428 |
Symbol | |
ID | 5668851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 503841 |
End bp | 505589 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239360 |
Product | ABC transporter related |
Protein accession | YP_001504799 |
Protein GI | 158312291 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component; [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
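
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database entry. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 503841
END_BP = 505589


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1749 bp) and Protein Length (582 aa) fields listed above.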
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCCG AACGGGTAGT GGGCGACACC CGGCGCCGGG TGTTCCGGCA ACTCCTGCGC AACCCGCAGG GCGCCGTGTG CCTGGCCTTT CTCGCCCCGG TCGTGCTGGT CGCGGTGACC AGCCAGTGGC TGGCGCCGTA CTCGCCGACC GAAACCGACC TCGACGCCAC CAACGCCGCG CCGTTCAGCG ACGGGCACCT GCTCGGCGGG GACAGCGCCG GCCGCGACAT CCTGTCCCGG CTCATGTGGG GCTCGCGGCA GACCGTCCTG GCGTGCGCCA TCATCCTGGT GATCTCGCTG GCTGTCGGTG TCACCAGCGG CTTGGTGGCC GGCTTCTATC GTGGGCGGTT CGAGGTCGCG GCGGGCTTCG TCTCCGACGT GGTCATGTCG CTGCCCGGCA TCGTTCTGCT GATCGCCCTG TACGCCCTGA CCGGCCCGAA CATCCCGGCC GCGATGGCCG TCTTCGGCCT GCTGATCGCC CCCACCTACT ACCGCCTGGT CCGCAACGTC GTGCTGGGCG TGCGCAACGA GCTCTACGTC GACGCGGCCC GCGTGGCCGG CCTGTCCGAC CTGCGGATCG TCGGCCGGCA CGTGCTCTGG GCCGTACGGG CGCCGGTCAT CATCCAGAGC TCGTTCGTGC TGGCCGCGGG CATCGGCATC GAGGCCGGCG TGTCCTTCCT CGGCCTCGGT GACCCGGCCG GGGCGTCGTG GGGCATCGTG CTGCAGAACT CGTTCAACGG CATCTACAAC AACCGGTGGG CCGTGGTCTG GCCGGCGCTG CTGATCAGCC TCACCATCCT GGCCCTGGTG CTGCTCGGCA ACGCGCTCAG CGACGTCCTG CAGTCCTCGG CCCGCAGCAA GACACTGTCG CTACGGGCCC GCCGCGCTGC GGTGACGGCG GCGCAGCAGG TCGAGCCGAC CGAGGACTCG ATCCTCGTCG GGAGCTCCAA CGATGTCGTG CTCTCCGTCC GGGGACTGCG GGTCGCCTAC CCGATGGCCG GCGGGGAGAT CCGGGAGGTA GTGCACGGCG CCGACCTGGA TGTGCGGCGC GGCGAGATCC ACGGCCTGGT GGGGGAGTCC GGCTCCGGCA AGTCGCAGAT CGCCTTCGCC ACCCTGGGCC TCCTCCCGCG CGAGGCGCTG GTCCTGGGCG GCAGCGTCCT GCTCGACGGC GAGGATCTGC TGGCCGACGT CGCGAAGATG CGGGCGGCCC GCGGGCGCCG CATCGCGTAC GTTCCGCAGG AGCCGATGTC CAATCTGGAC CCGTCCTTCA CCATCGGCAA ACAGTTGACG TACGGGCTGC GCGCGGTAAC CACTCTCGAC GCGAAACAGG CCCGGGAGCG GATCATCAAA TTACTGGTCC GGGTGGGCAT CACCGATCCC GAGTGGGTGA TGGGCCTGTA CCCGCACGAG ATCTCGGGTG GCATGGCCCA GCGCGTGCTG ATCTGCGGCG CGGTCGCCGC CGACCCGGAC GTCATCGTGG CCGACGAGCC CACCACCGCC CTCGACGTCA CCGTCCAGGC CGAGGTCCTC GAACTGCTGC GCGAACTGAG CCAGGAGCGC GGTCTGGCCA TGATCCTGGT GACCCACAAC CTCGGCGTCG TCGCGGATCT GTGCGACACC GTGAGCGTGA TGAAGGAAGG AAACATCGTC GAACGCGCCG ACGTCGACGC CATCTTCGAG TCCCCGCAAC AGGCGTACAC CCGGGAACTG CTCTCGTCCT CCCGCAGCGT CGAGCTGATG GAGATCTGA
Protein sequence | MTAERVVGDT RRRVFRQLLR NPQGAVCLAF LAPVVLVAVT SQWLAPYSPT ETDLDATNAA PFSDGHLLGG DSAGRDILSR LMWGSRQTVL ACAIILVISL AVGVTSGLVA GFYRGRFEVA AGFVSDVVMS LPGIVLLIAL YALTGPNIPA AMAVFGLLIA PTYYRLVRNV VLGVRNELYV DAARVAGLSD LRIVGRHVLW AVRAPVIIQS SFVLAAGIGI EAGVSFLGLG DPAGASWGIV LQNSFNGIYN NRWAVVWPAL LISLTILALV LLGNALSDVL QSSARSKTLS LRARRAAVTA AQQVEPTEDS ILVGSSNDVV LSVRGLRVAY PMAGGEIREV VHGADLDVRR GEIHGLVGES GSGKSQIAFA TLGLLPREAL VLGGSVLLDG EDLLADVAKM RAARGRRIAY VPQEPMSNLD PSFTIGKQLT YGLRAVTTLD AKQARERIIK LLVRVGITDP EWVMGLYPHE ISGGMAQRVL ICGAVAADPD VIVADEPTTA LDVTVQAEVL ELLRELSQER GLAMILVTHN LGVVADLCDT VSVMKEGNIV ERADVDAIFE SPQQAYTREL LSSSRSVELM EI