Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3763 |
Symbol | |
ID | 5672128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4459408 |
End bp | 4462362 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242644 |
Product | ABC transporter related |
Protein accession | YP_001508064 |
Protein GI | 158315556 |
COG category | [V] Defense mechanisms |
COG ID | [COG1132] ABC-type multidrug transport system, ATPase and permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.849502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGAGG GCTCCCTCGC GCACCCGCGG CCGGCCCTGA TCGCCACCCT CGCCGGCATC GTCAACGGGA CCACCATGAT CCTCGGGGCC GCCGCCATCG GCTGGGCCAC CGACCATCTG ATCGTCCCGG CCCTCGCCGG CGGCCACGTC GCCCGCGCCA CCTGGTGGAT CGCGGTCGGC GCGATCCTCG GCGTCTCCAC GGTGCGCTGG ATGACCATCG TCATCCGCGG CATCGCGACC GGGTACGTGC AGCACGGCTC GCAGGCGCGG GTCCGGCGGT CCGTCGTCGG CCGGTATCTC GAGCTCGACC TGGCCTGGCA CCGTCGGCAC CCGCCCGGTC GCCTGCTGTC CACCGCGGTG TCCGACGTGG ACGCGCTGTG GTTCCCGATG GTCTTCTACT ACTTCGCGCT CGGGATGATC GTCATGCTGG TCGTCGCGAT CGTCCAGCTC TTCGGGCACG ACACCGCCCT CGGGCTGGTC GGGGTCGGCC TCGTCGGCTC CGTCCTCGGG GTGAACCTGC TCTACCAGCG CCTGCTCAGC CCGCGCGCCC GGGCGGTGCA GGACAGCCGC GGCGAGTTCG GCGCGCTCGC CCTGGAGAGC ATCGAGGGCG GCCAGGTCGT CCGGACCCTC GGCATCGCCG ACCGCGAGCG GGCCCGGGTC GGTGCCGCGG CGCTCCGGCT GCGCGCGGCC ACCACCGCGG CCGGCGACCT CAGCTCGGTG TTCGACCCGC TCCTGGAGGT GCTGCCGACC GCCGCGGTCA TGGCCGTCCT CGCCGTCGGC TCGCGCCGGG TCGAGACCGG CGACCTCAGC GTCGGCGTCC TCGTCGAGGT CGTCTACCTG CTGCTGACCA TCTCCATCCC ACTCAACGTG ATCAGCCGTT TCCTGGGGAT GCTGCCGGTC TCGGCGGCCG GCCGCACCCG CGTCGCCGCC GTGCTCGACG CCGCCGAGAC CACCGCCCAC GGAGACCGCG CCCTGCCCAG CGGCGCCGCT CCCGGCCCGC GGGCGCCGGC TCCGCGGGCG CCGGGCGTCG GGCTCGTCCG CGGCGGGACG AGCCTTCTCA CGGACGTCGA CATCGAGGTG CGCCCCGGCG AGATCGTCGC CATCGTCGGG CCGACCGGCT CGGGCAAGAC CACCCTGATC GAGCTCCTCA GCCGCCAGGT CGACCCCACC GACGGCGTCG TCGAGATCGG CGGGGTTCGC GCCACGGACC TCGCCCGCGG GGAGATCTCG TCCCAGCTGG CCGTTGTCGG GCAGACCTCG TGGCTGTTCG GCGGCAGCGT CCACGCCAAC CTTCAGCTCG ACGGCCATCC CCGCGAACGG CGTCCCTACA CCGCCGGTGA GATCTGGCGG GCGCTGGCCG CCGCCGGCGC CGATGACGTC GTCCGTGACC TGCCCAACGG CCTCGACACC CGGGTCGGCG AGCGCGGCGC CCGGCTCTCC GGCGGCCAGC GGCAGCGGCT GTGCCTGGCC CGCGCGCTGT TGCGCGAGCC CGGCGTGCTG CTGCTGGACG ACGCCACCTC GGCCCTCGAT CGGCGCACGG AGGCCGCGCT CGCCGAGCTG CGCGCGGCCG GCGTCGCGCG CATCCACGAG ATGGCCGCGG AGACGTTCGC GCGCACCGGC AGCGCCGACC TCGTCAGCCG GCTCACCGGC GACGTCGACG CCGTCACGAC CTTCGTGCAG AGCGGCGGCG TCATGCTGCT CGTCAACGTC ACCCAGATGA TCATCGCCGG CGTGCTGATC GCGGTGTACT CCTGGCAGCT GGCGGTGCCC GTGCTGGCGA CGGCGGTGCT GTTGTTCGTC GCGATGCGTC GCGTGCAGGC GCTGGTGGCG CGCCGGTTCA CCGTGGTCCG GGAGAGCGTG TCAGCGCTGC AGTCCACCGT GGGTGAGGCC GTCACCGGCA TCAGCGTCAT CCGCTCGACC GGCACCGAGG CACGCAGCCG CGCCATGGTC GAGGACGCCG TCGAGCACAC GGCGTCGGCG CAGCGCCGCA CCCTCGTCCC GCTGCACTTC AACACCGCCT TCGGTGAGAT CGCGATCTCG TTCGTCACCG TGGTGGTGAT CGTCGCGGGC GTCCGCTGGT CGACCGCCCA CACCCGGTGG GAGCCGACGC TGCACCTGTC GGCCGGTGAG CTGGTGGCGA TGCTCCTGCT CGTCACCTTC TTCGTCCGGC CGTTGCAGAT GCTGGTCCAG ATGCTCGGCG AGGCCCAGAA CGCCGTCGTC GGCTGGCGCC GCGCGCTGGA GATCCTCATC GCCGCGGGCG AGCACGTCGC CGTCGTCGGC GAGACCGGCT CGGGCAAGAG CACCTTCGCC CGGCTGCTGA CCAGGCAGAT CGCGCCACGC CACGGCCGTG TCCTGCTCGG CGGCCTCCCG GCCGGCCAGG TGTCGGACCT GTCCTTCCAG CGCCGCGTCG CGGTCGTCCC GCAGGACCCG TTCCTGTTCG ACGCCACGAT CGCCGACAAC ATCCTCGCCG GGGTCCGCGG CGACGCCGGG GCGCTCGACG AGATCGTCGA CTCGCTCGGC CTGCGGCCGT GGATCGCCAC CCTGCCCGAG GGCCTCGACA CCCGCGTCGG CACACGCGGC GACCGGCTCT CCGCCGGCGA GCGCCAGCTC GTCGCCCTCG CCCGCACCGC CCTGGTCGAC CCCGACCTGC TCGTGCTCGA CGAGGCGACC AGCGGCGTCG ACCCCGCGAC CGACGTCCGC GTGCAGCACG CGCTCGGCGC GCTCACGGTC GGGCGGACCA CGGTCTCGAT CGCCCACCGC ATGGTGACGG CCGAGCGCGC CGACCGCGTT CTCGTCTTCG ACCACGGGCG CCTCGTCCAG AGCGGGCGTC ACGACGACCT CGTCCGCGTC CCCGGCCACT ACGCCCGGCT GCACGCCGCC TGGGTCGAGA ACACCGCCGG CGCCGACGAG CGCCACCAGA CCCTCCTCCA CCACAATGAC GGGAGCACCC CGTGA
|
Protein sequence | MLEGSLAHPR PALIATLAGI VNGTTMILGA AAIGWATDHL IVPALAGGHV ARATWWIAVG AILGVSTVRW MTIVIRGIAT GYVQHGSQAR VRRSVVGRYL ELDLAWHRRH PPGRLLSTAV SDVDALWFPM VFYYFALGMI VMLVVAIVQL FGHDTALGLV GVGLVGSVLG VNLLYQRLLS PRARAVQDSR GEFGALALES IEGGQVVRTL GIADRERARV GAAALRLRAA TTAAGDLSSV FDPLLEVLPT AAVMAVLAVG SRRVETGDLS VGVLVEVVYL LLTISIPLNV ISRFLGMLPV SAAGRTRVAA VLDAAETTAH GDRALPSGAA PGPRAPAPRA PGVGLVRGGT SLLTDVDIEV RPGEIVAIVG PTGSGKTTLI ELLSRQVDPT DGVVEIGGVR ATDLARGEIS SQLAVVGQTS WLFGGSVHAN LQLDGHPRER RPYTAGEIWR ALAAAGADDV VRDLPNGLDT RVGERGARLS GGQRQRLCLA RALLREPGVL LLDDATSALD RRTEAALAEL RAAGVARIHE MAAETFARTG SADLVSRLTG DVDAVTTFVQ SGGVMLLVNV TQMIIAGVLI AVYSWQLAVP VLATAVLLFV AMRRVQALVA RRFTVVRESV SALQSTVGEA VTGISVIRST GTEARSRAMV EDAVEHTASA QRRTLVPLHF NTAFGEIAIS FVTVVVIVAG VRWSTAHTRW EPTLHLSAGE LVAMLLLVTF FVRPLQMLVQ MLGEAQNAVV GWRRALEILI AAGEHVAVVG ETGSGKSTFA RLLTRQIAPR HGRVLLGGLP AGQVSDLSFQ RRVAVVPQDP FLFDATIADN ILAGVRGDAG ALDEIVDSLG LRPWIATLPE GLDTRVGTRG DRLSAGERQL VALARTALVD PDLLVLDEAT SGVDPATDVR VQHALGALTV GRTTVSIAHR MVTAERADRV LVFDHGRLVQ SGRHDDLVRV PGHYARLHAA WVENTAGADE RHQTLLHHND GSTP
|
| |