Gene Information |
Locus tag | Franean1_3584 |
Symbol | |
ID | 5671953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4245067 |
End bp | 4247688 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242470 |
Product | ABC transporter related |
Protein accession | YP_001507890 |
Protein GI | 158315382 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
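The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not contribute an amino acid. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4245067, 4247688)
print(length)                  # 2622
print(protein_length(length))  # 873
```

Both values agree with the Gene Length (2622 bp) and Protein Length (873 aa) rows of the record.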
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.282341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.46326 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCTGG CCAAGAGCCT TCTGCCCCGT GTCCTCGCCG GCCCGCTGCT CGCCGTCGTG CTGCTGGTCG TCGTGCACGG AGAGCTGATA CCGGCATACC AGACCTACTC GCTGGCCCTG GCGGCGACAT ACGCGGTCCT CGTGCTCAGC GTGGGCCTGC TGGCCGGATG GGCTGGCGTC TGGTCGGTCG GCCACCCGGC CCTGTTCGCC ATCGGGGCCT ACACAGCGGC GTACGGCTCC GCCCACGGAT GGGGGCTGGA GGTCACCGTG CTTGCGGCGA TGGCGCTGGC CGGAACCTGC GGTGCCTTCC TGGGCTTCGC CGGTGCCCGT TTCTCCGTCC TCTATATCGC CCTGCTCACC CTGGCCTTCA GCCTGGTCGC TCTCGAGGTG ATCAATCGGT GGACCGGTGT GACCGGCGGA GACCAGGGTG TTCCTGTCCT CGAGCTGTCG AGCGTGCTCG GCCTGGGAAG CCTCGGTGGT GGCAGCGCGG AGGCGATCGA CGCGGCGATC GTCACGGCTG GGGTCATGCT CACGGTCGCC GCCCTCGCCC GGCCTATGGG CCTGCGGATG CGCATGGTCG CGGCGAAGTC GCATCCGCTG GCGGCTCGCT CCATCGGAAT CGCCCCGGAG GCACAGACCG CGCTGGCCTT CGGGGCCAGT GGGGCGGCCG CCGGGCTTGC CGGTGTGCTA CTAGCCCTGA TCACTGGCTA CGTCAGCCCA GAGTCGTTCT CCCTGGTCTT CGGCATCAAC ACGATCGCCG CGGCAGTGCT CGGCGGTGTC GGCACCATCG CCGGAGCCGT GGTCGGCGGC GCCTTCATGG CGTGGTCACC CACCCTCGCC GACGACATCG GGGTCAGCCA GATGGTCGTC CAGGGCACCG TGCTGATCAT AGTTCTTCTC CTGCTGCCCA GCGGTGTCGT GCCGGCCGTC GCGAGACTCG GGCGGGCGGT GTTCAGGCGG GCGGTACTCC CGCGGGCACC GTGGCTGCGG CCGCGTCCCG GCCCGGCCTC CGGTTTCGCC GACGGGATGG CCGACGGGAC GGCCGACCGA GCCACCGGCC CGGCCGCGCC GGGCCTCGTC CCGGCCGCAC GCCCCGAGGA CGCCGGGGGG ACAGCGTCGG GCGGCGCGGA CGCCGAGACC GTGCTCGAGA TCAGCGACCT TGCGGTGACA TTCGGCGGGC TAAGAGCGCT GGAGGGAGTC TCGCTCAGCG TGCGGAAGGG CGAGGTGCTC GCGATCATCG GGCCGAACGG GGCCGGAAAG ACCACGCTGG TGAACGCGCT GTCCGGGCTG CTGCCCAGCG GTCGGGTCAC CGGGTCGGCC CGCTACCGCG GGCACGAGCT CCTCGGCCGC CGCGCGACCG GCCGTCGCGG TCTGGGTATC GGGCGCACGT TCCAGCATGC CGAGCCCTTC GGCGAGCTCA GCATGCTCGA GAACTTGCTG TGCGCCCACC GGTGGCCCAC TGTGCGGCGG CGCGCCGAAG CCTGGCAGCT GCTGGAGCAG GTCGGCCTGT CCGACGTCGC GGACCGCTCC CCGCACGAGC TGCCGTTCGG GGTACGCAAG CGGCTCGACC TGGCGCGCGC CATCGTCGCG CATCCGGACC TGCTCATCCT CGACGAGCCG TTCGGCGGGC TCGACGCCGG CGAGCGCGCG CTGCTCGCGA CCCAGGTGCG CCGGCTTCGC GACGAGGGTG TTGCGATAAT CATCATCGAT CACGTGATCG AGGACCTGTT CGCGGTCGCG GACCGCGTCG TGGCCTTCGA CTTCGGCAGG CCCATCGGTA GCGGCACCCC GGACGTCGTG 
CTGCAGGACG ATGCCGTCCG CACCTCTTAC CTGGGCGCGG CAACGGTACG GCCACGCGCC GCGCTCGCCG CGGGACGTGG CGAGCCGCTG GTCACGCTCA CCGCCGTCGG CCACCGCTAC GGCGGCGTGG TCGCGCTGGA CGGCGTGGAC CTGCGGATCC CGCGGGGCGG CATCCTCGCC GCCGTCGGCG CCAACGGGGC TGGCAAGAGC ACTCTCGGCA AGGTGGTGCA CGGCACCGTC GTGCCGACCC GCGGCACCCG CGAGGTCGTC CAGGTCGACG GGCGCGCTTT GCGCTGCAGC CTCATGCCCG AGGGGCGCGC CCTGTTCAAG TCCCTGTCGG TGCGGGAGAA CCTGGACGTC GCCGCCTATG CGGCGGGGGT GCGCGGCGCC CTGCTCCGCC AGCGCCGAGA CGAGACTATG GACTGGCTGC CGGACCGGGT GCGCAGCCGC ATGTCGGTGT CGGCGGGCGC GCTGTCCGGT GGCGAGCAGC AGCTCCTCGC GACGGCTCGG GCACTGATGG CCGGGCCGGA CCTGCTCGTG CTCGACGAAC CCGCGCTCGG GCTGGCGCCC GCCATGGTCG ACGAGATCTA CGAGCGCATC GCCGGGCTGG CCGAACAGGG GCTGACGGTA GTGCTGCTCG AGCAGCTCCT GAGCCGGGCC CTCAGCCTTG CCACCGACAT GGTGGTGCTG CACGAGGGCA CTGTCGCGGT CACGGGCTCG CCCGCCGACC CCGGATTCGC CGAGCTTGCC GAGCACGCCT ACTTCGGCGG TGCTGCGGCC GTCGCCTCCG GCACCCCGCT AACCGAGGCG GTGAGCCGGT GA
|
Protein sequence | MVLAKSLLPR VLAGPLLAVV LLVVVHGELI PAYQTYSLAL AATYAVLVLS VGLLAGWAGV WSVGHPALFA IGAYTAAYGS AHGWGLEVTV LAAMALAGTC GAFLGFAGAR FSVLYIALLT LAFSLVALEV INRWTGVTGG DQGVPVLELS SVLGLGSLGG GSAEAIDAAI VTAGVMLTVA ALARPMGLRM RMVAAKSHPL AARSIGIAPE AQTALAFGAS GAAAGLAGVL LALITGYVSP ESFSLVFGIN TIAAAVLGGV GTIAGAVVGG AFMAWSPTLA DDIGVSQMVV QGTVLIIVLL LLPSGVVPAV ARLGRAVFRR AVLPRAPWLR PRPGPASGFA DGMADGTADR ATGPAAPGLV PAARPEDAGG TASGGADAET VLEISDLAVT FGGLRALEGV SLSVRKGEVL AIIGPNGAGK TTLVNALSGL LPSGRVTGSA RYRGHELLGR RATGRRGLGI GRTFQHAEPF GELSMLENLL CAHRWPTVRR RAEAWQLLEQ VGLSDVADRS PHELPFGVRK RLDLARAIVA HPDLLILDEP FGGLDAGERA LLATQVRRLR DEGVAIIIID HVIEDLFAVA DRVVAFDFGR PIGSGTPDVV LQDDAVRTSY LGAATVRPRA ALAAGRGEPL VTLTAVGHRY GGVVALDGVD LRIPRGGILA AVGANGAGKS TLGKVVHGTV VPTRGTREVV QVDGRALRCS LMPEGRALFK SLSVRENLDV AAYAAGVRGA LLRQRRDETM DWLPDRVRSR MSVSAGALSG GEQQLLATAR ALMAGPDLLV LDEPALGLAP AMVDEIYERI AGLAEQGLTV VLLEQLLSRA LSLATDMVVL HEGTVAVTGS PADPGFAELA EHAYFGGAAA VASGTPLTEA VSR
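The GC content and protein sequence reported in this record can, in principle, be recomputed from the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted in the grouped form shown above (blocks of ten bases separated by spaces), and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. Since this gene starts with ATG, alternative start codons are not handled here. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # 'X' for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 bases of the gene sequence above.
prefix = "ATGGTCCTGG CCAAG"
print(translate(prefix))   # MVLAK
print(gc_percent(prefix))  # 60.0
```

The translated prefix matches the first five residues of the protein sequence (MVLAK). Running the same functions on the full gene sequence would be the way to compare against the reported 72% GC content and the 873-residue protein.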
|
| |