Gene Information |
Locus tag | Franean1_3476 |
Symbol | |
ID | 5671847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4132940 |
End bp | 4134724 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242364 |
Product | ABC transporter related |
Protein accession | YP_001507784 |
Protein GI | 158315276 |
COG category | [R] General function prediction only |
COG ID | [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase |
TIGRFAM ID | [TIGR02323] phosphonate C-P lyase system protein PhnK |
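The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in one stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(4132940, 4134724)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1785 bp and 594 aa, matching the Gene Length and Protein Length rows.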
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0289183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0200783 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCA TCACCGGAGC CGACGACGCT CCCGAACTGC TCTCGCCCCC GGCGGCGGCG CCGGAGTCCC CGCCGGAGTC CCCGCAGCCG CCGGAGCCGA CCGCGCTGGT CGAGGTGGAC GGCCTCGACG TGCGGTTCGG CTCCGGCCCC GGCGCGGTGC ACGCCGTACG CGACGTCTCG CTCACCCTGA CCGCGGGCCG CTGTCTGGCC CTCGTCGGGG AGTCCGGGTC CGGCAAGAGC GCGCTGGCCC GCGCGCTGCT GGGCCTGGCC GGGCCGACGG CCACCGTCAC CGCCCGCCGG CTGCGCATCG ACGACGCGGA CGCCCTCACC TTCCGGCCGC GGGACTGGCT GAAGGTCCGC GGCCGGCGCA TCGGACTGGT GTCCCAGGAC GCGCTGGTCG CGCTCGACCC GCTCCGCCCG ATCGGCCGCG AGGTCGCCGA GCCGATCCTG GCGCACCGGC TGCTCCCCCG GCGAGAGGTC GAACCGGCCG TGCACGCGCT GCTGGAGCGG GTCGGCATCC CCGACCCGGC CGAGCGGGCG CGCAGCTACG TCCACCAGCT CTCCGGCGGG CTGCGCCAGC GGGCGCTGAT CGCCTCCGCG CTGGCCGCCG GACCCGGCGC GCTCATCGCG GACGAGCCGA CCACCGCGCT CGACGCCTCG GTGCAGGCCC GCGTCCTCGG CCTGCTCGGG CAGCTCAAGG CGGACGGCAC CGGGCTGCTG CTGATCAGTC ATGACCTGGC GGTCGTCGAG GCGCTGGCCG ACGAGGTCGC GGTGATGCGC GAGGGGGTCG TCGTCGAGGC CGGGCCGGCC GCCGAGGTCC TGGCCCGCCC GCGGCATCCC TACACGATCG CGCTGCTGGA CGCCGTCCCC GGCCGCCGCG GACGTCCGAA CGGCGTCGCC CCGGCGACCG ACACCGCCGC GACCGCCGAC GCGACCTCGA CCGCCGACGC AGCGGGGGCG GGCGACGTCC CGGGCGCCGA CGAGGCGGCG CCGCTGCTCG CGGTCGCCGG TGCCACCAAG CACTTCCGCG GGCCGCGGGG CAGCCGGCGC ACGGCCGTCG ACGACGTCTC GTTCACCCTG CGGGCCGGCG AGACGCTCGG GCTGGTGGGC GAGTCGGGCT CCGGGAAGTC CACCCTGGCC GGCCTGGTGC TCGGCCTCGT CGCCGCCGAC GGCGGCGAGA TCCGGCTGGC CGGGCAGCCG TGGAGCGGCA TGCCCGAGCG GGAGCGGCGG GAACGGCGCC ACCTGCTCCA GCTCGTCCCG CAGGATCCAC TGAGCGCCTT CGACCCGCGC TGGACGGTCG CCCGCATCAT CGGCGAGGGG CTGGAGGCGG CCGGGGTGGA GCGCCGGGAG CGACGCGCGC AGGCCCTGAC CCTGCTCGAA CAGGTCGGGC TGTCCGACAT CCACCTGGAC CGCCGCCCGC TGGCGCTGTC CGGGGGGCAG CGGCAGCGGG TGGCGATCGC CCGCGCGCTG GCGACCCGGC CGCGGCTGCT CGTGTGCGAC GAGCCCGTCT CCGCCCTCGA CGTCTCGGTG CAGGCTCAGG TGCTGGCCCT GTTCCGGGAG CTGAGCGACT CCCTCGGGCT GGCCACCCTG TTCATCTCGC ACGACCTGGC CGTGGTGCGG GAGGTCTGCG CCCGGGTGCT GGTGATGAAG GACGGCCGGA TCGTCGAGAC CGGACCGGTG GAGCAGGTGT TCGCCGACCC GCGGCACCGG TACACCCAGG AGCTGCTCTC CGCCGTCCCC GGCAAGGCGG GGCGGGCCTA CCGCGCGCGT TTCGGAACCG GCTGA
|
Protein sequence | MTAITGADDA PELLSPPAAA PESPPESPQP PEPTALVEVD GLDVRFGSGP GAVHAVRDVS LTLTAGRCLA LVGESGSGKS ALARALLGLA GPTATVTARR LRIDDADALT FRPRDWLKVR GRRIGLVSQD ALVALDPLRP IGREVAEPIL AHRLLPRREV EPAVHALLER VGIPDPAERA RSYVHQLSGG LRQRALIASA LAAGPGALIA DEPTTALDAS VQARVLGLLG QLKADGTGLL LISHDLAVVE ALADEVAVMR EGVVVEAGPA AEVLARPRHP YTIALLDAVP GRRGRPNGVA PATDTAATAD ATSTADAAGA GDVPGADEAA PLLAVAGATK HFRGPRGSRR TAVDDVSFTL RAGETLGLVG ESGSGKSTLA GLVLGLVAAD GGEIRLAGQP WSGMPERERR ERRHLLQLVP QDPLSAFDPR WTVARIIGEG LEAAGVERRE RRAQALTLLE QVGLSDIHLD RRPLALSGGQ RQRVAIARAL ATRPRLLVCD EPVSALDVSV QAQVLALFRE LSDSLGLATL FISHDLAVVR EVCARVLVMK DGRIVETGPV EQVFADPRHR YTQELLSAVP GKAGRAYRAR FGTG
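The relationship between the two sequence rows can be illustrated with a short translation routine. This is a minimal sketch, not the method used by the database. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA), and it ignores the alternative start codons that translation table 11 permits, since this gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids as the standard code; it differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGACGGCCA TCACCGGAGC CGACGACGCT"))
print(translate("TTCGGAACCG GCTGA"))
```

The first call yields MTAITGADDA and the second yields FGTG, which are the first and last residues of the protein sequence row. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.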
|