Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4179 |
Symbol | |
ID | 5672534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4967846 |
End bp | 4970233 |
Gene Length | 2388 bp |
Protein Length | 795 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243052 |
Product | ABC transporter related |
Protein accession | YP_001508469 |
Protein GI | 158315961 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0706824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.286625 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACG CCACGAGGAC GGACCCGCAG CCGCCCGCGC CCCACCTCGC CGACAGCCAC GACCTCATCC GCGTCCACGG CGCCCGCGTG AACAACCTCA AAGACCTCAG CGTCGAACTC CCCAAACGCC GACTCACCGT GTTCACCGGC GTCTCCGGCT CAGGCAAAAG CTCCCTGGTC TTCGGCACCA TCGCCGCGGA ATCCCAGCGA CTGATCAACG AAACCTACAG CGCCTTCGTC CAAGGCTTCA TGCCCACCCT CGCACGACCC GAAGTCGACG TCCTCGACGG ACTCACCACC GCGATCATCG TCGACCAGCA GCGGCTCGGC GCCGACCCCC GCTCCACCGT CGGCACCGCC ACCGACGCCA ACGCCATGCT GCGCATCCTG TTCAGCCGGC TGGGACGTCC ACACATCGGC TCACCCAACG CCTTCTCCTT CAACGTCCCC ACCGTCCGGG CAAGCGGCGC GATCACCACC GAACGCGGAA CCAGCAAAAC CGAACGAAAG ACCTTCACCC GCACCGGCGG CATGTGCCCC CGCTGCGAAG GCCGCGGCGC CGTCTCCGGC TTCGACCTCA CCGCCCTCTA CGACGACAGC AAATCCCTCA ACGAAGGCGC CCTAACCATC CCCGGCTACA GCGTCGACGG CTGGTACGGC CGCATCTTCG GCGGCTCCGG CTTCCTCGAC CCCGACAAAC CCATCCGCCG ATACACCAGG ACCGAACTCC ACGACCTGCT CTACAAAGAA CCCACCAAAA TCAAGGTCGA CAACGTCAAC CTCACCTACG AAGGCCTCAT CCCGAAAATC CAGAAATCGA TCCTGTCCAA AGACCGCGAA GCGATGCAGC CACACATCCG CGCCTTCGTC GACCGGGCCG TCACCTTCAC CACCTGCCCC GACTGCGACG GCACCCGGCT CAGCGAAGCC GCCCGCTCCT CCCGCATCGC CGGCACCAAC ATCGCCGATG CCTGCGCCAT GCAGATCAGC GACCTCGCCC ACTGGGTCCG CGACCTCGAC GAACCATCCG TCGCACCCCT GCTCACCGCG CTGCACCACA CCCTCGGCTC CTTCGTCGAG ATCGGCCTGG GCTACCTCTC CCTCAACCGG CCCTCCGGCA CCCTCTCCGG CGGCGAGGCG CAGCGCGTCA AAATGATCCG CCACCTCGGC TCCTCACTCA CCGACACCAC CTACGTCTTC GACGAACCCA CCGTCGGCCT GCATCCCCAC GACATCCAGC GCATGAACAA CCTGCTGCTG CGACTGCGAG ACAAGGGCAA CACAGTGCTC GTCGTCGAAC ACAAGCCGGA AACAATCGCC ATCGCCGACC ACGTCGTCGA CCTCGGGCCC GGCGCCGGCA CCGCCGGCGG CACCGTCTGC TACGAAGGCA CCCTCGCCGG GCTACGAACC AGCGGCACTC TCACCGGCCG CCACCTCGAC GACCGCGCCA CCCTCAAACC GACCGTGCGC ACCCCCACCG GCCAGCTCCC GATCCGCGGC GCGACCACCC ACAACCTGCA CGACGTCAAC GTCGACATCC CCCTCGGCGT ACTCGTCGTC GTCACCGGCG TCGCCGGCTC CGGCAAAAGC TCCCTCATCC ACGGATCGAT CCCCGCCGGC GCGGACGTCG TCTCGATCGA CCAGACCGCC ATCCGCGGCT CACGACGCAG CAACCCCGCC ACCTACACCG GACTGCTCGA CCCGATCCGC AAGGCATTCG CGAAAGCCAA CGGTGTCAAG CCCGCACTGT TCAGCGCCAA CTCCGAAGGC GCCTGCCCCG CCTGCAACGG CGCTGGCGTC ATCTACACCG ACCTGGCGAT GATGGCCGGC GTCGCCAGCA CCTGCGAAGA ATGCGACGGC AAACGGTTCG AAGCCTCCGT GCTCAACCAC CACCTCGGCG GCCGCGACAT CAGCGAAGTC CTCGCCATGT CCGTCACCGA CGCCCAGGAG TTCTTCGGCA CCGGCGAGGC ACGCACACCC GCCGCACACA CCATCCTCAA CCGGCTCGCC GACGTCGGAC TCGGCTACCT CACCATCGGC CAGCCACTCA CCACCCTCTC CGGCGGCGAA CGGCAACGAC TCAAACTCGC CACCCACATG GCCGACAGGG GCGCCACCTA CATCCTCGAC GAACCCACCA CCGGCCTGCA CCTCGCCGAC GTCGAACAAC TCCTCGGCCT ACTCGACCGG CTCGTCGACT CCGGCAAGTC CGTCATCGTC ATCGAACACC ACCAGGCCGT CATGGCCCAC GCCGACTGGA TCATCGACCT CGGCCCCGGC GCCGGTCACG ACGGCGGCCG GATCGTCTTC GAAGGCACAC CCGCCGACCT CGTCGCCGCC CGTTCCACCC TCACCGGCGA ACACCTCGCC GCCTACATCG GCACCTGA
|
Protein sequence | MSNATRTDPQ PPAPHLADSH DLIRVHGARV NNLKDLSVEL PKRRLTVFTG VSGSGKSSLV FGTIAAESQR LINETYSAFV QGFMPTLARP EVDVLDGLTT AIIVDQQRLG ADPRSTVGTA TDANAMLRIL FSRLGRPHIG SPNAFSFNVP TVRASGAITT ERGTSKTERK TFTRTGGMCP RCEGRGAVSG FDLTALYDDS KSLNEGALTI PGYSVDGWYG RIFGGSGFLD PDKPIRRYTR TELHDLLYKE PTKIKVDNVN LTYEGLIPKI QKSILSKDRE AMQPHIRAFV DRAVTFTTCP DCDGTRLSEA ARSSRIAGTN IADACAMQIS DLAHWVRDLD EPSVAPLLTA LHHTLGSFVE IGLGYLSLNR PSGTLSGGEA QRVKMIRHLG SSLTDTTYVF DEPTVGLHPH DIQRMNNLLL RLRDKGNTVL VVEHKPETIA IADHVVDLGP GAGTAGGTVC YEGTLAGLRT SGTLTGRHLD DRATLKPTVR TPTGQLPIRG ATTHNLHDVN VDIPLGVLVV VTGVAGSGKS SLIHGSIPAG ADVVSIDQTA IRGSRRSNPA TYTGLLDPIR KAFAKANGVK PALFSANSEG ACPACNGAGV IYTDLAMMAG VASTCEECDG KRFEASVLNH HLGGRDISEV LAMSVTDAQE FFGTGEARTP AAHTILNRLA DVGLGYLTIG QPLTTLSGGE RQRLKLATHM ADRGATYILD EPTTGLHLAD VEQLLGLLDR LVDSGKSVIV IEHHQAVMAH ADWIIDLGPG AGHDGGRIVF EGTPADLVAA RSTLTGEHLA AYIGT
|
| |