Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6719 |
Symbol | |
ID | 5675032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8165208 |
End bp | 8167730 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641245567 |
Product | helicase c2 |
Protein accession | YP_001510959 |
Protein GI | 158318451 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCTGG ACTGGAACCG GATCGGCTCG GCGTCGCCGC AGCGGCTCCT GAGACCGCGG GACATCTTCG CGGCGCTGCC GAACAAGCCC TGGTCCTACC TGCGGCAGGA ACAGGGCGAG GCGCTGGAAG GCTGGTTCGA CCGCCGAAGT AACCGCGACG TCGTGATCAA GCAGAACACG GGAGGCGGGA AGACCGTCGT CGGTCTCCTC ATCGCGCAGA GCACCCTCAA CGAGGGCGTC GGCAAGGCCG TTTACCTCGC TCCGGACACC TATCTGGCAA GCCGCGTCCG GGCCGAGGCC GTCCGGCTGG GCCTGCCGAC CGCGGACCGG CCCGACGACG TCGGTTTCGC CAGTGGCCGC GCCATCCTCG TCACGACCTT CCAGAAACTG GTCAACGGCC AGTCCGTGTT CGGCGTGACC GGGGGCCGCC GCCCCGCGAT CGATCTGGGC GTGTGCGTCG TCGACGACGC GCACGCCGCA CTCGCCACAA CCGAGGGCCA GTTCCGTCTC ACCATCCCGT TCGAGCACCC AGCGATGCGG AAACTGTTCG ACCTGTTCCA CTCGGCCGCC GAGGGCCAAA GCCAGAAGGC CTGGCTGGAC ATGTCCTTCG GCGACCCGGG TGCCGGGATC CGGGTGCCGT TTTGGGCCTG GGCCGACCTG CAGAGCGAGG TACTCAAGAT CCTGCACCCG CACCAGCGCG ACCCCGAGTT CATGTTCGCC TGGCCACTGA TCGCCGACTG CCTGAGCCTG TGCGCGGCGA TGCTTTCCAG CCGGGGCGTG GAGATCCGAC CCCCGTGCCC GCCCATCGAC AAGATCCCCT CGTTCGTAAA CGCCAAGCGG CGCGTGTACC TGACGGCGAC CCTCTCCGAC GACAGCGTGC TGGTCACCAA CCTCAAGGCC GACCCGAAGG ATCTGACGAA CATCGTCACC CCCGGCAGCG CGGCCGACCT CGGTGACCGC CTGGTACTCG CGCCGCTGGA GCTCAACCCG ACGCTCGATG AGGCGGCGAT TCGGCAACTG GCCAAGGGCT TCGCGACCGG GGACCGCGAC GGTGACGGGC GCCCGGAGGC CAGGCCCGTG AACGTGGTGG TGCTCGTGCC CAGCGCAGTC AAGGCCGCGC AGTGGCGTCG CCTCGCCGAC CGCGAGCTCT ACGTCAGAGA TCTCGAGCAG GGTGTCGCCG CGCTCACTGC CGGGCATGTC GGCCTGGTCG TGCTCGTCAA CAAGTACGAC GGCATCGATC TGCCTGGCGG CGCATGCGAA CTGCTCATCA TCGACGGCAT CCCCCGGTCT CTCGACGCAG CCGACCGCCG CGAGGCCTCC GCCCTGGCGG ACAGCCCAGC CCAGCGCGCA CGCGACGTCC AGCGGATCGA GCAGGGCATG GGCCGCGGCG TCCGTGACGT CAACGACCAC TGCGCGGTGC TCCTCATGGG CGCCGGCCTG GGCCTCGCCG TGCACGACGA CGCCTGGCTG GAGCTGTTCT CCCCGGCGAC CCGCGCGCAG CTGCGCCTGA GCCGGGACGT TGCCATACAG GTGCAGCACA CGGGCCTGGA AGGCATTCGC AGCGCGTTGT CGGTCTGTCT GGACCGCGAC CCGCAGTGGG TGGAACGCAG CAAGCTCGCC CTCGCCCCCA TCCGCTACAA CAACGTCGGG TCCGTGCGTC CCGAAGCCGT GGCCGCCCGC GAGGCGTTCG ACCTGGCGAC AACCGGCCAG TTCAACCAGG CGGCCCAGCG GTTGCAGACG GCGATCGACG GCATCACCGA CCCCGCGCTG CGTGGCTGGG TGCGGGAGCA GAAGGCGACC TACCTGCACT TCGTCGACCC TGCCGTCGCG CAGCAGCAAC TGGGAGCAGC GATCCGCGAG AACCCGGCCG TGCTGCGACC GGCCGTGGGC GTGGACGTCC GGCAGGCCCG TGCCGCCGCG GTCCAGGCCC GCGCCGCCGC CGAGCACCTC ACCGCCACCT ATGCCGACAG CACAGCCCTC GTCCTCGGCC TGCGCGCGGT ACTCGATGAT CTCGTCTGGG ACGACGACCG CACCGACAAC ACCGAGGCGG CCTGGGAGAA GCTGGGCCTG CACCTGGGGT TCGACAGCAT GCGCCCCGAA CGCCTGTACG GCACCGGCCC GGACAACCTG TGGGCGCTTA GTGCCACCCG GCACGCTGTG ATCGAACTCA AGACCGGAGT GAAGGCAGAC TGCCCCGGCA TCGCCAAGAA GGACGCCGAC CAGCTCGGCG GCAGCGTGCG CTGGAACGAG AGCAAGAACC CCGGGCCGGC CAACGTGCCG GTCATGCTGC ACCCGGTCGA CGTGCTCGAC CCCAAGGCCG TCGCCGTGCC CGGCATGAGG GTGATCACTC CGGCCAAGCT CGACGAGCTG AAGACGGCCG TCCAGGCCTT CGCGACGGCG CTCGCCCAAG ACGCCAGCCG TTGGGCCAGC GAACAAGCGG TTGCCAACCA ACTGGCGTTC CATGCCCTGA CCGGCGACCG CATCATCGCT ACGTACTCCG TCGCGGTCGG GCGACCCAGC TGA
|
Protein sequence | MALDWNRIGS ASPQRLLRPR DIFAALPNKP WSYLRQEQGE ALEGWFDRRS NRDVVIKQNT GGGKTVVGLL IAQSTLNEGV GKAVYLAPDT YLASRVRAEA VRLGLPTADR PDDVGFASGR AILVTTFQKL VNGQSVFGVT GGRRPAIDLG VCVVDDAHAA LATTEGQFRL TIPFEHPAMR KLFDLFHSAA EGQSQKAWLD MSFGDPGAGI RVPFWAWADL QSEVLKILHP HQRDPEFMFA WPLIADCLSL CAAMLSSRGV EIRPPCPPID KIPSFVNAKR RVYLTATLSD DSVLVTNLKA DPKDLTNIVT PGSAADLGDR LVLAPLELNP TLDEAAIRQL AKGFATGDRD GDGRPEARPV NVVVLVPSAV KAAQWRRLAD RELYVRDLEQ GVAALTAGHV GLVVLVNKYD GIDLPGGACE LLIIDGIPRS LDAADRREAS ALADSPAQRA RDVQRIEQGM GRGVRDVNDH CAVLLMGAGL GLAVHDDAWL ELFSPATRAQ LRLSRDVAIQ VQHTGLEGIR SALSVCLDRD PQWVERSKLA LAPIRYNNVG SVRPEAVAAR EAFDLATTGQ FNQAAQRLQT AIDGITDPAL RGWVREQKAT YLHFVDPAVA QQQLGAAIRE NPAVLRPAVG VDVRQARAAA VQARAAAEHL TATYADSTAL VLGLRAVLDD LVWDDDRTDN TEAAWEKLGL HLGFDSMRPE RLYGTGPDNL WALSATRHAV IELKTGVKAD CPGIAKKDAD QLGGSVRWNE SKNPGPANVP VMLHPVDVLD PKAVAVPGMR VITPAKLDEL KTAVQAFATA LAQDASRWAS EQAVANQLAF HALTGDRIIA TYSVAVGRPS
|
| |