Gene Information |
Locus tag | Franean1_5464 |
Symbol | |
ID | 5673795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6607621 |
End bp | 6609438 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244319 |
Product | ATP-dependent DNA helicase RecQ |
Protein accession | YP_001509725 |
Protein GI | 158317217 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0514] Superfamily II DNA helicase |
TIGRFAM ID | [TIGR00614] ATP-dependent DNA helicase, RecQ family; [TIGR01389] ATP-dependent DNA helicase RecQ |
|
|
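The coordinate and length fields in the Gene Information block can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon; neither is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information block above.
START_BP = 6607621
END_BP = 6609438
GENE_LENGTH_BP = 1818
PROTEIN_LENGTH_AA = 605


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 6609438 - 6607621 + 1 gives 1818 bp, and 1818 / 3 - 1 gives 605 aa, matching the listed values.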
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.195323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAAGTGC TCCGCCGCGT CTTCGGCTAC GAGTCCTTCC GCGACGGCCA GCAGGAGATC ATCGATCACG TCGTCGGCGG TGGCGACGCG CTGGTCCTCA TGCCGACCGG CGGCGGCAAG TCGCTGTGCT ACCAGATCCC GGCGCTCGTC CGGCCTGGCA CCGGCGTCGT CATCTCCCCG CTCATCGCGC TGATGCAGGA CCAGGTCGAC GCGTTGCTGG CGCTCGGGGT CCGGGCGGGG TTCCTCAACT CCACCCAGCA GGCCGACGAG CGCCGCGCGG TGGAGTCCGC GTTCCTCGCC GGCGAGCTCG ACCTGCTCTA CCTGGCCCCC GAGCGGCTGC GGGTACGCGC GACGATCGAG CTCCTCGACC AGGGCGAGAT CGCGCTGTTC GCGATCGACG AGGCGCACTG CGTCGCGCAG TGGGGGCACG ACTTCCGGCC CGACTACCTC CTGCTCTCCG AGCTGCACCA GCACTGGCCG CAGGTGCCGC GCGTCGCACT GACCGCCACC GCCACCCCGG CGACCCACCA GGAGATCACG AACCGGCTCG GGCTGGGAAA CGCCCGCCAT TTCGTCGCCG ACTTCGACCG GCCGAACATC CAGTACCGCA TCGTGCCGAA GAACGACCCG AAGGCACAGC TGCTGGAGCT CCTGCGCACC GAGCACCCCG GCGACGCGGG CATCGTGTAC TGCCTGTCCC GGGCGTCGGT CGAGAAGATC GCGGAGTTCC TGGTGTCGAA CGGGATCGCC GCGCTGCCCT ACCACGCCGG CCTCGAAGCA CGCGTCCGCG CCGAGCACCA AGCCCGTTTC CTCCGTGAGG ACGGCCTGGT CATGGTCGCC ACGATCGCCT TCGGGATGGG CATCGACAAG CCGGACGTCC GCTTCGTCGC CCACCTCGAC CTACCGAAGT CGGTAGAGGG CTATTACCAG GAGACCGGCC GCGCCGGCCG GGACGGGCTG CGCTCCACCG CCTGGCTCGC GTACGGCCTG CAGGACGTGG TCCAGCAGCG CCGGCTCATC GACGCCTCCG ACGGCGACGC CACGCACCGC CGGCGGCTCA ACTCCCACCT CGACGCGATG CTCGCACTGT GCGAGACCAT CGAGTGCCGC CGGGTCGGCC TGCTGGCGTA CTTCGGGCAG CAGGGGTCGA GCGCCTGCGG CAACTGCGAC GCCTGCCTGC ACCCGCAGCA GTCCTGGGAG GCGACCGTGC CCGCCCAGAA GCTGCTGTCG ACGGTGCTGC GCCTGCAGCG GGAACGCCGG CAGAAGTTCG GCGCCGGCCA CATCGTCGAC ATCCTGCTGG GCCGGCGCAC ACCCAAGGTC AACCAGCACG GCCACGACTC GCTGACGGTC TTCGGCATCG GCACCGAGCT CAGCGAGGCC GAGTGGCGTG GCGTGGTCCG CCAGCTGCTG GCCCAGGGGC TGCTGGCCGT CGAGGGTGAG CACGGAACGC TCGTCCTCAC CGACGCGAGC GCGGACGTCC TGCGCGGGCA GCGGACGGTC TCCATGCGGC GCGAACCGAA GCGACCGGCG AAGGCCACCA GCTCCTCGGC GACGAAGGCA CGCCGGGCCG AGCCGGTCGA ACTGTCCGCG GAGGCGGCGC CGCTGTTCGA ACGGCTGCGC GCCTGGCGGG GCGCCACCGC CAAGGAACAG GGCGTTCCCG CCTACGTGAT CTTCCATGAC GCGACGCTGC GCGAGATCGC CACCCGCGCG CCCTCCTCGC TCGCCGAGCT CGCGACGGTG AACGGCGTCG GCGAGAACAA GCTGGCCAAG TACGGCGAAC ACATCCTCCC GCTGTGTAGC 
GCCACGCCCA CGGACTGA
|
Protein sequence | MEVLRRVFGY ESFRDGQQEI IDHVVGGGDA LVLMPTGGGK SLCYQIPALV RPGTGVVISP LIALMQDQVD ALLALGVRAG FLNSTQQADE RRAVESAFLA GELDLLYLAP ERLRVRATIE LLDQGEIALF AIDEAHCVAQ WGHDFRPDYL LLSELHQHWP QVPRVALTAT ATPATHQEIT NRLGLGNARH FVADFDRPNI QYRIVPKNDP KAQLLELLRT EHPGDAGIVY CLSRASVEKI AEFLVSNGIA ALPYHAGLEA RVRAEHQARF LREDGLVMVA TIAFGMGIDK PDVRFVAHLD LPKSVEGYYQ ETGRAGRDGL RSTAWLAYGL QDVVQQRRLI DASDGDATHR RRLNSHLDAM LALCETIECR RVGLLAYFGQ QGSSACGNCD ACLHPQQSWE ATVPAQKLLS TVLRLQRERR QKFGAGHIVD ILLGRRTPKV NQHGHDSLTV FGIGTELSEA EWRGVVRQLL AQGLLAVEGE HGTLVLTDAS ADVLRGQRTV SMRREPKRPA KATSSSATKA RRAEPVELSA EAAPLFERLR AWRGATAKEQ GVPAYVIFHD ATLREIATRA PSSLAELATV NGVGENKLAK YGEHILPLCS ATPTD
|
| |
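The gene and protein sequences above can be related with a small translation routine. This is a minimal sketch, not the portal's own method: it builds the codon table from the NCBI compact amino-acid string, treats an alternative start codon from translation table 11 as methionine when it appears at the first position, and ignores the spaces that group the sequence into blocks of ten. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (bases T, C, A, G; first base slowest).
# Translation table 11 shares this mapping with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of ten)."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases and last 18 bases of the gene sequence above.
    print(translate("GTGGAAGTGC TCCGCCGCGT C"))  # MEVLRRV
    print(translate("GCCACGCCCA CGGACTGA"))      # ATPTD
```

The two demo fragments reproduce the first seven and last five residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.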