Gene Information |
Locus tag | Csal_0469 |
Symbol | |
ID | 4028000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 517061 |
End bp | 520003 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637965627 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_572530 |
Protein GI | 92112602 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
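The coordinate and length fields in the record above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the CDS length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Csal_0469.
length = gene_length_bp(517061, 520003)
print(length)                     # 2943
print(protein_length_aa(length))  # 980
```

Both results match the Gene Length (2943 bp) and Protein Length (980 aa) fields. The strand field does not affect this calculation, since the coordinates describe the same interval on either strand.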
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAT TCATTCCGGG CCAGCGCTGG GTCAGTGACG GCGAAGCCGA ACTGGGGCTG GGCACGATCC TGAGCTGCGA TTTCCGAAGC GTGACCGTTC TCTTTGCCGC CAGTCAGGAA ACGCGCACCT ACAGCGTTCG CCAGGCACCG CTGACCCGCG TGGTATTCGG CGCCGGCGAC CGGATAGAAG CCGCCGACGG CCGCACGCTG ACCGTCGACG ACACCAAGGA GGCCAAGGGG CTGCTGATCT ATGTCGTCGA GACGGCGGAC GGCGAATACG AAGAGTTTCC CGAAAGCCGC CTCGCCGACA GCATGCGCTT CCAGCAAGCG CGGGACCGCC TGCTGACCGG TCAGGTCGAC CGCAACGACT GGTTCGAGCT GCGTTACCGC ACCCTGCACC ACTATCACCG CATCGCTCAA CACCCGGCGA ACGGCCTGGC AGGCCCGCGG GTGGACCTCA TCCCGCACCA GCTGTACATC GCCGACGAAG TCTCCCGTCG CCATGCCCCG CGCGTGCTGC TGGCCGACGA AGTGGGCCTG GGCAAGACCA TCGAGGCGGG TCTGATCCTG CACCGCCTGC TGATCACCGG TCGTGCCGAG CGTGCGCTGA TTCTGGTGCC GGACAGCCTG ACCCACCAGT GGCTGGTGGA AATGCTGCGC CGTTTCTCGC TGCATTTCAC GCTGCTTGAC GAGCAACAGA GCCAGCACAG TACGGGCAAC CCCTTCGAAA GCGCCCAGCT GGTGCTCGCC AGCCAGCAGT GGCTATTCGC CAACCCACAC CGCCAGACCC AGGCCCAGGC CTGCGACTGG GACTTGCTGA TCGTCGACGA GGCGCATCAT CTCGACTGGA GCCCCGAGGC CAGCGGCCCC GGCTATGCCT GCGTCGAGCA GCTGGCGCAT GCCATCGACG GCGTGCTGCT GCTCACCGCC ACGCCCGAGC AGATGGGGCT TGCCAGCCAT TTCGCCCGCC TGCGTCTGCT CGACCCCGAG CGCTATACCA GCCTGGCGGC CTTCGAGGCC GAAGAAACGG GCTACGCCCG GGTCGCGGAG GCGGTGGACG CCCTCGAGCG CCTGCCCGGC GATGCCGCGG ACCGCGATAG CGTGGGCGCG GTCATCGACG ACGCGGACAG CCAGGCACTG CTCGACACGC TGATCGACCC GGAAAGCGAC GACACCCAGC GCGACAGCGC ACGCCGCCAG CTACGCGAGC AGTTGCTCGA CCGTCACGGG ACCGGCCGGG TGATGTTCCG CAACAGCCGT CGCCATGTCG CCGGCTTTCC CGAACGCCAC CTGCACGTCA CTCGCCTGGC GCCGCCGTCC AGCTATCGCC GCGAGTTGCG CAAGCTGGAT CGCGACATGG ACTACCTCGA CGACCTGCTG ATCACCACCG GCCTCGACCA TCCCGAGGTG CTGCTGTATC CGGATGCCAT GTATCGCGCG CAATGCGATG CCTCGCCGGG CGACACCGCA CTCAACGCCG AGCCCTGGTG GCAGATAGAT CCGCGCGTGA CCTGGCTGCT GGAGCGCCTG ACCGAAGTGG GGCGCGACAA GATGCTGGTC ATCGCCCACG ACCGCGAGAC CGCCAGCGAT CTCAGCGAGG CACTGCGCGT ACTGGGCGGA GTCCAGGCAC CGGTCTTCCA TGAGGGCCTG ACGCTCATCG AACGCGACCG TGCCGCGGCG GCCTTCGCCG ACGAGGAAGA CGGGTGCCAG GTACTGGTGT GCTCGGAGAT CGGCTCCGAG GGACGCAATT TCCAGTTCTG CCGGCATCTG GTGATGTTCG ACTTGCCGGC GCATCCCGAC 
CAGCTCGAAC AACGCATCGG CCGCCTCGAC CGTATCGGCC AGCAGCACGC CATCGAGATT CATGTGCCGG TCTTCGAGGA CAGCCCCGGC GAGCGCTTGC TGCGCTGGTA CGACGAGGGC ATGCGCGCCT TCGACGCCCC GCACGGCATC GGCAGCGAAC TCTACGACGC TTTCGGCGAC AGCGTGGCCG ATGCCCTGCT GGATGACGAC GCGCTCGCCG AGGTCATTGC CGACACGCGA CGCTTCTTCG AAAGCCGCCT CGCCGCGCAC GAGGCCGGCC GCGACCGGCT GCTGGAGTGG AACGCCTGTC GGCCGGCGCG CGCCCAGGCC ATCACCGAGG CGATTCGCGA GCTCGACGAC GACCCCGCAC TGCCGCGTTA TGTCGACAAG GCGCTGGACG TATTCGGCGT CGAAAGTCGC GATCTGGGCG GCGACATTCA GCACCTGCTG GCCGGGCCCC ACATGCTCGA CGGCTTGCCG GGACTGGTCA AAGGCGAGGA AGGCTTCTCG GCCACCTTCG ACCGGCAGCG CGCCCTGGCC CGCGACGATG TGCAGCGCTT GTCCTGGGAG CACCCTCTGG TCCGCGAGAT GATGGAGCGC ATCCTCGACG GCACGCTTGG CAACACCGCA TTGGCGCTGC TTCAGCACCC GGCGATTCCG GGCGGACGAC TGATGGCCGA ACTGGTGTTC CGCACCCACT GTCCGGCGCC CAGACACCTG AACGTGGGGC GCTTCCTGCC GCCGACGGCG GTGCGCCTGC TGCTCGACGA GTCCGGCGCC AACCTGACCC AGAAGGTCTC CTTCGGCGGG CTGGCGAAAA ACCTTCAGAA GGTCAAGAAG GCGGTCGCCC GGGACCTGAT CAAGTCGCGT CACGCGCAGT TGCGCGACCT GTTGACGCGG GCCGAGGACG AGGCAGAGAA CGAACTGCCG AGCATCATCG AAGCCGCGCA GACGCACATG CGTGACACGC TGGACACCGA GCTTGCGCGT CTCAAGGCCC TCGCACGTCA CAACCCGGCG GTACGCGACG CGGAAATCGA GGCCCTCGCC CACGAACGCC GCGAGCTGGA CACGGCCATC GACGCCACAC GCCTGCGCCT CGACGCGGTC CGCGTGGTGG TGACCGTCGA CCCGCCTGAA TGA
|
Protein sequence | MSEFIPGQRW VSDGEAELGL GTILSCDFRS VTVLFAASQE TRTYSVRQAP LTRVVFGAGD RIEAADGRTL TVDDTKEAKG LLIYVVETAD GEYEEFPESR LADSMRFQQA RDRLLTGQVD RNDWFELRYR TLHHYHRIAQ HPANGLAGPR VDLIPHQLYI ADEVSRRHAP RVLLADEVGL GKTIEAGLIL HRLLITGRAE RALILVPDSL THQWLVEMLR RFSLHFTLLD EQQSQHSTGN PFESAQLVLA SQQWLFANPH RQTQAQACDW DLLIVDEAHH LDWSPEASGP GYACVEQLAH AIDGVLLLTA TPEQMGLASH FARLRLLDPE RYTSLAAFEA EETGYARVAE AVDALERLPG DAADRDSVGA VIDDADSQAL LDTLIDPESD DTQRDSARRQ LREQLLDRHG TGRVMFRNSR RHVAGFPERH LHVTRLAPPS SYRRELRKLD RDMDYLDDLL ITTGLDHPEV LLYPDAMYRA QCDASPGDTA LNAEPWWQID PRVTWLLERL TEVGRDKMLV IAHDRETASD LSEALRVLGG VQAPVFHEGL TLIERDRAAA AFADEEDGCQ VLVCSEIGSE GRNFQFCRHL VMFDLPAHPD QLEQRIGRLD RIGQQHAIEI HVPVFEDSPG ERLLRWYDEG MRAFDAPHGI GSELYDAFGD SVADALLDDD ALAEVIADTR RFFESRLAAH EAGRDRLLEW NACRPARAQA ITEAIRELDD DPALPRYVDK ALDVFGVESR DLGGDIQHLL AGPHMLDGLP GLVKGEEGFS ATFDRQRALA RDDVQRLSWE HPLVREMMER ILDGTLGNTA LALLQHPAIP GGRLMAELVF RTHCPAPRHL NVGRFLPPTA VRLLLDESGA NLTQKVSFGG LAKNLQKVKK AVARDLIKSR HAQLRDLLTR AEDEAENELP SIIEAAQTHM RDTLDTELAR LKALARHNPA VRDAEIEALA HERRELDTAI DATRLRLDAV RVVVTVDPPE
|
| |
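The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, assuming plain Python with no external libraries. `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence from the record.
first_blocks = "ATGAGCGAAT TCATTCCGGG"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 50.0
```

Applied to the complete gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length field, and `gc_percent(...)` with the GC content field. The same cleaning step applies to the protein sequence before comparing its length with the Protein Length field.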