Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0837 |
Symbol | |
ID | 4027400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 934197 |
End bp | 935903 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637966003 |
Product | hypothetical protein |
Protein accession | YP_572893 |
Protein GI | 92112965 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACGAC GTATCTCACT TGCCTGCGTG ATGGCCGGCA GCCTGCTTCT GGCAGGTTGC GGACAAGATT CGGCAGACGC ACCACTGGCA CATGTTCCGG CTGACACCCC CTTTCTCTTC GCCAACCTCG AAACCATCGA CGACGCGACT CTCGACGCCG CGCTGGCGTC ATCAAACGCT TCACTCGCGC AACAGCGTAT CGACCTGCGC CAATTCGCAG AAGAACTGCG CCGCGACGGC GAGGCCGAGT CCCTCGCCAA TGTGCTTGAC GCACTCGCCG AGGAGCTTGC CGGCAAATCG GACTATCAGC AGGTCGCCGA ACAGATCGGC GTCGACCTGG GCGGTCAGAA CGCTCTCTAC GGTCTGGGCC TGAGCCCGGT ATTGCGCCTG AGCATCAACG ATGCTGAGCG CTACCAGGCT TTCCTGCAAC GTCTTGCAGA CGCTGCAGGC CTGCCGCTGG AAACCCGCAC GCAGGGCGAG CTGGAATACC GCCAGGCACG CCTCGGCGAG GCGCCGCTGC AACTGCTGAG TACCGTTCAT GACGGCCAGG CGGTGCTGGC AGTCGCCCCC ACCGAACTGG ATGACGACGC ACTGCAGCAA GTGTTAGGCA CGAGCCTGCC CGACAGCAGC GTGCAGGATA CCCAGCGCCT CAGTGAGCTG GCCGACGCCA AGGACTACCT GCCCTACGGC CTGGGCTATG TCGATACGAC ACGCTTGGCG ACCTTGCTCA CCGGCAGTCA GGATCTCATG ATTCAGGCCT TTCGCGCATT CGCCGAGCAG ACGCAAGGTC AGGCCCCAGA ACCGGTTTCG CAGAGCTGCC GCGAGGATGC GACGCGCCTA GCCTCGCGCA TGCCGCGACT GAGCGCCGGC TACACCACGC TCGACGCCGC ACGCACCGAG CAACGCTTCG ATGTGTCATT GGCCGAAGAT ATCACCGCCC CGCTTGCCTC GCTCACTTCA ACACTGCCGG GCCTGGGCAA TGACTCGCTT GAGTCTCCCT TCGACCTTGC GATCGCACTG CCCATGAACG ACCTGCGCGA CCTGCTGACC CAACAGATCC AACACGTGCG TACCGCACCG TTCAGTTGCT CGGCGCTCGC CGAACTCAAC AACGATCTGG ACGAACTCGG CCGTCAGGCC AACATGCTGG CCATGCCCCC GTTCGGTAGC CTGCGCGGCA TGCGGCTGGT AATCGATGAG CTCACGATGC CCAAGCATAG CGATCAGCCC GCCATCAAGG GCGCTCTGCT GGTAGCCTCC AGCGACCCCA ACGGGTTGAT GGCGATCGGC CAGAGCATGC TGCCCGGGCT CGCGACACTC TCCCTTTCCA ACGACGGCGA ACCCCAAGCT CTGCCGCCAC AGCTCACCGC GATGCTAGGC GATGCACCGG CCTGGCTGGC CATGACAGAC AAGGCACTCG GCGTGGCAAC GGGTGAGGGC GAGCAGACGA CGCTCAAGTC CTTGCTTCAG GAAGAAACCG GCGAGGCCGG CGAACTGATG CACGTCAAGC TTTCCGGCGA CATGTACGCC AAGTGGCTTC AGCTTGCCGA CGCCTTCGGC AACCTCGCAG GCAACGACGC TGCAGCACTC GAAGAGCAGC TCGATGCCAT GCAGAACCAA TTCGAGCGCA TCGACAACGT CGTAATACGC ATGCGTATGG AAGACGACGG CCTGGTCATC AACAACCGTA TCGACTGGCA ACAGTAA
|
Protein sequence | MLRRISLACV MAGSLLLAGC GQDSADAPLA HVPADTPFLF ANLETIDDAT LDAALASSNA SLAQQRIDLR QFAEELRRDG EAESLANVLD ALAEELAGKS DYQQVAEQIG VDLGGQNALY GLGLSPVLRL SINDAERYQA FLQRLADAAG LPLETRTQGE LEYRQARLGE APLQLLSTVH DGQAVLAVAP TELDDDALQQ VLGTSLPDSS VQDTQRLSEL ADAKDYLPYG LGYVDTTRLA TLLTGSQDLM IQAFRAFAEQ TQGQAPEPVS QSCREDATRL ASRMPRLSAG YTTLDAARTE QRFDVSLAED ITAPLASLTS TLPGLGNDSL ESPFDLAIAL PMNDLRDLLT QQIQHVRTAP FSCSALAELN NDLDELGRQA NMLAMPPFGS LRGMRLVIDE LTMPKHSDQP AIKGALLVAS SDPNGLMAIG QSMLPGLATL SLSNDGEPQA LPPQLTAMLG DAPAWLAMTD KALGVATGEG EQTTLKSLLQ EETGEAGELM HVKLSGDMYA KWLQLADAFG NLAGNDAAAL EEQLDAMQNQ FERIDNVVIR MRMEDDGLVI NNRIDWQQ
|
| |