Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0991 |
Symbol | |
ID | 4026214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1112659 |
End bp | 1114671 |
Gene Length | 2013 bp |
Protein Length | 670 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637966168 |
Product | protein of unknown function DUF224, cysteine-rich region |
Protein accession | YP_573047 |
Protein GI | 92113119 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.996446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGATA TCCTCCTGCC GATCCTGATC TTTTCCGCCC TGGCGCTGGC GGCGATCGGC GCCGTGCGGC GCATTCGCCT GTGGCGTCAG GGGCGGCCAT CGCCGGTGCC CGTGCTCCGC GGTCTGGCGG CGGTACCGCG TCGGTATCTG GTCGACCTGC ACCACGTGGT GGCGCGCGAC AAGGTGATGT CCAACACTCA CGTCGCCACC GCCGGCGGCT TCGTCGCCGC CATGGGGCTG GCGATCGTCG TCCATGGCCT GGGCCTTGCC GAGGGCGTCC TGGGCTGGTT GCTGCTGGCG GCGAGCGCGA CGATGTTCGC AGGCAGCCTG TTCGTCGCCC GGCGTCGGCG GAATCCTCCG GCGCGCCTGT CGAAGGGGCC CTGGATGCGT TTGCCCAAGA GTCTGATGGC GTTCTCGCTG GGCATCTTTG TGGTTACCCT GCCGGCGGTG GGTGTGCTGC CCGCCGACGC CGGTGGCTGG CTGGTCGCGC TGGTGCTGGC TGCCGTGGTG GCCTGGGGCC TGGCCGAGCT GGTGTTCGGC ATGACATGGG GCGGGCCGAT GAAGCACGCC TTTGCCGGCG CCCTGCACCT GGCCTTCCAT CGCCGGCCGG CGCGCTTTTC GGACAAGCGC GGGGGCGAGG GCCGCTCGAC CGGGCTCAAG GCGCTGGACC TGGACGACGC CGACGCGCCG CTGGGCGTCG AGAAGCCCGC CGACTTCACC TGGAACCAGT TGCTGGGCTT CGACGCCTGC GTGCAGTGCG GTCGCTGCGA GGCGGTGTGT CCGGCCTTCG CCGCCGGCCA GCCGCTCAAT CCCAAGAAAC TGGTGCAGGA CATGGTGGTG GGCATGGTCG GCGGCAGTGA CGCCCGCTAC GCCGGCAGTC CATACCCGGG CAAGCCGGTC GGCGAGCACG CCGGCGACCC CCACGGGCCG ATCGTCGCCC GCGAGGGCAC CGCCCTGGTC GACGCCGAGA CGCTGTGGTC GTGCACCACC TGTCGCGCCT GCGTCGAGGA GTGCCCGATG ATGATCGAGC ATGTCGATGC CATCGTCGAC ATGCGCCGCC ACCTGACCCT GGAAGAAGGC GCGACGCCCG GCAAGGGCGC CGAGGTCATC GACAACCTGA TCGCCACCGA CAACCCGGGC GGTTTCGATC CCGGGGGACG ACTCAACTGG GCGGCGGATC TCGACCTGCC GTTGATGGCC GACGTGGAGC GCGCCGAGGT GCTGCTGTGG CTGGGCGATG GCGTCTTCGA CATGCGCAAT CAGCGCACGC TGCGTGCCTT GATCAAGGTG CTGCGTGCTG CCGACGTCGA TTTCGCGGTG CTCGGCAATG AAGAGCGCGA CAGCGGAGAC GTGGCACGCC GTCTGGGAGA CGAGGCCACC TTTCAGTCGC TGGCGCGGCG CAATATCGAC ACGCTGTCGC GATATCGCTT CGAGAGTATC GTGACCTGCG ATCCGCACAG CTTTCACGTG CTCAAGAACG AATACGGCGC CCTCTATCCG CAAGGGCAAG ACGCCGACTA CCCGGTATGG CATCACAGCA CTTTCATCAA TCAGTTGATC GAAAGCGGGC GGTTGCCGTT GGCGCCGGGA CAGGCGCAGC GCGTGACCTA CCACGACCCC TGCTATCTGG GCCGCTACAA CGGTGAATAC GAAGCACCGC GCGCGGTGCT GCGCGCCCTG GGCATGGAGC TGGTCGAGAT GCAGCGTTCG GGATACCGCT CGCGTTGCTG CGGCGGCGGC GGTGGCGCGC CGATCACCGA CGTTCCCGGC AAGCAGCGTA TCCCCGACAT GCGCATGGGC GACGTGCGCG AGACTCAGGC CGAGCAGGTC GTCGTGGGAT GTCCGCAATG CACGGCCATG CTCGAAGGCG TTGTCCCACC GGCGGGTAAC GAGGCGACCG CCGTCAAGGA CATCGCCGAG ATGGTTGCCG CCGCGCTGGA TAACACGCCA CCGGCGACAC CCGCGTCGCA TGACACTGCG GCCTCGCAGG CCACGGAGGA GGTGCTGTCA TGA
|
Protein sequence | MLDILLPILI FSALALAAIG AVRRIRLWRQ GRPSPVPVLR GLAAVPRRYL VDLHHVVARD KVMSNTHVAT AGGFVAAMGL AIVVHGLGLA EGVLGWLLLA ASATMFAGSL FVARRRRNPP ARLSKGPWMR LPKSLMAFSL GIFVVTLPAV GVLPADAGGW LVALVLAAVV AWGLAELVFG MTWGGPMKHA FAGALHLAFH RRPARFSDKR GGEGRSTGLK ALDLDDADAP LGVEKPADFT WNQLLGFDAC VQCGRCEAVC PAFAAGQPLN PKKLVQDMVV GMVGGSDARY AGSPYPGKPV GEHAGDPHGP IVAREGTALV DAETLWSCTT CRACVEECPM MIEHVDAIVD MRRHLTLEEG ATPGKGAEVI DNLIATDNPG GFDPGGRLNW AADLDLPLMA DVERAEVLLW LGDGVFDMRN QRTLRALIKV LRAADVDFAV LGNEERDSGD VARRLGDEAT FQSLARRNID TLSRYRFESI VTCDPHSFHV LKNEYGALYP QGQDADYPVW HHSTFINQLI ESGRLPLAPG QAQRVTYHDP CYLGRYNGEY EAPRAVLRAL GMELVEMQRS GYRSRCCGGG GGAPITDVPG KQRIPDMRMG DVRETQAEQV VVGCPQCTAM LEGVVPPAGN EATAVKDIAE MVAAALDNTP PATPASHDTA ASQATEEVLS
|
| |