Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_0732 |
Symbol | |
ID | 6356013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 800613 |
End bp | 801680 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642668357 |
Product | protein of unknown function DUF403 |
Protein accession | YP_001942792 |
Protein GI | 189346263 |
COG category | [S] Function unknown |
COG ID | [COG2307] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAGCC GTGTTGCCGA ATCGCTTTTC TGGATGAGCC GCTATGTCGA GCGGGCTGAA AACACCGCGA GATTTCTTGA GGTCAATTTC AACCTGTTGC TGGATCTGAA CGATATTGCC ATAGTCGATC ATCCTAACTA CTGGAACCCG CTTATTCGGG TCTCAGGAGA TCCGGATAAC TTCGTCGAGC ATTATACGGA GTATAACGCC CATACGGTTA CCGATTATCT GGTCTTCAAC CGCCAGAACA GCAACTCCAT CAACTCTTCA ATCGGAATGG CCAGGGAGAA TGCCCGAAGC ATTATAGACA GCATATCGAG CGAAATGTGG GAGCAGATCA ACAATCTCTA CCATTTCCTG CAGAGCATGA CTCCGCAGCA GGTTCACAAC GATCCCTTCA CGTTCTACAA AGAGATCAAG AACGCATCAC ACCTGTTTCA GGGGATCACC GACAATATCT TTTCCAGAAC CGAAGGATGG GATTTCATCC AGATCGGAAA GTATCTTGAA CGGGCCGATA ACGCGGCAAG GCTCATCGAC GTCAAATATC ACATGCTTAT GCCGAAAAAC GGCATTGAAC ATGATCCGGT GCTCGATTCG TTCGACACCA TCCAGTGGAT GGCTGTATTG AAAAGCTGCA GTGCACTTGA AGCCTTCAGA AAAGTCTTCC TGTCAAAAAT CGATCCGGAA AACATTCTCG GGTTTCTGGT GCTTGACCGT ACGTTTCCCC GCAGCATAGC CTTTTCAGTC TGTGCCGCCC AGGAAGCTCT GTGGAGGATT TCCGGGAGTT CCCGGCACCG CTACGCCAAC AACGCTGATC GGCTGATCGG CAAAATGGAA GCGGAACTCA GTTACACCAC CGTTGACGAT ATGTACAGAA AAGGACTCCA TGACTTTCTT GTCGATATCG AACACGGACT CATCAGAATC GGAGAACAGA TTCATCTGCT TTATTTTGCC TATCACACTC CGAAAATCGA ACCTCATGAT ATATCCGAAG CACTCCCCTT CACCGGTATC GCCGGAGGCC GGGCCAACTG GAGCCAGTCG CAGCAGCAAC AGCAATAA
|
Protein sequence | MLSRVAESLF WMSRYVERAE NTARFLEVNF NLLLDLNDIA IVDHPNYWNP LIRVSGDPDN FVEHYTEYNA HTVTDYLVFN RQNSNSINSS IGMARENARS IIDSISSEMW EQINNLYHFL QSMTPQQVHN DPFTFYKEIK NASHLFQGIT DNIFSRTEGW DFIQIGKYLE RADNAARLID VKYHMLMPKN GIEHDPVLDS FDTIQWMAVL KSCSALEAFR KVFLSKIDPE NILGFLVLDR TFPRSIAFSV CAAQEALWRI SGSSRHRYAN NADRLIGKME AELSYTTVDD MYRKGLHDFL VDIEHGLIRI GEQIHLLYFA YHTPKIEPHD ISEALPFTGI AGGRANWSQS QQQQQ
|
| |