Gene Information |
Locus tag | Cphamn1_1072 |
Symbol | |
ID | 6374746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1161026 |
End bp | 1163164 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642683574 |
Product | protein of unknown function DUF255 |
Protein accession | YP_001959492 |
Protein GI | 189500022 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
|
|
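The length fields in the table above can be cross-checked against the coordinates. The sketch below is only a consistency check, and it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1161026
end_bp = 1163164

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so remove one codon (3 bp) before dividing by the codon size.
protein_length_aa = (gene_length_bp - 3) // 3

print(gene_length_bp, protein_length_aa)  # 2139 712
```

Both results agree with the Gene Length (2139 bp) and Protein Length (712 aa) fields.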
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0898725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAAG AAAAACGACG CCCGAACCTT CTCGCTGAAG AAACCAGCCC ATACCTGCTA CAACACGCGT ACAATCCGGC AGCATGGTAT CCATGGGGGG AAGAAGCTTT TGAAAAAGCA CGTAATGAGG ATAAACCGGT TTTTCTCTCA GTCGGCTATT CCACCTGCCA CTGGTGCCAT GTGATGGAGC GGGAGTCATT TGAAAACGAC CGTATCGCCG AGCTCCTCAA CAGGGCTTTC GTGCCGGTCA AGGTTGATCG GGAAGAAAGG CCGGATATTG ACAGATTGTA TATGACCTAT GTACAGGCAA CGACCGGAAG CGGCGGATGG CCGATGTCCG TATGGCTCAC TCCGGATCTC AAACCGTTTT TCGGAGGCAG CTATTTTCCT CCTGAAGACC GATACGGAAA ACCCGGGTTT CATTCCCTCC TGCTCTCGAT CGAAAGGGCT TGGAAAGAGG ATCGCAACCG GTTTCTTTCT GCAGCGGAAG GGATGACCGA GCAGCTTGAA GCCCTCTCTC TGCAGAAACC CGAGACGGTT CCTCTGGATG AACAGGTGTT CCATCATGCG GCCAAAACGT TTGCCGGAAT GTTCGACAAG GAGGATGGAG GTTTCGGCAA TGCGCCGAAA TTCCCTCAGC CGTCTATTCT TGAATTTCTT CTTGCCTACT CTTATTTCAC TGGCAATCAA GAAGCAAAGG AGATGGTTCT GCTCTCCTTA AGGAAAATGG CTTCCGGCGG CATTCATGAC CATTTGGGAA TAAAAAATCT TGGCGGAGGA GGATTTGCAC GCTACTCGAC CGATGTACGG TGGCATGTTC CGCATTTCGA AAAAATGCTC TACGACAACG CTCAGCTTGC GGTTGTAGCT ACAGAAGCCT ACCAGATTAC CGGCGAAAAC CTCTATGCGA ATCTTGCCGA CGATATCCTG AACTATGTTC TCTGCGATAT GACCGACAAC AAAGGCGGCT TCTATTCTGC AGAAGACGCC GACAGTTTTC CGAACAGCAA GAGCAAGGCT AAAAAAGAAG GAGCTTTTTA CACCTGGTCC ATTCAGGAAA TCACGGCAAA GCTTGATCCG CTTGAAACAG ATATCTTCTG TTTTATATAC GGTGTTGAGT CTGATGGAAA CGCCCTGGAC GATCCGCATC TGGAGTTTAC CGGGCGAAAC ATACTTTTTG CCAGAAATGA TATTGAAGCT GCCGCCGCAC AGTTCTCCAT GCCTTCGGAA ATCATCCGCG AAATCACGGA CGATGCACGG GAAAAACTGT TCCATTCAAG GAATGACAGA CCACGGCCGC ATCTGGACGA CAAAATTCTC ACATCGTGGA ATGGTCTGAT GATCTCAGCC CTGTCAAAAG CCTCTTGCGT GCTTCGCAGC CAAAACTATC TTGACGCGGC TCTTAAAGCC GCCGAGTTTA TCCTCAACAA TCTTTACAGC ACAACTGATG GGAGACTGCT TCGACGATAC CGTAGCGGCC AGGCCGGTAT CGGGGGCAAA GCCGATGATT ACTCCTTTTT CATACAGGGG CTACTGGACC TTTACGAAGC CTCATCAGAA CATCGCTACT TGAGCAATGC GGTCAAACTT ATGGAAAAAC AGATTGAACT TTTTTTCGAT GATAAGTCGG GTGGTTTTTT CAATGCAGCG TCGGACGACT CATCCGTTCC GATACGCATG AAAGAGGATT ACGACGGTGC CGAGCCTTCT CCAAATTCAA TAAACACTTT TTCTCTCTAC CGGTTGGCGG ATATGATGGA TCGCGATGAT TTCAGAGAAA TCGCGGATAA AACCATTGCC 
TATTTCAGTA AATCGCTTAA GGAAAACGGA CGTCAACTAC CTTGTCTGCT CAAAACGGCA ATGCTTCCCT TTTATGGAAC ACGGCAGGTT ATTCTTACCG GTGAACGGCA CAACGAAACA ATGAAAAACC TTGAAAATAC GCTTGGCGAA ATGTATCTGC CTGACATGTT CATCATACAC GCATCAGGCA ACAACGCTGA AAACACTGAT TTTCTCAAAA AAATCACGCT TAAATCCACA GGAAATGCAG CTTACGTTTG CAGCAATCAA ACATGCAACC TGCCGGCATA CTCCGCAAAA GAGCTTCGGA AGATTTTTTC AGCAAAAAAT CAGAAATAG
|
Protein sequence | MPQEKRRPNL LAEETSPYLL QHAYNPAAWY PWGEEAFEKA RNEDKPVFLS VGYSTCHWCH VMERESFEND RIAELLNRAF VPVKVDREER PDIDRLYMTY VQATTGSGGW PMSVWLTPDL KPFFGGSYFP PEDRYGKPGF HSLLLSIERA WKEDRNRFLS AAEGMTEQLE ALSLQKPETV PLDEQVFHHA AKTFAGMFDK EDGGFGNAPK FPQPSILEFL LAYSYFTGNQ EAKEMVLLSL RKMASGGIHD HLGIKNLGGG GFARYSTDVR WHVPHFEKML YDNAQLAVVA TEAYQITGEN LYANLADDIL NYVLCDMTDN KGGFYSAEDA DSFPNSKSKA KKEGAFYTWS IQEITAKLDP LETDIFCFIY GVESDGNALD DPHLEFTGRN ILFARNDIEA AAAQFSMPSE IIREITDDAR EKLFHSRNDR PRPHLDDKIL TSWNGLMISA LSKASCVLRS QNYLDAALKA AEFILNNLYS TTDGRLLRRY RSGQAGIGGK ADDYSFFIQG LLDLYEASSE HRYLSNAVKL MEKQIELFFD DKSGGFFNAA SDDSSVPIRM KEDYDGAEPS PNSINTFSLY RLADMMDRDD FREIADKTIA YFSKSLKENG RQLPCLLKTA MLPFYGTRQV ILTGERHNET MKNLENTLGE MYLPDMFIIH ASGNNAENTD FLKKITLKST GNAAYVCSNQ TCNLPAYSAK ELRKIFSAKN QK
|
| |
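The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes that the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database API.

```python
# Build the codon table from the NCBI-style ordering of bases (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks and the final nine bases of the gene sequence.
first_blocks = "ATGCCGCAAG AAAAACGACG CCCGAACCTT"
last_codons = "CAGAAATAG"

print(translate(first_blocks))  # MPQEKRRPNL
print(translate(last_codons))   # QK
```

The two fragments reproduce the first ten residues and the last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the whole-gene value to compare with the GC content field; a short fragment will generally differ from it.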