Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0121 |
Symbol | |
ID | 6373765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 112854 |
End bp | 114128 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642682635 |
Product | hypothetical protein |
Protein accession | YP_001958582 |
Protein GI | 189499112 |
COG category | [S] Function unknown |
COG ID | [COG4198] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00158567 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0307809 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAA TCCGTCCTTT CAAAGGTTTG CGGTACGATC CTGAAACCGC AGGTGACATG GGGAGCATCA TCTGCCCGCC CTATGATATT ATACCTCCGT CGATGCAGCA GGAACTGTAC GACAGTTCAG CCTACAATGC GGTTCGGCTT GAGTTGCCGA AGGAAGACGA TCCGTATGGT GCCGCTGCGG AACGGCTGGC GCAGTGGATG CGGGAGGGGG CTCTGATGCA GGACGACGAA CCTGCGCTCT ATCCCTATTT CCAGACCTAT ACTGACCCTG AAGGAAATAC CTATACCCGG AAAGGCTTTT TCTGCGCCTT GCGTCTGCAC GAGTTCAGCG AAAAAAAGGT GCTGCCTCAT GAGCGGACAC TATCGGGGCC GAAAAAAGAT CGCCTGAACC TGTTCAAGCG GACACAGACA AATATCAGCA GTATTTTCGG TTTGTACGCC GATGACCAGA TGCAGGCGGA CAAGCTTATC GAGGAATATG CAGTATCGCA TGAACCGGTA GTCGATGCCG CATTCCAGGG GGTGCGGAAC CGGCTCTGGA AGATTACCGA TCCTGATATT ACCGGTAAGG TGCAGCAGGT TCTGCTGGAG CGTCAGGTCT ATATAGCCGA CGGTCATCAT CGCTACGAGA CAGGAGTGAA CTATCGTAAT CTTCGGGCGG AGGAAAATCC TTCGCACACC GGAGAGGAAC CATACAATTT CATCCTTGTC TATCTCGCCA ACATCTTCGA CAGAGGCTTG ATCATTTTTC CGCTGCACAG GATGGTACAC AGCCTTGATA CCTTCAATGC CGATGCGCTC TTCGATGCGC TTGGCGCAAA CTTTTCGATA ACCCCTCTCA GCGGCAGGGA CGAACTGAAG CGCTACCTCG ATGGAGAAAC GTCAAACCAT GCCTATGGTG TCGTGACTTC TACGGGGGTC TGGGGCATCA GGCTGAAAAC TCCTCCGGAA ACCCTGCTCG GTGACGCTGT TCCCGCTCCT CTGCTGCAAC TCAGCGTGGT CGTGCTGCAC GAACTGATTC TGCAGCGTGT GCTTGGCATT ACGCCGGAGG CGATGCGCAG CCAGACCAAT CTTGTGTATA TCGAGGACGA CCGTGAGGTT TTCGAGAACG TTTCCAACGG CAAGATGCAG GTCGGGTTTG TGGTTAAACC GACCACGGTA GAGCAGGTTC GGGATATTTC GCAGGCCGGC GAGGTTATGC CGCAGAAGTC CACGTACTTT TATCCGAAGA TTATGACAGG TTTAGTCATG CATAGGCTTA AGTAA
|
Protein sequence | MPEIRPFKGL RYDPETAGDM GSIICPPYDI IPPSMQQELY DSSAYNAVRL ELPKEDDPYG AAAERLAQWM REGALMQDDE PALYPYFQTY TDPEGNTYTR KGFFCALRLH EFSEKKVLPH ERTLSGPKKD RLNLFKRTQT NISSIFGLYA DDQMQADKLI EEYAVSHEPV VDAAFQGVRN RLWKITDPDI TGKVQQVLLE RQVYIADGHH RYETGVNYRN LRAEENPSHT GEEPYNFILV YLANIFDRGL IIFPLHRMVH SLDTFNADAL FDALGANFSI TPLSGRDELK RYLDGETSNH AYGVVTSTGV WGIRLKTPPE TLLGDAVPAP LLQLSVVVLH ELILQRVLGI TPEAMRSQTN LVYIEDDREV FENVSNGKMQ VGFVVKPTTV EQVRDISQAG EVMPQKSTYF YPKIMTGLVM HRLK
|
| |