Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1444 |
Symbol | |
ID | 4570178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1645740 |
End bp | 1649105 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639766030 |
Product | hypothetical protein |
Protein accession | YP_911896 |
Protein GI | 119357252 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGATG TGATTGAAAT GGGATTTGTT GCCGATGACC GTCTGGCCGG TTTCCGGTTG CAACGTCTCG AGGTTTTCAA CTGGGGCACC TTTGACGGAA GGGTCTGGAC GCTAAAACTG GGAGGAAAAA ACGGCCTTCT CACCGGCGAT ATCGGTTCCG GAAAATCGAC GCTGGTCGAT GCCGTGACCA CCTTGCTGGT TCCAGGCCAG CGAATTGCCT ACAACAAGGC GGCAGGAGCC GATAACCGTG AACGCACGCT GCGCTCCTAT GTGCTCGGTT ACTTCAAATC TGAGCGGCAG GAGAGCCTTG GAGGCGGCGC CAAACCGGTA GCCCTGCGCG AATCCAACAG TTATTCGGTT ATTCTCGGCG TCTTTCATAA CGAAGGCTAC GACAAGACCG TCACCCTTGC GCAAATTTTC TGGATGAAAG ACGCATCACA GCCAGCCCGT CTTTACGCCG TCTGCGAGCG CGATTTATCT ATCGCCGCCG ACTTTTCGGC TTTTGGTACA GAAATATCGA CCCTGCGCAA GCGCTTGCGC GGGTCGGGCA TCGAGCTGTT CGAAAGCTTT CCCCCATACG GAGCCTGGTT CCGCCGCCGC TTCGGTATCG ACAATGAACA GGCGCTTGAT CTCTTTCATC AAACCGTATC GCTCAAGTCG GTGGGGAACC TGACCGATTT CGTACGCCTC CATATGCTTG AGCCCTTCAA TGTCGAGCCG CGCATTGCCG CTCTTATCCA CCATTTCGAA GATCTCAACC GGGCACACGA AGCTGTTCTC AAAGCGAAGC GGCAAATAGA GATGCTCGCT CCTCTGGTGG ATGATTGCGA TCATCATCAA ACCCTCGTGC GAACAACCGA AGAGCTGCGA GCCTCTCGCG ACAGCCTGCG CCACTGGTTC GCATCCCTGA AACTCGAATT GCTCGAAAAA CGCCTTGTCT CGCTGGATGA AGAGCTGAGT CGTCACCATA TAGCCATCGA ACGTCTTGAC ACAGAGCGTC GCAGCCAACA GGGTCGTGAT CGCGAGCTTC GCCGAACCAT CGCCGAAAAC GGCGGCGACA GGATCGAGAG TATCTCGGCA GAGATTCGTC AGAAACAGGA AGAGCTTGAA CGGCGCAATC AGAAAGCTGC ACGATATGAA GAGCTTGCCC GCCTGCTTGG GGAGCATCCG GCAGCGACAG CCGAGGAATT TCATAGTCAG CAAGCCGGTC ATTCAGCCAT GCGAGACGCA ACGGCTGAAG TTGAAGCCCA GGTTCAAAAC AATCTCAATG AAGCGGGCGT TCTCTTTACC CAGGGGCGTC ATGAGCATGA GCAACTTACC GAAGAGATCA AGGGGTTGAA AGCCCGTATG AGCAATATCG ACGAAAAGCA GGTTGCCATG CGTCGCTCGC TCTGTGAGGC GCTCAATCTT TCTGAAAAAG AGATGCCGTT TGCCGGCGAG TTGCTTCAGG TTCGGGAGCA AGAGCAGCTC TGGGAAGGAG CCATCGAGCG TCTGCTTCGC AATTTCGGCC TGTCGCTGCT GGTGTCCGAC CATCACTACC CGAAGGTGGC GGAGTGGGTG GAACAAACCA ATCTGAAAGG TCGTCTGGTC TATTTTCGCG TACGTCAGCA CTCTCGAAGC GAACAGTTGC CGGATCATCC GGCCTCGCTG GCGCGCAAGC TCGCCATCAA AGCGGATTCC CCCTGGTTTG ACTGGCTGGA AAGGGAGGTC GCCCATCGTT TTGATCTGGT TTGCTGCATG AATCAGGAAG AGTTTCGAAG GGAGAAAAAA GCTCTCACCC TCGGCGGCCA GATCAAGTCA CCGGGTGAAC GCCACGAAAA AGACGATCGC CATCGGCTTG ACGATCGCAG CCGCTACGTG CTCGGATGGA GCAATGCCGC CAAAATCGCG ACATTGGAAG AGAAGGCCGG GCGGCAAAAA AGAGATCTTG CCGAACTTGC AGGTCGCATA AGCACGCTCC AGCAAGAGCA AACAAAACTC AAGGAGCGCC TGACCATTCT CTCAAAGCTC GACGAGTATT CCGATTTTCG CGATCTCGAC TGGCAGCCGG TGGCCATGGC CATAGCCAGG CTTGAAACTG AAAAACGCGA TCTTGAAGCG ACCTCGAATT TTCTGCAAAC CCTTGCTTCG CAACTTGCCG CCCTCGAAGA AGAGATGCGG GAAACGGAGC GGCTACTTGA CGACAGAAAA GACAAACGGT CGAAAATCGA ACAGAAAATC ACTGTCATCA AGGAGTTACA GCAGCAGACG CATACGCTGC TTTCTGAAGC GGGAGCCGAA TCCGCTTCCC GATTCGCCGC GCTCCGGCAG ATGCGCAGTG ACGCTTTCGG CGACCAGTCG GTGACGATCG AGTCGTGCGA CAATCGCGAA AGAGAGATGC GCGACTGGTT GCAGACAAAA ATCGACGCCG AAAACGGGAA AATTTCCCGA CTCAGCCAAA AGATCATCAA GGCGATGACC GAATACAAGG AAGAGTGGAA ACTTGAGACT CGCGAAGTCG ATGTCAACAT TGCCGCCGGC TCGGAATATC GCTCAATGTT TGAACAACTT CGGGCCGACG ATCTGCCTCG CTTCGAGGGG CGGTTCAAGG AGCTGCTCAA CGAAAACACT ATCCGCGAAG TCGCCAATTT TCAGTCGCAA CTGGCCCGCG AACGGGAGAC CATCAAGGAA CGCATTACCC GGCTCAATGA ATCGCTCACG CAAATCGACT TCAATGCGGG TCGATACATC ACCCTCGAAG CACAACTGAA CCTCGACGCC GATATCCGTG ATTTTCAGTC GGAGCTGCGA GCCTGCACCG AAGGTTCTCT TACCGGTTCG GACGACGCCC AGTACTCCGA AGCCAAATTC CTCCAGGTGC GGCGAATCAT CGATCGCTTC CGTGGTCGTG AAGCCTATGC CGACCTCGAT CGGCGCTGGA CAGCCAAAGT CACCGACGTA CGCAACTGGT TCGTTTTTGC CGCCAGCGAA CGGTGGCGTG AAGATGACAG CGAACACGAG CACTACGCCG ACTCGGGGGG GAAATCCGGA GGACAGAAAG AGAAGCTCGC CTACACGGTG CTTGCCGCCA GTCTTGCCTA TCAATTCGGT TTGGAGTGGG GCGCCGTACG CTCCCGTTCG TTCCGCTTCG TTGTCATTGA CGAAGCCTTC GGACGCGGTT CCGACGAATC GGCGCAGTAC GGCCTGCAAC TCTTCGCCCA GCTTAACCTG CAACTCCTTA TCGTCACCCC TTTACAGAAA ATCCATATCA TCGAACCCTT CGTCTCCAGC GTAGGCTTTG TGCACAATCA GGAAGGGCGC TGCTCGGTAT TGCGCAACCT CACCATCGAA GAGTATCGCG CCGAAAAAGA GAAGGCGGCA GAATGA
|
Protein sequence | MSDVIEMGFV ADDRLAGFRL QRLEVFNWGT FDGRVWTLKL GGKNGLLTGD IGSGKSTLVD AVTTLLVPGQ RIAYNKAAGA DNRERTLRSY VLGYFKSERQ ESLGGGAKPV ALRESNSYSV ILGVFHNEGY DKTVTLAQIF WMKDASQPAR LYAVCERDLS IAADFSAFGT EISTLRKRLR GSGIELFESF PPYGAWFRRR FGIDNEQALD LFHQTVSLKS VGNLTDFVRL HMLEPFNVEP RIAALIHHFE DLNRAHEAVL KAKRQIEMLA PLVDDCDHHQ TLVRTTEELR ASRDSLRHWF ASLKLELLEK RLVSLDEELS RHHIAIERLD TERRSQQGRD RELRRTIAEN GGDRIESISA EIRQKQEELE RRNQKAARYE ELARLLGEHP AATAEEFHSQ QAGHSAMRDA TAEVEAQVQN NLNEAGVLFT QGRHEHEQLT EEIKGLKARM SNIDEKQVAM RRSLCEALNL SEKEMPFAGE LLQVREQEQL WEGAIERLLR NFGLSLLVSD HHYPKVAEWV EQTNLKGRLV YFRVRQHSRS EQLPDHPASL ARKLAIKADS PWFDWLEREV AHRFDLVCCM NQEEFRREKK ALTLGGQIKS PGERHEKDDR HRLDDRSRYV LGWSNAAKIA TLEEKAGRQK RDLAELAGRI STLQQEQTKL KERLTILSKL DEYSDFRDLD WQPVAMAIAR LETEKRDLEA TSNFLQTLAS QLAALEEEMR ETERLLDDRK DKRSKIEQKI TVIKELQQQT HTLLSEAGAE SASRFAALRQ MRSDAFGDQS VTIESCDNRE REMRDWLQTK IDAENGKISR LSQKIIKAMT EYKEEWKLET REVDVNIAAG SEYRSMFEQL RADDLPRFEG RFKELLNENT IREVANFQSQ LARERETIKE RITRLNESLT QIDFNAGRYI TLEAQLNLDA DIRDFQSELR ACTEGSLTGS DDAQYSEAKF LQVRRIIDRF RGREAYADLD RRWTAKVTDV RNWFVFAASE RWREDDSEHE HYADSGGKSG GQKEKLAYTV LAASLAYQFG LEWGAVRSRS FRFVVIDEAF GRGSDESAQY GLQLFAQLNL QLLIVTPLQK IHIIEPFVSS VGFVHNQEGR CSVLRNLTIE EYRAEKEKAA E
|
| |