Gene Cpha266_1444 details


Gene Information

Locus tag: Cpha266_1444
Symbol:
ID: 4570178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1645740
End bp: 1649105
Gene Length: 3366 bp
Protein Length: 1121 aa
Translation table: 11
GC content: 55%
IMG OID: 639766030
Product: hypothetical protein
Protein accession: YP_911896
Protein GI: 119357252
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
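
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_len = span_length(1645740, 1649105)
print(gene_len)                  # 3366
print(protein_length(gene_len))  # 1121
```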


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATG TGATTGAAAT GGGATTTGTT GCCGATGACC GTCTGGCCGG TTTCCGGTTG 
CAACGTCTCG AGGTTTTCAA CTGGGGCACC TTTGACGGAA GGGTCTGGAC GCTAAAACTG
GGAGGAAAAA ACGGCCTTCT CACCGGCGAT ATCGGTTCCG GAAAATCGAC GCTGGTCGAT
GCCGTGACCA CCTTGCTGGT TCCAGGCCAG CGAATTGCCT ACAACAAGGC GGCAGGAGCC
GATAACCGTG AACGCACGCT GCGCTCCTAT GTGCTCGGTT ACTTCAAATC TGAGCGGCAG
GAGAGCCTTG GAGGCGGCGC CAAACCGGTA GCCCTGCGCG AATCCAACAG TTATTCGGTT
ATTCTCGGCG TCTTTCATAA CGAAGGCTAC GACAAGACCG TCACCCTTGC GCAAATTTTC
TGGATGAAAG ACGCATCACA GCCAGCCCGT CTTTACGCCG TCTGCGAGCG CGATTTATCT
ATCGCCGCCG ACTTTTCGGC TTTTGGTACA GAAATATCGA CCCTGCGCAA GCGCTTGCGC
GGGTCGGGCA TCGAGCTGTT CGAAAGCTTT CCCCCATACG GAGCCTGGTT CCGCCGCCGC
TTCGGTATCG ACAATGAACA GGCGCTTGAT CTCTTTCATC AAACCGTATC GCTCAAGTCG
GTGGGGAACC TGACCGATTT CGTACGCCTC CATATGCTTG AGCCCTTCAA TGTCGAGCCG
CGCATTGCCG CTCTTATCCA CCATTTCGAA GATCTCAACC GGGCACACGA AGCTGTTCTC
AAAGCGAAGC GGCAAATAGA GATGCTCGCT CCTCTGGTGG ATGATTGCGA TCATCATCAA
ACCCTCGTGC GAACAACCGA AGAGCTGCGA GCCTCTCGCG ACAGCCTGCG CCACTGGTTC
GCATCCCTGA AACTCGAATT GCTCGAAAAA CGCCTTGTCT CGCTGGATGA AGAGCTGAGT
CGTCACCATA TAGCCATCGA ACGTCTTGAC ACAGAGCGTC GCAGCCAACA GGGTCGTGAT
CGCGAGCTTC GCCGAACCAT CGCCGAAAAC GGCGGCGACA GGATCGAGAG TATCTCGGCA
GAGATTCGTC AGAAACAGGA AGAGCTTGAA CGGCGCAATC AGAAAGCTGC ACGATATGAA
GAGCTTGCCC GCCTGCTTGG GGAGCATCCG GCAGCGACAG CCGAGGAATT TCATAGTCAG
CAAGCCGGTC ATTCAGCCAT GCGAGACGCA ACGGCTGAAG TTGAAGCCCA GGTTCAAAAC
AATCTCAATG AAGCGGGCGT TCTCTTTACC CAGGGGCGTC ATGAGCATGA GCAACTTACC
GAAGAGATCA AGGGGTTGAA AGCCCGTATG AGCAATATCG ACGAAAAGCA GGTTGCCATG
CGTCGCTCGC TCTGTGAGGC GCTCAATCTT TCTGAAAAAG AGATGCCGTT TGCCGGCGAG
TTGCTTCAGG TTCGGGAGCA AGAGCAGCTC TGGGAAGGAG CCATCGAGCG TCTGCTTCGC
AATTTCGGCC TGTCGCTGCT GGTGTCCGAC CATCACTACC CGAAGGTGGC GGAGTGGGTG
GAACAAACCA ATCTGAAAGG TCGTCTGGTC TATTTTCGCG TACGTCAGCA CTCTCGAAGC
GAACAGTTGC CGGATCATCC GGCCTCGCTG GCGCGCAAGC TCGCCATCAA AGCGGATTCC
CCCTGGTTTG ACTGGCTGGA AAGGGAGGTC GCCCATCGTT TTGATCTGGT TTGCTGCATG
AATCAGGAAG AGTTTCGAAG GGAGAAAAAA GCTCTCACCC TCGGCGGCCA GATCAAGTCA
CCGGGTGAAC GCCACGAAAA AGACGATCGC CATCGGCTTG ACGATCGCAG CCGCTACGTG
CTCGGATGGA GCAATGCCGC CAAAATCGCG ACATTGGAAG AGAAGGCCGG GCGGCAAAAA
AGAGATCTTG CCGAACTTGC AGGTCGCATA AGCACGCTCC AGCAAGAGCA AACAAAACTC
AAGGAGCGCC TGACCATTCT CTCAAAGCTC GACGAGTATT CCGATTTTCG CGATCTCGAC
TGGCAGCCGG TGGCCATGGC CATAGCCAGG CTTGAAACTG AAAAACGCGA TCTTGAAGCG
ACCTCGAATT TTCTGCAAAC CCTTGCTTCG CAACTTGCCG CCCTCGAAGA AGAGATGCGG
GAAACGGAGC GGCTACTTGA CGACAGAAAA GACAAACGGT CGAAAATCGA ACAGAAAATC
ACTGTCATCA AGGAGTTACA GCAGCAGACG CATACGCTGC TTTCTGAAGC GGGAGCCGAA
TCCGCTTCCC GATTCGCCGC GCTCCGGCAG ATGCGCAGTG ACGCTTTCGG CGACCAGTCG
GTGACGATCG AGTCGTGCGA CAATCGCGAA AGAGAGATGC GCGACTGGTT GCAGACAAAA
ATCGACGCCG AAAACGGGAA AATTTCCCGA CTCAGCCAAA AGATCATCAA GGCGATGACC
GAATACAAGG AAGAGTGGAA ACTTGAGACT CGCGAAGTCG ATGTCAACAT TGCCGCCGGC
TCGGAATATC GCTCAATGTT TGAACAACTT CGGGCCGACG ATCTGCCTCG CTTCGAGGGG
CGGTTCAAGG AGCTGCTCAA CGAAAACACT ATCCGCGAAG TCGCCAATTT TCAGTCGCAA
CTGGCCCGCG AACGGGAGAC CATCAAGGAA CGCATTACCC GGCTCAATGA ATCGCTCACG
CAAATCGACT TCAATGCGGG TCGATACATC ACCCTCGAAG CACAACTGAA CCTCGACGCC
GATATCCGTG ATTTTCAGTC GGAGCTGCGA GCCTGCACCG AAGGTTCTCT TACCGGTTCG
GACGACGCCC AGTACTCCGA AGCCAAATTC CTCCAGGTGC GGCGAATCAT CGATCGCTTC
CGTGGTCGTG AAGCCTATGC CGACCTCGAT CGGCGCTGGA CAGCCAAAGT CACCGACGTA
CGCAACTGGT TCGTTTTTGC CGCCAGCGAA CGGTGGCGTG AAGATGACAG CGAACACGAG
CACTACGCCG ACTCGGGGGG GAAATCCGGA GGACAGAAAG AGAAGCTCGC CTACACGGTG
CTTGCCGCCA GTCTTGCCTA TCAATTCGGT TTGGAGTGGG GCGCCGTACG CTCCCGTTCG
TTCCGCTTCG TTGTCATTGA CGAAGCCTTC GGACGCGGTT CCGACGAATC GGCGCAGTAC
GGCCTGCAAC TCTTCGCCCA GCTTAACCTG CAACTCCTTA TCGTCACCCC TTTACAGAAA
ATCCATATCA TCGAACCCTT CGTCTCCAGC GTAGGCTTTG TGCACAATCA GGAAGGGCGC
TGCTCGGTAT TGCGCAACCT CACCATCGAA GAGTATCGCG CCGAAAAAGA GAAGGCGGCA
GAATGA
 
Protein sequence
MSDVIEMGFV ADDRLAGFRL QRLEVFNWGT FDGRVWTLKL GGKNGLLTGD IGSGKSTLVD 
AVTTLLVPGQ RIAYNKAAGA DNRERTLRSY VLGYFKSERQ ESLGGGAKPV ALRESNSYSV
ILGVFHNEGY DKTVTLAQIF WMKDASQPAR LYAVCERDLS IAADFSAFGT EISTLRKRLR
GSGIELFESF PPYGAWFRRR FGIDNEQALD LFHQTVSLKS VGNLTDFVRL HMLEPFNVEP
RIAALIHHFE DLNRAHEAVL KAKRQIEMLA PLVDDCDHHQ TLVRTTEELR ASRDSLRHWF
ASLKLELLEK RLVSLDEELS RHHIAIERLD TERRSQQGRD RELRRTIAEN GGDRIESISA
EIRQKQEELE RRNQKAARYE ELARLLGEHP AATAEEFHSQ QAGHSAMRDA TAEVEAQVQN
NLNEAGVLFT QGRHEHEQLT EEIKGLKARM SNIDEKQVAM RRSLCEALNL SEKEMPFAGE
LLQVREQEQL WEGAIERLLR NFGLSLLVSD HHYPKVAEWV EQTNLKGRLV YFRVRQHSRS
EQLPDHPASL ARKLAIKADS PWFDWLEREV AHRFDLVCCM NQEEFRREKK ALTLGGQIKS
PGERHEKDDR HRLDDRSRYV LGWSNAAKIA TLEEKAGRQK RDLAELAGRI STLQQEQTKL
KERLTILSKL DEYSDFRDLD WQPVAMAIAR LETEKRDLEA TSNFLQTLAS QLAALEEEMR
ETERLLDDRK DKRSKIEQKI TVIKELQQQT HTLLSEAGAE SASRFAALRQ MRSDAFGDQS
VTIESCDNRE REMRDWLQTK IDAENGKISR LSQKIIKAMT EYKEEWKLET REVDVNIAAG
SEYRSMFEQL RADDLPRFEG RFKELLNENT IREVANFQSQ LARERETIKE RITRLNESLT
QIDFNAGRYI TLEAQLNLDA DIRDFQSELR ACTEGSLTGS DDAQYSEAKF LQVRRIIDRF
RGREAYADLD RRWTAKVTDV RNWFVFAASE RWREDDSEHE HYADSGGKSG GQKEKLAYTV
LAASLAYQFG LEWGAVRSRS FRFVVIDEAF GRGSDESAQY GLQLFAQLNL QLLIVTPLQK
IHIIEPFVSS VGFVHNQEGR CSVLRNLTIE EYRAEKEKAA E
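
The gene and protein sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such blocks might be cleaned, translated, and measured for GC content follows. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence shown above.
first_row = "ATGAGTGATG TGATTGAAAT GGGATTTGTT GCCGATGACC GTCTGGCCGG TTTCCGGTTG"
print(translate(first_row))  # MSDVIEMGFVADDRLAGFRL
# Last six bases of the gene sequence: final residue plus stop codon.
print(translate("GAATGA"))   # E*
```

The first 20 residues produced from the first row match the start of the protein sequence listed above, and the final codon TGA is a stop, which is why the 3366 bp gene yields 1121 residues.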