Gene Cpha266_0635 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0635 
Symbol 
ID4569789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp719415 
End bp722333 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content49% 
IMG OID639765233 
Producthypothetical protein 
Protein accessionYP_911114 
Protein GI119356470 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATCAG ACTATCCAAT AAAATCACCC CGCAAGCTTA TTGAGGTTGC GCTTCCGCTG 
GATGCTATCA ATGCAGCTTG TGCCTATGAG AAAATGCCAG GGATAGGTGC TCATCCACGC
GGAATCCATC TTTGGTGGGC AAGGCGACCT CTTGCAGCCG CACGAGCGGT TCTGTTTGCG
CAACTTGTCA ACGACCCTGG TTACCAACAG GGTTGTGGTT TCAAATATGG CAAGAATAAA
AAGGAAGCTG CTATTGAACG AAAGCGACTG TTCAAGATTA TCGAAGAGCT TGTATTGTGG
GAAAATACCA CCAACGAGGA AGTTCTGGAA CGCGCCAGGG TTGAGATTCG CCGATCCTGG
CGGGAGGTGT GCGAACTGAA TAAAGAGCAT CCGCAGGCTG CCGAGTTGTT CAACCCGGAA
AAGATGCCTG CGTTTCATGA CCCTTTTGCT GGTGGTGGAG CTATTCCACT TGAAGCACAG
CGATTGGGAC TCGAAGCCTA CGCCTCCGAT CTGAATCCAG TTGCAGTGCT GATTAACAAA
GCAATGATTG AGATTCCGCC CAAGTTCGCT GGACGACCTC CGGTCGGACC AGAGATTGAG
AACAAACAGG GCAAAAGTCT CGAACTTCCA AAAATTTGGC CGGGAGCTAC CGGACTGGCA
GAGGATGTTC GCCGCTATGG GTCGTGGATG CGTGATGAAG CACAAAAACG TATTGGACAC
CTTTATCCAC CTGTTGAAGT CACTGAGGAG ATGGCGAGGG AACGGTCTGA CCTCAAACCG
CTTGTTGGCA AGAAACTTAC CGTCATTGCA TGGTTGTGGA CAAGAACTGT CAAAAGTCCG
AATCCTGCTT TTACCCATGT AAACGTTCCA CTTGTTTCGT CCTATATATT GGCAAATAAA
GATGGTAAAG AAGTATACGT AAAGCCAGTT ATCGAGGGTG ATAAGTATTA TTTTTTGATT
AAAAATGGTA CTCCAACAAC TGAAGCTAAA GATGGGACAA AAGCAACTGT GCGAGGGGCA
AACTTCAGAT GTATAGTTTC AGGCGCTGTA ATTGGAAGCG ACTATATAAA AACAGAAGGT
AAAGCGGGAA GAATCGGCCA AAGACTCATG GCTATTGTTG CCGAGGGTAC CCGAGGCCGG
GTTTATCTCC CCCCAACATT GGATGCGGAA ACTGTTGCAA AAGATGCTGC TCCAACGTGG
CGTCCATCCG GTGATGTACC AGAGAGATTG ACAGGTGGGA CTTGTGTTCC TTATGGATTG
AAAGAATGGG GAGATATTTT TACTCCTCGG CAGTTGGTCG CATTGACAAT ATTAAGCGAT
TTACTTTCTG AAGTGCGTGA ACTTGTCAGA GAAGATGCTT TAGCTTCTGG CTTATCTGAC
AGCGAAAAAG GACTTGGTGA TGGAGGCTCT GGTGTGATTG CTTATGCGGA AGCAGTTAGC
GTGTATCTTG CTTTTGCCAT CAATCGCTGT GCGGATTTTT GTAATTCAGT TACACGATGG
GTTCCAGGAA ATCAGAAGGT GATGAATCTT TTTGGGAAAC AAGCCATTCC TATGACATGG
GATTATCCAG AAGCTGCTAT TCTTGCTGAT ACGGTCGGTG GCTTTGCTCC CGCTTCGAAA
TACGTTGCTG ATTGCATTGG GAAGTTGTCT CCTGCCGCTG TCGGTTTTGC CTCTCAAGCT
GATGCGCAAA CACAGAGTAT TTCCATGGCT AAAATAATTT CAACTGATCC ACCCTATTAT
GACAACATTG CCTATGCTGA TCTTTCTGAT TTTTTTTACG TGTGGCTGAG GAAATCCTTG
CGTCTATTTC TTCCAGGCTT GTTTTCCACA ATTACAGTCC CTAAGGTTGA AGAACTCGTA
GCGGCTCCTT ATCGTCATGG CACAAAACAG AAAGCTGAAA CGTTTTTTCT TACGGGCATG
ACTGAGGCCA TCCATAATCT TGCCGAACAA GCACATCCCG AGGGCCCGGT TACAATTTAC
TACGCATTTA AGCAGTCGGA GACAAGTGGA CAAGATGGCA CATCTTCTCC AGGATGGGTA
ACGTTTTTGT CGGCAGTGTT AAGTGCCGGT TTTGCTATTG TTGGTACATG GCCGTTACGC
AGCGAACAGG AATTCCGGAT GATTGGCATG GGAGCAAATG CCCTTGCATC CAGCATCGTC
CTTGTTTGCC GCAAACGTTC TGCTGACGCT CCCTCCGTTT CCCGCCGTGA GTTCATTCGC
GAGTTGAACG GCGTATTGCC CGAGGCACTT GACGAAATGA CCAAAGGTTC CGGCGAGGAA
CGTTCGCCCG TTGCTCCGGT GGATTTGTCG CAGGCTATCA TTGGCCCCGG CATGGCGGTT
TTTTCCAAAT ACTCAGCAGT GCTGGAAGCA GACGGCACAC CGATGAGCGT CAGAACGGCT
CTCCAGCTCA TCAACCGTTT TCTTGCCGAG GATGATTTCG ATCCAGATAC CCAGTTCTGC
CTTCACTGGT TTGAGCAATA CGGATGGAAT GAAAACCTTT TCGGTGAAGC TGATGTATTA
GCCCGCTCAA AATCGACAAG TGTTGACGCC ATGAAGGAGG CCGGTGTGCT CCAGAGTGGC
AGCGGCAAAG TCCGTCTGCT CAAATGGGCG GAGTATCCCA GCGATTGGGA TCCGCGTACT
GACAAGCGAA TGCCTGTATG GGAAGCACTG CACCAGCTTA TCCGCGCATT GAAGCAAGGA
GGCGAATCTG CATCCGGAGC ACTTCTTGCT GCTCTCGGCG GCAAAGCTGA AGCGGTACGT
CAACTCGCAT ACCGCCTCTA TACGCTCTGC GAACGACTCG GCCAGGCCGA AGATGCACGA
GCCTATAACG AACTGATCAC AAGCTGGACA GGTATCGAAT CTGTTGCCAA CAGCATACCA
AAACCGTCCG ATCCGCAACT GTCACTTTTT GATAACTGA
 
Protein sequence
MISDYPIKSP RKLIEVALPL DAINAACAYE KMPGIGAHPR GIHLWWARRP LAAARAVLFA 
QLVNDPGYQQ GCGFKYGKNK KEAAIERKRL FKIIEELVLW ENTTNEEVLE RARVEIRRSW
REVCELNKEH PQAAELFNPE KMPAFHDPFA GGGAIPLEAQ RLGLEAYASD LNPVAVLINK
AMIEIPPKFA GRPPVGPEIE NKQGKSLELP KIWPGATGLA EDVRRYGSWM RDEAQKRIGH
LYPPVEVTEE MARERSDLKP LVGKKLTVIA WLWTRTVKSP NPAFTHVNVP LVSSYILANK
DGKEVYVKPV IEGDKYYFLI KNGTPTTEAK DGTKATVRGA NFRCIVSGAV IGSDYIKTEG
KAGRIGQRLM AIVAEGTRGR VYLPPTLDAE TVAKDAAPTW RPSGDVPERL TGGTCVPYGL
KEWGDIFTPR QLVALTILSD LLSEVRELVR EDALASGLSD SEKGLGDGGS GVIAYAEAVS
VYLAFAINRC ADFCNSVTRW VPGNQKVMNL FGKQAIPMTW DYPEAAILAD TVGGFAPASK
YVADCIGKLS PAAVGFASQA DAQTQSISMA KIISTDPPYY DNIAYADLSD FFYVWLRKSL
RLFLPGLFST ITVPKVEELV AAPYRHGTKQ KAETFFLTGM TEAIHNLAEQ AHPEGPVTIY
YAFKQSETSG QDGTSSPGWV TFLSAVLSAG FAIVGTWPLR SEQEFRMIGM GANALASSIV
LVCRKRSADA PSVSRREFIR ELNGVLPEAL DEMTKGSGEE RSPVAPVDLS QAIIGPGMAV
FSKYSAVLEA DGTPMSVRTA LQLINRFLAE DDFDPDTQFC LHWFEQYGWN ENLFGEADVL
ARSKSTSVDA MKEAGVLQSG SGKVRLLKWA EYPSDWDPRT DKRMPVWEAL HQLIRALKQG
GESASGALLA ALGGKAEAVR QLAYRLYTLC ERLGQAEDAR AYNELITSWT GIESVANSIP
KPSDPQLSLF DN