Gene Cpha266_2104 details


Gene Information

Locus tag: Cpha266_2104
Symbol:
ID: 4569635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2439291
End bp: 2440898
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 48%
IMG OID: 639766686
Product: cytochrome c family protein
Protein accession: YP_912540
Protein GI: 119357896
COG category:
COG ID:
TIGRFAM ID:
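
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2439291
end_bp = 2440898

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon that is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1608
print(remainder)       # 0
print(protein_length)  # 535
```

Under these assumptions the computed values agree with the Gene Length (1608 bp) and Protein Length (535 aa) fields.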


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC TGCTGTTGAC AGGTATTCTT CTCTCGGGTT TTCTCTTTCC GTTATCGAAG 
CCGCTTTCCG CAGAGCCCTT CCACCACAAA GTTGATTCCC TGCTGATTTC AACGACCGAT
CACAAGAAAT TCAAAATTCT TCAACAGGAT TTCAAGAGTG GTCCCGAAGT AACAAAAGCC
TGTCTGACCT GCCATACCGA AGCGTCCAAA CAGTTGCATC GCACCAGACA CTGGACATGG
GATGTACCCA TGAAAAAAGG TGAGCGGCTG GGCAAAAAAA ATGTTGTGAA CAATTTCTGC
ATATCGGTGG AAGGCAACGA ACCGCGATGT ACCTCCTGCC ATATCGGCTA CGACTGGAAA
GACAAAAACT TCAATTTCAA GAGTGAAGAG AATGTAGACT GCCTTGCCTG CCACGATATG
ACCGGAACCT ATAAAAAGCT TCCCGCAGGT GCGGGACATC CGGCATATTT CGATACGGTT
TTCGAAAAAA AGCTATACCC GAAAGTCAAC CTCTCTTATG TTGCGCAGCG TGTCGGACAG
CCCGACCGGC ACAATTGCGG CATCTGCCAT TTCGAAGGTG GTGGGGCCGA TGCGGTCAAG
CATGGCGATC TTGACAACTC GCTGCTCAAA CCCGATCGCG AACTCGATGT CCATATGGCG
ATCGGCAAAA AAGACCTCAA CATGACCTGC GCGGACTGCC ATAAAACGGA AGGCCATCAG
GTTCCCGGAA GCCGATATAC GCCGGAAGCT CATGATACGC ATGGATTCGA TTATCCGCTG
ACGGATAACA ATCCGGCCAC CTGCAGTTCA TGTCACGGTC TCAAGCCGCA TAAAAAGCTT
AAAAAGCTCA ACGACCACGT AGCCAGAGTA GCCTGTCAGA CCTGTCATAT CCCTTTCATA
GCGAAACAAC GCCCGACAAA AATGTGGTGG GACTGGTCCA AAGCCGGAAA ATTCGATAAA
AACGGTAAAG AGATCACGAT TAAAGACTCT TCGGGATGTG TTCTGTATGT ATCAAAGAAA
GGAGCGTTCA GATGGGCTAA AAACGTTGCT CCCGAATACA GATGGTTTAA CGGCGAAATG
AACTACACGA CGTTTAATAC CCAAATCAAC GACAGGAAAG TTGTTTCGGT CAATCATCCC
GATGGCGCGG CAAACGACAC CCTTTCGAGA ATATGGCCGT TCAAGGTACA TCGCGGAATG
CAGCCTTACG ATCCGGTACT GAAACGCTTC GTCAAGCCCA TTGTTTACGG ACCGAAGGGC
TCCGGGGCCT ACTGGTCAGA CTTCAACTGG GATAAATCCA TCAGGAAGGG AATGGAGAAT
GCAGGTCTCG AATACAGCGG AAAATATGCT TTTGTGGAAA CTGAAATGTA CTGGCCGATC
TCTCATATGG TATCTCCGAA AGAAAAATCC CTCAGTTGCA AAGAGTGCCA CTCACGAAAC
GGAAGACTGC AGAATCTCAG CGGATTCTAT CTGATGGGAA GAGACACGAA CCCTTTTGTG
GAATACTTTG GTCTTCTGGC CATATCGGGA TCACTTATCG GTGTCATCAT CCACTCAATC
ATCCGGTATT TCACCGTTAA AAAACTGAAA AAAGCAGGTG GCCTATGA
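
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used elsewhere. The following is a minimal Python sketch of that cleanup plus a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first display line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = (
    "ATGAAAAAAC TGCTGTTGAC AGGTATTCTT CTCTCGGGTT TTCTCTTTCC GTTATCGAAG"
)

seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.2%}")     # GC fraction of this line only
```

Applied to the full gene sequence, the cleaned string should have the length given in the Gene Length field. The GC value for a single line need not match the GC content reported for the whole gene.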
 
Protein sequence
MKKLLLTGIL LSGFLFPLSK PLSAEPFHHK VDSLLISTTD HKKFKILQQD FKSGPEVTKA 
CLTCHTEASK QLHRTRHWTW DVPMKKGERL GKKNVVNNFC ISVEGNEPRC TSCHIGYDWK
DKNFNFKSEE NVDCLACHDM TGTYKKLPAG AGHPAYFDTV FEKKLYPKVN LSYVAQRVGQ
PDRHNCGICH FEGGGADAVK HGDLDNSLLK PDRELDVHMA IGKKDLNMTC ADCHKTEGHQ
VPGSRYTPEA HDTHGFDYPL TDNNPATCSS CHGLKPHKKL KKLNDHVARV ACQTCHIPFI
AKQRPTKMWW DWSKAGKFDK NGKEITIKDS SGCVLYVSKK GAFRWAKNVA PEYRWFNGEM
NYTTFNTQIN DRKVVSVNHP DGAANDTLSR IWPFKVHRGM QPYDPVLKRF VKPIVYGPKG
SGAYWSDFNW DKSIRKGMEN AGLEYSGKYA FVETEMYWPI SHMVSPKEKS LSCKECHSRN
GRLQNLSGFY LMGRDTNPFV EYFGLLAISG SLIGVIIHSI IRYFTVKKLK KAGGL