Gene Cpha266_0902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0902 
Symbol 
ID4570516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1031117 
End bp1032877 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content48% 
IMG OID639765497 
Producthypothetical protein 
Protein accessionYP_911374 
Protein GI119356730 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.530608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTTG AAGAAATTAT CAGGATGCTC GATAATCCGT CAACGCTGGG CGATGCTTTG 
ATTGAAGCGG CGAAGTTGCC ATTCAAAGGT GACGACAGGA TACGGTTCCA GCAAAAACGA
CAGCGATTCA TTGATGGATT GAGTGATCAT GAACGGCCTC AATGGGTCGG GGAGATGAAA
ACTTTTTTGT CATTGTATGA ATCGACCGGT AGCGTTGATA GCGCAGTCAC ACCACTCAGT
GCGTATCGCA ATGATTTCAG TACCGATATA TTTATAAGCT ACGCCCGCGA AGACATGAAA
CGGGTTGAGC CAATTGTCAG GGAGCTGGAA AAACACGGCT GGAGTGTTTT CTGGGATCCT
GAGATTCCAC CGGGGGAGAC CTGGCGGGGT TATATCAAGA AAAAGCTGGA TGAATCACGT
TGCGTGCTCG TTGCCTGGTC TCGTCTTTCA GTCACTTCCG AATGGGTTAT CGCTGAAGCT
GATGAAGCAA AAAAACGAGG CATTCTGGTT CCTGTGCTAC TCGATGCTGT TGAGCCTCCA
TTCGGGTTCA GCCATATTCA TGCGGCAAAT CTTTCCTGCT GGAAAAATGA TAGTAATAGT
CGAGCATTCA AGGAACTCGT GAATGCGGTC ACCCTGAAAA TTTCTTCGTC ATCACCATCT
TTCATGAACT CAATATTGGG TGAGACGTCA GTTCCTATCC CTAAGACAGC ACCAGTTATT
GTACCATCAA CCTCAATGCT GGAAGCACTT CGTGTAATGG TTCAAAATTG GATTTCAGTA
GTTGCGAGTA TCAGAACATG GCAACCAAAA AGCCGGCAAA CAGCAATTGC TATTGTGCTG
GTCATGGTGT TTATTTTTGC GGTTTTATCG TATCGTTATT TGCGTGTAAC GGGTCCTTCG
TCACCATCTG TTCCGGAAAC TGTTCAGCCT GTGAACGCAT CGCCAGCAAG GCCAGGAAAC
TTTGTTCTGA TACGCGGAGG TGAGTTTACA ATGGGGAGCC CGGCGAATGA ATCTGGTCAC
GAGAGTGACG AGACTCAGCA TCAGGTCAAA GTGAGTGATT TTTACCTGTG CAAATATGCG
GTTACCCTTG CCGAGTTTAA AAAATTCATT GAAGATTCAG GCTATCAGAC TGATGCTGAA
AAAGATGGCG GCAGTTATAG TTGGGATGGA ATAAGTTGGG TGAAGAATGC TGGAGTTGAC
TGGCGATATG GGGTTTCAGG CAGTGTACGA CCTCAAAGTG AAGAGAACCA TCCTGTGTTA
CATGTGAGCT GGAATGACGC TGTGGCCTAT TGCAAGTGGA TATCGAAAAA AACAGGAGAT
GCATTTCGTT TGCCAACGGA AGCTGAGTGG GAATATGCGT GTCGAGCAGG AACAACCACA
CCGTTCCATA CCGGCGATAA CCTGACAACC GGTCAGGCGA ACTATAACGG AAACTATCCG
TATACCAACA ATCAGAAAGG AGTGTATCGG GAGAACACGG TTAAGGTTGA TGAGTTTGCT
CCGAACGCGT GGGGGTTATA CCATATGCAT GGCAATGTGT GGGAGTGGTG TGGCGACAGG
TATGGGGATA AATATTATGA TGAATGCAAA GCCGAAGGTG TTGTTGAAAA TCCGGTTGGC
CCGGAAACCG GTTCGCTCCG TGTGCTTCGT GGAGGTGGCT GGAGCTTCAA TGCGAGGAGC
TGTCGGTCGG CTTTTCGCAT CGACGTCGCC CCCGACTACC GCAGCAACTA CGCCGGCTTC
CGCCTGGCCT TCGTCCCGTA G
 
Protein sequence
MTVEEIIRML DNPSTLGDAL IEAAKLPFKG DDRIRFQQKR QRFIDGLSDH ERPQWVGEMK 
TFLSLYESTG SVDSAVTPLS AYRNDFSTDI FISYAREDMK RVEPIVRELE KHGWSVFWDP
EIPPGETWRG YIKKKLDESR CVLVAWSRLS VTSEWVIAEA DEAKKRGILV PVLLDAVEPP
FGFSHIHAAN LSCWKNDSNS RAFKELVNAV TLKISSSSPS FMNSILGETS VPIPKTAPVI
VPSTSMLEAL RVMVQNWISV VASIRTWQPK SRQTAIAIVL VMVFIFAVLS YRYLRVTGPS
SPSVPETVQP VNASPARPGN FVLIRGGEFT MGSPANESGH ESDETQHQVK VSDFYLCKYA
VTLAEFKKFI EDSGYQTDAE KDGGSYSWDG ISWVKNAGVD WRYGVSGSVR PQSEENHPVL
HVSWNDAVAY CKWISKKTGD AFRLPTEAEW EYACRAGTTT PFHTGDNLTT GQANYNGNYP
YTNNQKGVYR ENTVKVDEFA PNAWGLYHMH GNVWEWCGDR YGDKYYDECK AEGVVENPVG
PETGSLRVLR GGGWSFNARS CRSAFRIDVA PDYRSNYAGF RLAFVP