Gene Cpha266_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1047 
Symbol 
ID4571009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1185859 
End bp1187145 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content45% 
IMG OID639765650 
Producthypothetical protein 
Protein accessionYP_911518 
Protein GI119356874 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00366943 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATGA TGCCGCATCG CATTGAAAAG AATTTGGGCG GGGTAGCAGA GCTCTTTTTG 
CGAGTTTCAG TGACGGTCAT CACCATTGCG GTTTTTCTTG GAGCAATCGG CGCCTTGCTC
GGTAATATTT TTTTGCTGCG TCTGCATCCC TATGTTTTTT TTATAGGGTT CGGCAACCTT
GCCATTCTTA TTCTCAACAG GTATCTCACG GCTGTCATCT ACCCGGAGTT GAGAATAGAT
CCTCATAAGC AGCTTCGCTA TATGTATGCC GTGCTGCTAT CACTGATAAG TATTGCCATT
GCACTGTTTA TGGAGTGGCC TCTTCTGAAG GCCGCCACAG GCCTTTTGCT CATGGTTGTT
GTTATGGGGC CGCTCAAGGA GATATTTACA ACTCTTTCCG TCAGCCGGAT ATGGAAGGAG
GTTTCTGTAC GTTATTATAT TTTCGATGTT CTTTTTCTGC TTAATGCCAA CCTCGGACTT
TTCACCCTTG GCCTGAAAGA GGCTTTTCCC GACCAGAAGA TTATTCCTTT CTTTGTTACG
CAGTCAGCCT ATTTTCTCGG TTCCTCTTTT CCTCTCAGTA TCAGCGTCAT GGGATTTCTC
TATACTTACG GCTGGCGTAC CTCTCCGAAA AGAGGGCTTA TCCGGCAGCT TTTCAGTATC
TGGTTCTATG TTTTCGTCGG TGGTGTTCTC GGCTTTCTCA TTGTTATTCT CCTTGGTAAT
TATTTGGGCA TGATGCTGAT CAGCCACCTT CTTCTTTTAG GCGTCATGGC TATACTCGGA
GGTTTTGCCG CCTATCTCTA TGGCTTTTTC AAAAAGAACT TTCATCATCC GGCGCTTGCC
TTTCTGTTAA GCGGGCTCTC CCTTTTATTA GCAACAAGCG CCTATGGCAT CATGAATGTT
TACTTCATCA AGGGGATCCC TTTCGGATCC TATCCCCCTA TTCGCCTGGA TAAAATGTGG
CTTTACCACT CTCATACCCA TGCGGCACTG CTCGGGTGGA TAACCTTCTC TTTTATTGGC
ATGATCTATA TCGTCATACC TGCAATTTTC CGTTCGAACT CTCTTCAGTT TCTTCAAGGT
TCCGGAGAGC TTTCTGAAAT GCTGCAGAAG AAGACGATGA AAAAGGCATT CAGGCAGCTT
ACCATTATGC TCCTGTCGGC AACAGCAATC CTTCTTGCTT TTTTTCTTGA AAACCAGATA
CTTCTTGGTC TCTCGGGTCT TCTGTTTGGT TGTTCGGTAT TTTTTGTAAT TATCAATCTT
CGTTCTGAAC TCTACGAGGA AGAATAA
 
Protein sequence
MHMMPHRIEK NLGGVAELFL RVSVTVITIA VFLGAIGALL GNIFLLRLHP YVFFIGFGNL 
AILILNRYLT AVIYPELRID PHKQLRYMYA VLLSLISIAI ALFMEWPLLK AATGLLLMVV
VMGPLKEIFT TLSVSRIWKE VSVRYYIFDV LFLLNANLGL FTLGLKEAFP DQKIIPFFVT
QSAYFLGSSF PLSISVMGFL YTYGWRTSPK RGLIRQLFSI WFYVFVGGVL GFLIVILLGN
YLGMMLISHL LLLGVMAILG GFAAYLYGFF KKNFHHPALA FLLSGLSLLL ATSAYGIMNV
YFIKGIPFGS YPPIRLDKMW LYHSHTHAAL LGWITFSFIG MIYIVIPAIF RSNSLQFLQG
SGELSEMLQK KTMKKAFRQL TIMLLSATAI LLAFFLENQI LLGLSGLLFG CSVFFVIINL
RSELYEEE