Gene Cpha266_1742 details


Gene Information

Locus tag: Cpha266_1742
Symbol:
ID: 4571104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1968204
End bp: 1969652
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 50%
IMG OID: 639766325
Product: hypothetical protein
Protein accession: YP_912183
Protein GI: 119357539
COG category: [S] Function unknown
COG ID: [COG2308] Uncharacterized conserved protein
TIGRFAM ID:
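
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record.
start_bp = 1968204
end_bp = 1969652

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1449 482
```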


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.208956
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCT TTTTCGAACG ATATGAAGCT GAGCCAAGCC GGTTTTTTGA CGAAGTGATC 
TCTCATGAAG GCAGACCACG CGCTCACTAC AACAAACTGC TTAACCGTTT CAGCCAGTTC
TCATCAGATG ACATCAAGGC CCGCCGCCAG ATTCTCAATA TTTTTTTCCG CAATCAGGGG
ATCACCTTTA CGGTGTATGG CCTGGAAGAG GGCATCGAAC GGATTTTCCC TTTCGACATG
GTTCCCAGAG TACTGCCTGC GCATGAATGG AAAATCATTG AAAAAGGTCT GGAGCAGCGC
ATTACAGCAC TGAACAAGTT CCTGCGCGAC ATCTACCACC ATCAGAAAAT CCTGAAAGAC
AAAATCATTC CTGCGGAACT CGTACTTGGA AGCCAGCATT TCAGGCGTGA GTTCATAGGG
GTCAATCCTC CTCTCGGCAT TTATATTCAT GTTGCCGGAA GCGATATTAT CCGCGACGGA
GAGGGAAACT ATCTGGTACT TGAAGACAAC CTGAGAACAC CGAGCGGGGT TTCCTACATG
CTTCAGAACA GGCAGGCTAT GAAACGGGCG TTTCCTGTGC TGTTTGAAAA GTATCAGGTG
CGACCAATTG AGAACTATCC CCAGGAACTG CTGCGCACCC TGCAGGAAAT CAGCCCGGTT
ACACGTCGCG AACCGAATGT TGTTCTGCTC ACTCCGGGCA TCTACAATTC AGCATATTTC
GAGCACAGTT TTCTTGCCCG GCAAATGGGC ATCGAACTGA CTGAAGGGAG AGATCTCGTG
GTGAACAACA ACAAAGTCTA CACAAGAACC TCCCGGGGTC TCGAGCGGGT TGATGTAATT
TATCGCAGGG TTGACGACGC GTTCCTTGAC CCGCTTGTTT TCCGACCCGA CTCGAAACTC
GGGGTAGCCG GTCTCATCAA TGCCTATCGC AAAGGAAATG TCGCTCTTGC AAACGCAATC
GGCACCGGCG TTGCAGACGA TAAGGTGATC TACAGTTTTG TTCCCAAAAT GATCAAGTAC
TATCTTGGAG AAGACCCCAT ACTGCAGAAT GTGCCGACCT GGCTTGCAAG CAATCCGTCC
GATCTGAAAT ACATTCTTGC AAACCTCGGT TCACTGGTAG TAAAGGCTGC CAACGAATCC
GGAGGGTACG GAATGCTCAT AGGCCCGGAA TCGACCGCTG AACAGCAGGA AAAATTCGCA
GAACTTATTG TTTCAAACCC GCGCAACTAT ATTGCACAAC CAACTATATC GCTATCGAGG
CATCCGAGCT TTTATAACGA TACCGATCTG TGCGGCTGCC ACATTGATCT GAGGCCTTAT
GTACTGAGCG GGAAAACCAC TACTATTGTG CCAGGAGGTC TTACCAGGGT CGCGCTGAAG
CGAGGCTCAC TTGTTGTGAA CTCATCGCAG GGCGGCGGGA GCAAGGACAC CTGGGTTGTT
GACGAATAA
 
Protein sequence
MSRFFERYEA EPSRFFDEVI SHEGRPRAHY NKLLNRFSQF SSDDIKARRQ ILNIFFRNQG 
ITFTVYGLEE GIERIFPFDM VPRVLPAHEW KIIEKGLEQR ITALNKFLRD IYHHQKILKD
KIIPAELVLG SQHFRREFIG VNPPLGIYIH VAGSDIIRDG EGNYLVLEDN LRTPSGVSYM
LQNRQAMKRA FPVLFEKYQV RPIENYPQEL LRTLQEISPV TRREPNVVLL TPGIYNSAYF
EHSFLARQMG IELTEGRDLV VNNNKVYTRT SRGLERVDVI YRRVDDAFLD PLVFRPDSKL
GVAGLINAYR KGNVALANAI GTGVADDKVI YSFVPKMIKY YLGEDPILQN VPTWLASNPS
DLKYILANLG SLVVKAANES GGYGMLIGPE STAEQQEKFA ELIVSNPRNY IAQPTISLSR
HPSFYNDTDL CGCHIDLRPY VLSGKTTTIV PGGLTRVALK RGSLVVNSSQ GGGSKDTWVV
DE
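
As a rough cross-check of the two sequences, the sketch below translates a nucleotide string codon by codon. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two tables differ in their permitted start codons). The helper name `translate` is illustrative and not part of any database tooling. Only the first and last rows of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; assumed
# identical to translation table 11 for codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    # Drop any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First and last rows of the gene sequence; both start in reading frame.
first_row = "ATGAGCCGCT TTTTCGAACG ATATGAAGCT GAGCCAAGCC GGTTTTTTGA CGAAGTGATC"
last_row = "GACGAATAA"

print(translate(first_row))  # MSRFFERYEAEPSRFFDEVI
print(translate(last_row))   # DE*
```

The two outputs match the start and end of the protein sequence listed above, with `*` marking the terminal stop codon that is not part of the 482-residue protein.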