Gene Cpha266_2038 details


Gene Information

Locus tag: Cpha266_2038
Symbol:
ID: 4569158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2357333
End bp: 2358589
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 49%
IMG OID: 639766619
Product: hypothetical protein
Protein accession: YP_912474
Protein GI: 119357830
COG category:
COG ID:
TIGRFAM ID:
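
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2357333
end_bp = 2358589
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute an amino acid.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span_bp, codons, coding_codons)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.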


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0333853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATCA TTATTTTTGA AGATGAACGG GTCTCTTTTC TGAAACCGCT CGCCAATCTT 
AAGCCCGTTT ACGGACTGGT TTCTGGTTTT TTGTCGCTTG ATGAAAAATT TCAGCATTAT
ATCGGCGCCG CTCATCCGCT CTCTTATCAT CTTCGTCGCT ATCTTGCTCC TTTTTTCAGC
GAGCAGCACC CTGACAGGCT TGTCAATTCG ATTGAAGATG ATGATGTGTT GTTTGTGAAC
GGGAGGCTGA TTTGTGACAA GCGTGTTGCG CAGATTGTCG CAACCGGTGG CGTGGATTCC
GGGCACGCTT TAGTGCAGGG GGATGATCTT GTTTTGGCAA GGGTTCGTCG CGATCGGCTT
CGGCTTGATC AGGATGAGAT GCTGACCGAT CTTATTGATA CAGCGGAGCT CTCTTCGGGG
CTTGTTGTCG ATAAGGTATC GGGTTTCAGG ATGATCCGCT ATGTCTGGGA TGCGATTGGC
TTTCATGCTG CCGAGTTGAA AAACGATAGC GAAACGCTGG AGCTTGGTTG TATTGAGGGC
AATGTTCATC CTTCTGCGGT GATGGTGAAT CGTTCGAATG TTTATATCGG CGCTGATGCT
GTGGTGAAGG CCGGAGCGGT GATTGACGCG ACGGAGGGGT TTGTTGCCGT TGGTTGCGGG
GCTGTTGTCG AACCGCAGGC CGTGTTGATG AACAGCGTTT ATCTGGCGCC CTGGTCAAAG
GCGAATATCG GAGCAAAGAT TTACAGCAAT GTGGCTGTTG GTCTTGCTTC GAAGGTCGGG
GGCGAGGTTG AGGATTCGAT TATTGAGCCT TTTGCAAATA AGCAGCATGA TGGTTTTCTC
GGTCACTCCT ATCTTTCATC CTGGTGTAAT CTTGGGGCAG GCACGAATAC TTCGGATCTG
AAAAACAATT ACAGCAAGAT CGTTCTTGAT ATGGGCAGCC ATAAAGTTCT TACCGATCTG
CAGTTTCTTG GTCTGATTAT GGGAGACCAT TCGAAAAGTT CAATCAATTC GATGTTCAAT
ACGGGAACCA TTGTCGGCAC CAGCGCCAAT GTTTTTGGCG AGGGGTTTCC TCCTAAATAT
ATTCCCTCTT TTTCGTGGGC AGGAAGTGCC GCAGGTATTG AGCCTTACGA GCTTGACAAG
GCTGTAGGGA CTGCACGAAA AGTAATGGAG CGAAGAAGGG TTTCCATGAG CCTTTCGTAT
GAGGCTATGT TCCGTTTTCT TGCCTTGCGG GAGCAGGGTG GAGAGGTTTT TATCTAA
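
The GC content field in the gene information can be recomputed from a sequence laid out in space-separated blocks of ten bases, as above. This is a minimal sketch; the function name `gc_content` is hypothetical, and the example call uses only the first block of the gene sequence rather than the full record.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used in the listing) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First ten-base block of the gene sequence, used only as a short illustration.
first_block = "ATGCAGATCA"
print(gc_content(first_block))
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives a value that can be compared with the listed GC content.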
 
Protein sequence
MQIIIFEDER VSFLKPLANL KPVYGLVSGF LSLDEKFQHY IGAAHPLSYH LRRYLAPFFS 
EQHPDRLVNS IEDDDVLFVN GRLICDKRVA QIVATGGVDS GHALVQGDDL VLARVRRDRL
RLDQDEMLTD LIDTAELSSG LVVDKVSGFR MIRYVWDAIG FHAAELKNDS ETLELGCIEG
NVHPSAVMVN RSNVYIGADA VVKAGAVIDA TEGFVAVGCG AVVEPQAVLM NSVYLAPWSK
ANIGAKIYSN VAVGLASKVG GEVEDSIIEP FANKQHDGFL GHSYLSSWCN LGAGTNTSDL
KNNYSKIVLD MGSHKVLTDL QFLGLIMGDH SKSSINSMFN TGTIVGTSAN VFGEGFPPKY
IPSFSWAGSA AGIEPYELDK AVGTARKVME RRRVSMSLSY EAMFRFLALR EQGGEVFI
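
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (the two tables differ in which codons may act as start codons), which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Each 16-character row covers one first base; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGCAGATCA TTATTTTTGA AGATGAACGG"))  # prints MQIIIFEDER
```

The result for the first 30 bases matches the first ten residues of the protein sequence listed above.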