Gene Cpha266_2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2667 
Symbol 
ID4568771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp3060341 
End bp3061615 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content56% 
IMG OID639767233 
Producthypothetical protein 
Protein accessionYP_913075 
Protein GI119358431 
COG category[S] Function unknown 
COG ID[COG4198] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0707942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGACA TTATGCCTTT CAGGGCACTG CACTACAAGC AGGAAACCAT GAACCACGCC 
GAAAAGGTTC TTTGTCCGCC CTACGACGTT ATCTCTCCAG CCCGCCAGCA GGAGCTCTAC
GAACTCTCGC CCTGTAACGC CGTCAGGCTC GAACTCCCGC TTGAGTCCGA TCCCTACCAG
GCAGCCATGG AACGACTGCT CGAATGGAGC CGCATCGGCG AACTGGTAAG GGACGCTGAA
CCGGCGATCT ACCCGTACAT GCAGACCTTC GAAGACGCCG AAGGCGCCGT CTACAACCGA
ACCGGCTTTT TTTGCGCCAT GCGCCTCCAT GATTTTGTCG AACGCAAGGT TCTGCCTCAC
GAAAAAACCC TCTCGGGGCC AAAAGCAGAC CGTCTCAACC TCTTCAGAAA GACAAAAACC
AATATCAGCC CCGTTTTCGG GATCTATGCC GATCCCGATA AAGCAGCCGA TCGTCAAATA
GCCGCCTTTG CATCCAGCAA CCCCCCCCTG ATCGACGCCG TGTTTCAGGA TGTCAGAAAC
CGGATGTGGA AGATAACCGA CAAAGAGATC GTCGAAAAGG TGCGGGCGGG GCTTCAGCAT
CGCACCGTTT TCATCGCCGA CGGCCACCAC CGCTACGAAA CAGGGCTCAA CTACCGAAAC
GAACGAGCGG CAATGAACCC CGCACACACC GGCAATGAGG CATACAACTT TATTCTGGCC
TGCCTTGCCA ACATGCATGA CGAAGGGCTG ATTATTTTCC CGATCCATCG ACTCCTGCAC
AGCCTTGAGC AGTTTGACGC CCTGGCGTTC CGCCGACAGC TTGAGCAATA CTTCGTCGTC
ACGGAACTTC CCCACAGAGA GGCCCTCAAA CGCTACCTGG CCGACGAACC TTCGATCTAC
GCATACGGCG TGGTAACGCG GGAATATATG CTCGGCATCG TCCTCAAAGG GAGCCCCGAG
GAGCTCCTTG ACCACGCAAC TCCCGACTCC CTCCGGAAGC TTGGTCTGGT GGCCCTCCAC
GAGATCGTGC TCGGCAGGCT GCTTGGCATA ACCCCCGAAG CCATGGCGAA ACAGAGCAAC
ATCAAGTACA TCAAGGATGA AGCCGAACTG TATGCTGCCG TCGAAAACGG AGCCGCGCAG
GCCGGGATCG TCGTCAAGCC AACAACGGTT CAACAGGTCG TAGCCGTGTC GGAATCAGGA
GAGGTCATGC CTCAGAAATC AACGTTTTTT TATCCGAAAA TAATGACAGG ACTTGTCTTC
AACCCGCTCG ACTGA
 
Protein sequence
MPDIMPFRAL HYKQETMNHA EKVLCPPYDV ISPARQQELY ELSPCNAVRL ELPLESDPYQ 
AAMERLLEWS RIGELVRDAE PAIYPYMQTF EDAEGAVYNR TGFFCAMRLH DFVERKVLPH
EKTLSGPKAD RLNLFRKTKT NISPVFGIYA DPDKAADRQI AAFASSNPPL IDAVFQDVRN
RMWKITDKEI VEKVRAGLQH RTVFIADGHH RYETGLNYRN ERAAMNPAHT GNEAYNFILA
CLANMHDEGL IIFPIHRLLH SLEQFDALAF RRQLEQYFVV TELPHREALK RYLADEPSIY
AYGVVTREYM LGIVLKGSPE ELLDHATPDS LRKLGLVALH EIVLGRLLGI TPEAMAKQSN
IKYIKDEAEL YAAVENGAAQ AGIVVKPTTV QQVVAVSESG EVMPQKSTFF YPKIMTGLVF
NPLD