Gene Cphamn1_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1504 
Symbol 
ID6375182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1624442 
End bp1626028 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content51% 
IMG OID642683997 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_001959911 
Protein GI189500441 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAA AAGCCATACT TTCTGCTCTC TTTCTAACCA TACTTACCCT GCATTCTTTT 
GCGGAATCCG CTCTTGAAAA AACAGCGGAA TTCAAAACCC TTCTTTCAGA TGCCTCACAG
GGCAACGAAG AGCATCAATT AAAACTTGGA TTCATCTACG CCAATGGTGA CGGAGTCGAA
CAGAATTATA CCAAAGCCGT CAAATGGTAC CGAGTGGCCG CCGATCAGGG CAACATGATC
GCTCAGAATA ACCTGGGCCA GCTCTACGCG ACTGGAAAAG GAGTGACGCA AAATCATACA
GAAGCCGCCA AATGGTTTCG CATGGCCGCC GAACAGGGCC ATGCTAAGGC ACAGAGCAAT
CTCGGCCTGA TTTATTTTTC AAATCAGGGA GTACAACAGG ACTATGTTGA AGCCGCCAAA
TGGTTCGGGA TGGCTGCTGA TCAGGGTCAT ACAAGAGCTC AATTTTTCCT CGGAAGAATG
TACTATTCTG GTGAGGGTGT AACGAAAAAC CACAAAACCG CAGCCCGATT ATTTCAGCTT
GCAGCGAAAA ATAATGACGC GAAAGCACAG CATAATCTCG GCGTGATGTA TGCAGAGGGT
CAGGGAGTTG AGCAGAACTA TACAGAAGCA GCCAGGTGGT ATAGGAAAAG TGCGGAACAG
GGCGATCCTG ATGCCGCCTT TCATCTGGGC ATGCTCTTCT CTGGAGGCAG AGGCGTCGCA
CAAAACAATG CCGAGGCGTT CAAGTGGTTG CATATCGCAT CTGAAAAAGG CCATACTCAG
GCACAGTTAC AACTTGCCGG CATGTATGAG ACCGGTACAG GAACCTCTCA GAACAGTGAA
GAAGCGCTCA AATGGTATCG TAAAGCCGCC GAAAAAGGTA TCACTCAGGC TCAGAGTAAA
CTCGATTCAC TGCTGAGCAA AAAACCGCTT GTAGAGAGTA GTCCCGCCGA GAGCCTTCCT
GTGCCCCCTG TGCTTGTCCC GAAAGACAAC GAAATTTCCA CACCAGAGGT TGCAGAGACG
GCGCCGGAAG ATCGTGGGGC CCCGGAAAAC AACACCTCGG ATCGGGCACA CTATCTCAGT
GCGGCTCAAG AGGGTGACAG TGAGGCCGCA CTCAAACTTG CCGATATGCT CTCAGAAGGT
CGCGGTGGCG AACAAAATGA TGCTGAAGCC CGCTCATGGT ATCAGAAGGC TGCTGAAATG
GAAACTGGTG AAGCGGCTTT CAAACTTGCT GGCATGATTA TAGAAGGACG CGGTGGAAAA
CAGAGCAATT CCGATGGCCG CTCCTGGTAC AAGAAAGCCG CGGCAATGGA ATACAGTGAA
GCAGCTCTTC AATTAGGCTT CATGTACCAG GCCGGGAAAA ATGCTCCGAG AAACAACTGG
CTCGCGCGTC AATGGTTTCT CGTCGCAGCT GAAAAAGGAT TGCCCCGGGC ACAGTATCAG
CTCGGGAACA TATTCGCAGA GGGGCGTGGC GTAGACAAGA ATGTTGAAAA AGCGGCTGAA
TGGTACCGAA AAGCCGCCGA ACAGGGTCTG GAAGAAGCAC GCGACCGGCT CAGCAAAATG
TCGGGAGACG AACAAACGGC ACGCTGA
 
Protein sequence
MLKKAILSAL FLTILTLHSF AESALEKTAE FKTLLSDASQ GNEEHQLKLG FIYANGDGVE 
QNYTKAVKWY RVAADQGNMI AQNNLGQLYA TGKGVTQNHT EAAKWFRMAA EQGHAKAQSN
LGLIYFSNQG VQQDYVEAAK WFGMAADQGH TRAQFFLGRM YYSGEGVTKN HKTAARLFQL
AAKNNDAKAQ HNLGVMYAEG QGVEQNYTEA ARWYRKSAEQ GDPDAAFHLG MLFSGGRGVA
QNNAEAFKWL HIASEKGHTQ AQLQLAGMYE TGTGTSQNSE EALKWYRKAA EKGITQAQSK
LDSLLSKKPL VESSPAESLP VPPVLVPKDN EISTPEVAET APEDRGAPEN NTSDRAHYLS
AAQEGDSEAA LKLADMLSEG RGGEQNDAEA RSWYQKAAEM ETGEAAFKLA GMIIEGRGGK
QSNSDGRSWY KKAAAMEYSE AALQLGFMYQ AGKNAPRNNW LARQWFLVAA EKGLPRAQYQ
LGNIFAEGRG VDKNVEKAAE WYRKAAEQGL EEARDRLSKM SGDEQTAR