Gene Cphamn1_1241 details


Gene Information

Locus tag: Cphamn1_1241
Symbol:
ID: 6374918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1346825
End bp: 1348063
Gene length: 1239 bp
Protein length: 412 aa
Translation table: 11
GC content: 53%
IMG OID: 642683739
Product: pentapeptide repeat protein
Protein accession: YP_001959654
Protein GI: 189500184
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
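
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1346825, 1348063)
print(length, protein_length_aa(length))  # prints: 1239 412
```

Under these assumptions the result agrees with the listed gene length of 1239 bp and protein length of 412 aa.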


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0335657
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGCT TCTTGAGAGC GGCCTGTCTG ATCGTTCTGG CGGGAGTTTT TCTTCCGTCA 
ATCAGCGCTG CCTATGACAG CGGAAGTCTG ACGCTCATCC GTAAGAGTGT CACGTCATGG
AACAGCATGA GAGAGAACTA TCCTGAAGCA GCGATCGATC TCAGCGGGGC GGACCTGAAA
GGCCGGAATC TTAAAGGCGC TGATCTGCAC AACGCCAATC TTCAGGGTGC GAATCTTCAC
GGTGCCGATT TGAGCGATAC CGATCTTCGT GGGGCATCTT TTGACCATGC GTCACTGAAG
GGCGCGCTGC TTTTCGATGC CGATCTTCGT GAAGCCACTG TACGCGAAGC CGATCTTGAG
GATGCCGCTT TCGAAGGCGC CGATCTCAGA GGTGCCGTGC TTGACGGCGC GGTGATGAAA
CAGGCGGATC TTGGTGAATC CAATCTTCGA AACGCCAGTC TGAGAGGAAC TGATCTGCGG
GCGGCAAACC TGAAAATGGC GGATCTGGCC GGTTGTGATC TGAGTGGAGC ATACCTGTGG
AGGGCAGTAC TTGACGGGGC AAATCTTGAG AACAGTGTCG TGACATCGGT CACTATCGTT
GAAACCGGTC GTTCCGCCGA TCCGGAATGG GCTCAGAAGA ACGGAGCAGT GCTTGCCATG
TCCGAACCAG CCCGGCAAAA GGAGGGTGCT GCTGAAGCGG AAAGCGAGAA TACAGTGACA
GAGTCGATTC TTGCTCAAAA AACCTGGCCG ATAAATCCTG TGGTGCAGAA AATCCGTTTC
GGCGTGGAGA GAAAAGATGC TGCAACGCTA TCATACGACG TTCATCAGCG GGAGTTGTTG
ATAAAAAGCG TCTCAAAATG GAACAGGATG AGGGAGACGA ACCCTGATGC TCCGGTTCGT
CTTTCCGGAG CAAAATTAAG CAGGAAAGTG CTCGATGGAG CGGATTTGCG GGATGCCGAT
CTTGCAGGAT CCCTGATGAA AAGAACAGGG TTGGCCGATA CTGATCTGAG GAATGCCGAT
CTCAGGGAGG CAAATCTCCG TGAAGCGGAA CTGACAAATG CCGATCTTCG GGGGGCGGAT
TTGAGGGGAG CCTACCTGTG GAGAGCGAAT CTGAGCTGGA CGAAAATCGC AGGGATACGT
GTCAATTCGC ATACTGTATT CGATGATGGA AAGAATGTTA CGCCTGCATG GGCGAAAAAA
AGAGGCGCAG TGTTCATGGA CCGGGACATG GAAGAGTAG
 
Protein sequence
MNSFLRAACL IVLAGVFLPS ISAAYDSGSL TLIRKSVTSW NSMRENYPEA AIDLSGADLK 
GRNLKGADLH NANLQGANLH GADLSDTDLR GASFDHASLK GALLFDADLR EATVREADLE
DAAFEGADLR GAVLDGAVMK QADLGESNLR NASLRGTDLR AANLKMADLA GCDLSGAYLW
RAVLDGANLE NSVVTSVTIV ETGRSADPEW AQKNGAVLAM SEPARQKEGA AEAESENTVT
ESILAQKTWP INPVVQKIRF GVERKDAATL SYDVHQRELL IKSVSKWNRM RETNPDAPVR
LSGAKLSRKV LDGADLRDAD LAGSLMKRTG LADTDLRNAD LREANLREAE LTNADLRGAD
LRGAYLWRAN LSWTKIAGIR VNSHTVFDDG KNVTPAWAKK RGAVFMDRDM EE
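
The gene and protein sequences above are laid out in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code; the helper names `clean`, `gc_content`, and `translate` are hypothetical. It does not handle alternative start codons, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (assumed: NCBI table 11 assignments,
# identical to the standard code; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGAACAGCT TCTTGAGAGC GGCCTGTCTG ATCGTTCTGG CGGGAGTTTT TCTTCCGTCA"
print(translate(first_row))  # prints: MNSFLRAACLIVLAGVFLPS
```

The translated first row matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the listed GC content.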