Gene Cphamn1_1773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1773 
Symbol 
ID6375460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1920149 
End bp1921549 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content46% 
IMG OID642684266 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_001960172 
Protein GI189500702 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCC TTGACTTTTT TGATGAAGGC GGCAACAGCG ATACGAACGG CAACTACAGG 
AATCTTCAAC TTGACGACTT AGATCTCACA TCCATATATG ACACCGAAGA GCTGATAGAG
ATAATCATCC AGTTCAACGA AGAGGGGAAA CATCCGGAAG CGCTGGCTGT GGCCCGGCAT
TTAACCGAAA CAGCATCCTA CAATGCGGAA TCCTGGTTTC ACCTGGGAAA CTGTCTGACA
GTCAACGGTT CATTTGACGA CGCGAAAGCT GCCTTCAGTA AAGCGACTGT TCTCAGTCCC
GCTGACAGTG AGATGAGACT GAATCTCGCA CTCGCGCATT TCAATACCGC CGAATACAAG
ACCGCTCTCG ATAAACTCGA CTCCATCCTG TGCGATTCGA CTCTTGAAAA GGAGATGTAT
TTCTATAGAG GACTTATTCT TCAGAAAATG GAGCGTTACC GGGAGTCCGA AAAATATCTT
GAAAAATGCC TCGCCCTGGA ACCGGACTTC GCGGAAGCGT GGTATGAGCT CGCGTTCTGC
AAGGATGTCC TTGGAAAAAT GGAGGAGAGC GCGACCTGTT ATCAAAAAAC CATTGATCAG
GACCCCTACA ATGTCAATGC CTGGTACAAT AAAGGGCTTG TGCTGAGTAA ACTGAAAAAA
TATGACGATG CCCTTGAATG TTACGATATG GCTATTGCAA TAGCCGATGA TTTCAGCTCG
GCCTGGTATA ACAGGGCAAA CGTTCTGGCT ATTACAGGAA AAATTGAAGA AGCCGCTGAA
AGTTACCTCA AAACGATTGA ATTTGAACCT GACGACATCA ATGCGCTTTA CAATCTTGGC
ATAGCTTTTG AAGAGCTTGA AGACTACGAC AAAGCGATAA CGCACTATAC GAGGTGCATC
GAGCTGAAAC CGGATTTCGC GGATGCATGG TTTGCACTTG CCTGCTGTCA TGAGGCTGAC
AATGCCTATG AGGAGGCATT GAAATCGGTA AATCAGGCGC TGAACTATCT TCCGGGTTCG
GTTGATTTTC TTCAGTTGAA AGCCGAGATT TACTATAACA TGGAAAGTCT CGAACGATCG
ATCAGCACCT ATAAAAAAAT CCTTACGATC GAGGCTGACT CTCCCCAGCT TTGGGTTGAT
TACGCTGTCG TGCTGCGGGA GGCTTCCCTC TATACAGAAT CTCTGGAAGC CTTTGAACAT
TCTCTTGAGC TCCAGCCACA ATCAGCAGAG ACCCATTTTG AGATCGCAGC GACCTATTTT
GCCCTTGGCG ATAAAAAAAG CACGATCAAA TCATTGACAA AGGCTTTCTC CATCGATCCT
GACAAGAAAC AGCTTTTCAA GACAACCTTC CCTGAGCTCT ATAATCAGGA TTCGGTTCGG
GAAATACTGG GTATTGTCTG A
 
Protein sequence
MSFLDFFDEG GNSDTNGNYR NLQLDDLDLT SIYDTEELIE IIIQFNEEGK HPEALAVARH 
LTETASYNAE SWFHLGNCLT VNGSFDDAKA AFSKATVLSP ADSEMRLNLA LAHFNTAEYK
TALDKLDSIL CDSTLEKEMY FYRGLILQKM ERYRESEKYL EKCLALEPDF AEAWYELAFC
KDVLGKMEES ATCYQKTIDQ DPYNVNAWYN KGLVLSKLKK YDDALECYDM AIAIADDFSS
AWYNRANVLA ITGKIEEAAE SYLKTIEFEP DDINALYNLG IAFEELEDYD KAITHYTRCI
ELKPDFADAW FALACCHEAD NAYEEALKSV NQALNYLPGS VDFLQLKAEI YYNMESLERS
ISTYKKILTI EADSPQLWVD YAVVLREASL YTESLEAFEH SLELQPQSAE THFEIAATYF
ALGDKKSTIK SLTKAFSIDP DKKQLFKTTF PELYNQDSVR EILGIV