Gene Cag_1344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1344 
Symbol 
ID3746859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1811730 
End bp1813145 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content45% 
IMG OID637773882 
ProductTPR repeat-containing protein 
Protein accessionYP_379647 
Protein GI78189309 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.641665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCACA TGCCTGATTT TTTCGACGAC GACCGTTTTG AGTTTTCAAG CAATAACGGA 
GAGTTGCCAC CTGATCTTGA TGGACTTGAC TCCATATTTG ACTCGGAAGA GTTAGTGGAG
CGCATTATGC AATACATGGA GGATGGCTTT CCGCTTGAAG CCCTTGCCGT TGCTCGCCGT
CTTGAGCAAA TTGCTCCTTA CAACAGCGAA ACATGGTTTT ACCTTGGCAA CTGCCTTACC
ATGAATGCCT TTTTTGACGA AGCCCTTGAG GCTTTCCATA AAGCATTGCT ATTGAGCCCA
ACGGATAGCG AAATGCAACT TAATTTAGCG CTTGGCTACT TTAACAACAG CATGTACGAA
GAAGCGCTTG AGCAAATTGA GCGTGTGATG GTAGATTTTG CATTTGAAAA GGAGTACCAC
TACTATCGCG GCATTATTTT GCAACGGCTT GATCGCTATG ACGAGGCTGA AAAGGCGTTC
CTTATGGCAT TGGAGCTTGA TAACGAGTTT GCGGATGCGT GGTATGAAAT TGCTTATTGC
CACGATGTAT GCGGTCGGCT CGAAGAGAGC ACTACCACCT ACAACACAGC GTTGGATCAC
GATCCGTACA ATATTAACGC ATGGTATAAT AACGGCTTAG TGCTCAGCAA AATGAAGCAC
TACGACGAAG CGCTTTTTTG CTACGACATG GCACTTGCTA TTGCCGACGA CTTTAGCTCG
GCATGGTACA ACCGCGCCAA TGTGCTTGCT ATTACAGGGC GCATTCAGGA AGCGGCTGAA
AGTTATGAGC AAACGCTGGA GTTAGAGCCT GAAGATATTA ATGCACTCTA CAATCTTGGT
ATTGCGTATG AAGAGCTTGA GCGCTACCCC GATGCTATGG AGTGTTACCG CCGCTGTATT
ACCATTGTGC CTGAATTTGG CGATGCGTGG TTTGCGCTTG CCTGCTGCCA CGAAGTACTC
GAAGAGTTTG ACGAAGCGTA CAGCGCAACA CTTGAAGCTC TGAAAACTTC AGCAGATTGT
GTAGAATTTT TATTGCTGAA AGCTGAAATT GAATACACGC TCAATAAGGC CGAAGAGTCT
ATTCATACGT ATGAAAGAAT TATTGAGCTG GAACCCGACA ATCCTCAAAT TTGGGTTGAT
TTTGCCATTG TGCTGCGCGA AGCTGGTATG GTTAATGCCT CAATTGAAGC GCTGCACTGC
TCGCTGAAAT TACAGCCAAT GTCGGCTGAT GCTCATTTTG AAATTGCCGC CGCTTACTTT
GCGCTTGGCG ATAAATTGAG CACGCTTAAA GCGCTCAGTA AAGCCTTTAA AATTGATCCC
GATAAAAAAG AGCTGTTTCA AAGCACTTTC CCTGAACTCT ATCAACAGGA TTCGGTAAGA
AGAATGTTGG GAATTCTTGA AATGCCGAAT GAATAG
 
Protein sequence
MIHMPDFFDD DRFEFSSNNG ELPPDLDGLD SIFDSEELVE RIMQYMEDGF PLEALAVARR 
LEQIAPYNSE TWFYLGNCLT MNAFFDEALE AFHKALLLSP TDSEMQLNLA LGYFNNSMYE
EALEQIERVM VDFAFEKEYH YYRGIILQRL DRYDEAEKAF LMALELDNEF ADAWYEIAYC
HDVCGRLEES TTTYNTALDH DPYNINAWYN NGLVLSKMKH YDEALFCYDM ALAIADDFSS
AWYNRANVLA ITGRIQEAAE SYEQTLELEP EDINALYNLG IAYEELERYP DAMECYRRCI
TIVPEFGDAW FALACCHEVL EEFDEAYSAT LEALKTSADC VEFLLLKAEI EYTLNKAEES
IHTYERIIEL EPDNPQIWVD FAIVLREAGM VNASIEALHC SLKLQPMSAD AHFEIAAAYF
ALGDKLSTLK ALSKAFKIDP DKKELFQSTF PELYQQDSVR RMLGILEMPN E