Gene Cag_1056 details

Gene Information

Locus tag: Cag_1056
Symbol:
ID: 3747039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1436182
End bp: 1437912
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 46%
IMG OID: 637773587
Product: hemolysin activation/secretion protein-like
Protein accession: YP_379361
Protein GI: 78189023
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2831] Hemolysin activation/secretion protein
TIGRFAM ID:
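
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is only an illustrative check, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, and the function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1436182, 1437912)
print(length)                     # 1731
print(protein_length_aa(length))  # 576
```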


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000536803
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCCAA AAATTATAAC ATCTCTTGTA GCGGGAAGCG TGGTTTTTTC TGCTTCACTT 
CAAGCGGCTC CGCTTGTACC CAATGCGGGT AGCTTACAGC AGCAACAGCG CCCAGCGGCG
GTTTCAAAAC AGTTCAAACA AAACGTTCAA GCTGACAAAA AAGCTACTGA AAAAAGTAAG
CCATTAGCTA TTAAACCTTC GGCTGAAGGT AAGGTTTTTG TAAAGCGTTT TACCTTTTCT
GGTTATGAGG GCACAGTGTC GCAAGATGAG TTGCAGAATA TGGTAAAGCC TTATGTTGGC
AAGCAATTTA GTATGGAGCA ACTTGATGCG GTGTCTGCCA ATATCACTTC TGAGCTGCGT
GCAAAAGGAT GGTTGGCATT AGCAACCCTT CCACCGCAAG ATGTTACCTC TGGTACAGTT
CATGTGGCTA TTAACACTGG TAAAGCTGCC ATGACCTCTA TTACGAGCGA TGGATCAATT
CGCATTTGCA AGCGTCCGCT TCGCCAAATT GCTGAAAAAA CCTGCCCTCC CGGTTCTCCC
CTTAATACTA ATGATCAAGA GCGTGCTGTG CTTTTGATGA ACGATATTCC TGGTATTGCA
GCCACCACAT CGCTTTCAAA AGGAATGCAG GCTGGTACTA CCGACGTTAA TTATCTCATT
CACGAAGGTG CATTGCTTTC AGGCGTTTTG TGGGCTGATA ATTATGGCAA CCGCTACACT
GGCTCGTTGA TGCAATATGC CGTGCTTAAT ATTAACGATC CTTTCCACTG TGGCGAGCAA
ATTATGCTTA ATGCTGCTCA TTCGGCTGGT ATGTGGCGAG GTGGCGCGAA TTATAGCGTG
CCAATGCCCT TCCTTTTTGC AGGTTTAACG GGTCATGCCG GTGTTTCGGG AATGCAATAT
GAATTGCTTG AGGAGCTTGA AGTGCTTGAT TATAAAGGCA CGAGCGTTAA AGCTGATGCT
GGGTTCAGTT ACGCTTTGCA TCGTAGTCGT AAAGCCAATC TTACCTCTGA TGTTTCCTAC
ACATACAAAG GTTTAAAAGA CAGGATGAGC AACACCGATT TGCGTGATGG CACCATTCAA
TTTGTAACCT TTGGTTTATC GGGAAATTAC CACGACGACC TCTTTTTTGG CGCTTTAACA
ACGGCTGATG TAAGCATTAC TAAGGGTTCG CTTGATGAGA AAATTCGTGA TATCCACTTA
AGCGGCGCTC AAGGTGGTTA CACGCGGTTT AATCTGGAGC TTACGCGTTA TCAGCGCTTT
TCGGAACCTT GTGCACTCGA TCTCACTTTT TCTGCCCAAC ACACGTTAAA AAATCTTGAT
AGCAGCGACA AATTCTACCT TGGTGGTCCA TACACTGTTC GTGCTTATCC GCTTGGTGAG
GCGGCAGGCG ATCACGGTGC GCTCTTTAAG GCTGATTTAC GCCACCGCAT TCCTGTACCG
GCTGAGTGGG GCGATATGTT TGTTAACGCA TTTTATGATG TGGGCCATGT TACACTCAAT
AAAGATCGCT ATGCGGGTGA TTCGGCTACA ATGAACGCAA CTGGTAGTAA CGATTACTGG
CTGCAAGGTG CGGGTGTTGG TCTCCGCTAC GATATTTCAG AAACCTTCAC CCTTCAAGGG
TGCTGGGCGC ACACCATTGG CAAAAATTCT GGTCGCGCAT TTGATGGCAA TAACTCTGAT
GGCAAGAGCG ATAATCATCG CTTTTGGGTT CAGGGACTTA TGAATTTCTA A
 
Protein sequence
MVPKIITSLV AGSVVFSASL QAAPLVPNAG SLQQQQRPAA VSKQFKQNVQ ADKKATEKSK 
PLAIKPSAEG KVFVKRFTFS GYEGTVSQDE LQNMVKPYVG KQFSMEQLDA VSANITSELR
AKGWLALATL PPQDVTSGTV HVAINTGKAA MTSITSDGSI RICKRPLRQI AEKTCPPGSP
LNTNDQERAV LLMNDIPGIA ATTSLSKGMQ AGTTDVNYLI HEGALLSGVL WADNYGNRYT
GSLMQYAVLN INDPFHCGEQ IMLNAAHSAG MWRGGANYSV PMPFLFAGLT GHAGVSGMQY
ELLEELEVLD YKGTSVKADA GFSYALHRSR KANLTSDVSY TYKGLKDRMS NTDLRDGTIQ
FVTFGLSGNY HDDLFFGALT TADVSITKGS LDEKIRDIHL SGAQGGYTRF NLELTRYQRF
SEPCALDLTF SAQHTLKNLD SSDKFYLGGP YTVRAYPLGE AAGDHGALFK ADLRHRIPVP
AEWGDMFVNA FYDVGHVTLN KDRYAGDSAT MNATGSNDYW LQGAGVGLRY DISETFTLQG
CWAHTIGKNS GRAFDGNNSD GKSDNHRFWV QGLMNF
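
Both sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before computing a length or a GC fraction. The following is a minimal sketch of that cleanup, assuming the sequence has been copied into a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGGTTCCAA AAATTATAAC")
print(len(fragment))
print(gc_fraction(fragment))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.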