Gene Cag_1054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1054 
Symbol 
ID3747035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1429504 
End bp1431234 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content46% 
IMG OID637773583 
Productactivation/secretion signal peptide protein 
Protein accessionYP_379359 
Protein GI78189021 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000217377 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCCAA AAATTATAAC ATCGCTTGTA ACGGGGAGTA TTATGTTATC GGCTTCTCTT 
CAAGCCGCTC CGTATGTACC TGATGCGGGC AGCTTACAGC AGCAGCAACG TCCAGCAGCC
GTCTCTAAGC AGAAAAAACA AATGGTTCAA GACAACAAAG GAAAGAGTGA GCCCAGCAAG
CCCTTTGTTA TTAAGCCATC GGCAAATGCT AAGGTTCCCG TTAAGCGCTT TACTTTTTCA
GGGTATGAAG GAACCGTGTC GCGAAGTGAG TTACAGGATA TGGTAAAGCC TTATGTCGGT
AAAAACCTTA GTATGGAGCA GCTCCATGCT GTTTCTGCAA ACATCACTTC TGAACTTCGT
GCAAAAGGAT GGTTGGCATC AGCAACCTTG CCGCCGCAAG ATGTTACCGC GGGCACCGTG
CATATTACTA TCAATAGCGG TAAAACGGCA ATGACCTCCA TTACAGGCGA TGAATCGGTT
CGCATTTGCG AGCGTCCACT TCGCCAAATT GCTGAAAAAA CCTGTCCTTC AGGTTCTCCT
CTTAATACTG ACGATCAAGA GCGTGCGGTG CTTTTGATGA ATGATATTCC CGGCATTGCT
GCCACCACAT CGCTCTCAAA AGGGATGCTG GCAGGCACAA CCGATGTTAA TTACCTTATT
CGCGAAGGTG CCTTGCTTTC AGGTGTGCTT TGGGGCGACA ACTATGGTAA CCGTTACACC
GGTACATGGA CCCAAAATGC TGTATTGAAC ATTAATGATC CTATTCACTA TGGCGAGCAA
TTCTCGCTTA ATGTTGGTCA TTCGGCTGGT ATGTGGCGAG GTGGCGTAAA TTATCGGGTG
CCAATGCCAT TTCTTTTTGC AGGCTTAACT GGTCATACGG GTGTTTCGGG AATGCAATAT
GAGTTGCTTG AGGACTTTGA AGTGCTTGAT TACGAAGGTA GCAGCATTAA TGTTGATGCT
GGATTAAGTT ACGCATTGCT TCGTAGCCGT AAAGCAAATC TTACCTCCGA TGTTTCTTAC
ACCTACAAAG GCTTAAAAGA CTCGATGGGT AACACCGATT TACGCGATGG CACTATTCAA
AGTGTAACTT TTGGTTTATC GGGCAATTAC CGCGATGACC TCTTTTTTGG AGCGTTAACA
ACGGCTGACT TAAGCATTAC AAATGGTTCG CTTGAAGAGA AAATTCGCGA TATCAGCTTA
AGTAATTCGG AAGGCGGTTA CACCCGCTTA AACATGGGGT TAGCTCGTTA TCAACGCTTT
TCTGAACCAT TTGTGCTTGA CCTCGCCTTT TCTGCCCAAC GTGCATTAAA TAATCTTGAT
AGCAGCGAAA AATTCTTCCT TGGCGGTCCA CAGCGTGTTC GCGCTTACCC GCTTGGTGAG
GCGGCGGGCG ATCACGGTGC ACTTTTTAAA GCCGACTTCC GTCACCGCAT TTCTGTACCA
GAGGAGTGGG GCGATATGTT TGTTAACGCC TTTTACGATG CTGGTCATGT TACCCTTAAC
AAAGATCGTT ATGCAAGTGA TTCTGCAACT ATCACCGCTA CCGGTCGTAA CGACTATTGG
TTGCAAGGTG CAGGTTTAGG GCTTCGCTAC GATATTTCGG AAAACTTTAC CCTACAAGGG
TGCTGGGCAC ATACCATTGG TAAAAACTCT GGTCGCTCGG TGGATGGCAA TAACTCTGAT
GGCAAGAGCG ATAACAATCG CTTTTGGGTT CAGGGGCTCT ACTATTTCTA A
 
Protein sequence
MVPKIITSLV TGSIMLSASL QAAPYVPDAG SLQQQQRPAA VSKQKKQMVQ DNKGKSEPSK 
PFVIKPSANA KVPVKRFTFS GYEGTVSRSE LQDMVKPYVG KNLSMEQLHA VSANITSELR
AKGWLASATL PPQDVTAGTV HITINSGKTA MTSITGDESV RICERPLRQI AEKTCPSGSP
LNTDDQERAV LLMNDIPGIA ATTSLSKGML AGTTDVNYLI REGALLSGVL WGDNYGNRYT
GTWTQNAVLN INDPIHYGEQ FSLNVGHSAG MWRGGVNYRV PMPFLFAGLT GHTGVSGMQY
ELLEDFEVLD YEGSSINVDA GLSYALLRSR KANLTSDVSY TYKGLKDSMG NTDLRDGTIQ
SVTFGLSGNY RDDLFFGALT TADLSITNGS LEEKIRDISL SNSEGGYTRL NMGLARYQRF
SEPFVLDLAF SAQRALNNLD SSEKFFLGGP QRVRAYPLGE AAGDHGALFK ADFRHRISVP
EEWGDMFVNA FYDAGHVTLN KDRYASDSAT ITATGRNDYW LQGAGLGLRY DISENFTLQG
CWAHTIGKNS GRSVDGNNSD GKSDNNRFWV QGLYYF