Gene Cag_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0041 
Symbol 
ID3747240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp45345 
End bp46718 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content54% 
IMG OID637772567 
Productouter membrane protein, putative 
Protein accessionYP_378363 
Protein GI78188025 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.229451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAA ACATCTCCTT TCATAAAAAA ATAAGCGCAA CCTCCTTAGC GCTTTTGCTT 
GCAACATCTT CCATGAGCTA TGCCGTGGAG CCAACCTCTT CGCCATCTAC CGCATTTGCC
GCGCCATCCG TAACACCTCT GACGCCCCTT ACGTTAGCAC AAGCGCTACA AAAAATGCAG
GCGCATTATC CCGCGTTACA CGCTGCAAGC GAAGAGGTGA TGGCGGCTGA CGCGCGTGTG
CGCCAAAGCA AAAGCAGCTT TCTGCCGCAG GTTACCGCTA ATGCGGGCTA TCTTTGGCGC
GATCCCGTTT CGGAAATGAG TTTTGGTGGT GGCACGCCCA TGCAGTTTAT GCCGCACAAC
AACTACCATG CAACGGTTAG CGCCGAGGCG ATTCTTTTTG ATTTTGGGAA GCGCAGCCGC
GAGTTAGCAC TTGCCCAAAG TGGTACGCGC ACGGCAGAGG AGCAAGTAGC GTTAAGCCGT
CGTGAAGCGG CATGGCAGGT GGTGCAGCTT TTTTACGGAA TACTCTTTTT GCAAGAAGAG
CAGCGTGTGC AGCAAAAAGA GTTCCAAGCG CTGAACAAAG CGTTGGAGTT TACCACCAAG
CGGTATCAAG CAGGCACGGC AACCTCGTTT GACCTTGCTA CCACGAAAGC GCGCCTTGCC
GCATTGCAAA GCCGTATGGC TGACAGTGCT CATGCGTTGG AACGGAGCGA AATGCACTTT
TGCCGTTTAA CGGAAATGAA TGCAACGCAG CCGCTTGCCT TGCAAGGCAG CTTGATGGCA
TCGGTTGCAC CATCAAGCAA TCAAGCGCAG TTAACCGAGC AAGCGCTAAA AAATCGGGTT
GAAACTCGCT TAGCGCGTGA AGCCGAAGCG GCGGCGGGGC AGCGTCAAGC ACTTGCGAGC
AAGGGTGGTG CGCCACAGCT TCGGGGCAAT GTGGCGTATG GCGTTGCTAA CGGTTATCAG
CCCGATATTG ATGAAATTCG CACCACGCTT AGTGCAGGCG TTACGCTTGA TGTGCCCATT
TTTAGCGGCT TTCGCACCAC TGCTCGTCAG CAAGAGAGTG CGGCGGCTTT GCGGGCTGCA
ACCCAGCGTC GGTTAGATGC CGAAGCACAA GCGGCTACCG AAGTGGCAGA GTTGCTTAAT
GCGTTGCAGC ACAATGGTGA AAAGCTGAAC GCAACCGCAA TGCAAGCCGA GCAAGCCTCT
TTAGCCGCAA GCCATGCACG GGCGCGTTAC GAAAATGGCA TGGCAACCAC GCTTGATTTG
CTTGATACCG AAGCGGCGCT TTCGCAAGCG GAACTGGCTC GTTTGCAAGC GGCATATGCG
GTAACGCTAA ATCGCTATGC GCTGCAACGA GCAACGGGCG AGGTGTTCTG GTAA
 
Protein sequence
MKQNISFHKK ISATSLALLL ATSSMSYAVE PTSSPSTAFA APSVTPLTPL TLAQALQKMQ 
AHYPALHAAS EEVMAADARV RQSKSSFLPQ VTANAGYLWR DPVSEMSFGG GTPMQFMPHN
NYHATVSAEA ILFDFGKRSR ELALAQSGTR TAEEQVALSR REAAWQVVQL FYGILFLQEE
QRVQQKEFQA LNKALEFTTK RYQAGTATSF DLATTKARLA ALQSRMADSA HALERSEMHF
CRLTEMNATQ PLALQGSLMA SVAPSSNQAQ LTEQALKNRV ETRLAREAEA AAGQRQALAS
KGGAPQLRGN VAYGVANGYQ PDIDEIRTTL SAGVTLDVPI FSGFRTTARQ QESAAALRAA
TQRRLDAEAQ AATEVAELLN ALQHNGEKLN ATAMQAEQAS LAASHARARY ENGMATTLDL
LDTEAALSQA ELARLQAAYA VTLNRYALQR ATGEVFW