Gene Cag_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0844 
Symbol 
ID3746803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1175145 
End bp1176338 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content52% 
IMG OID637773373 
Producthypothetical protein 
Protein accessionYP_379152 
Protein GI78188814 
COG category[S] Function unknown 
COG ID[COG4924] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTGGA CAACCCCCGC CGAACTGAAA CGTCAGGTGC AAAAGCTCTG GGATCGAGGC 
ATGTTGCTTG CCACCTTTTG TAATGGTAAG GCACTATTTC CCCGTCGCCT CATGCTGAAA
GCGCCTGATG CCCGCCAATT GAGTACCTCC TTTCCCGAAG TGCGCGAATG GATTGCCCAA
CTTTCAAATG CAGCAAAACA CTACCGCATC GTATGGCGCA CCATCAACCA CCGCATTTTG
GGAGCAAATG AACTTCCTGC TGAAATTTGG ATTGATTCGC TTGACAATGC ACTCTTGCTG
ATTGGCAAAC AACGAGAAGC TCAGCAGTTT GCCGCCATGG TTACGCTTAC CCGCACCATG
CAACCCGCTC TTCTGCCATG GCTTGAAAAA CGCCCGTTAC GTGCGCTTGA ATTAGCCCCA
GAGTGGCATC GCCTGCTCTC CATTGTGGCA TGGCGCATAA CACATCCAAA ACCAGCAATC
TACCTGCGCC AAATTGACCT GCCCGGCATC CACAGCAAAT TTATCGAACA GCACCGAGGC
GTACTTGGGG AACTCTTCGA TCTTGTCCTT CCTCCGGAAG AGATTGATAC CACAGCGATT
GGTGTTGGAG GATTCTGCCG CCGTTACGGC TTTCAGGACA AACCCCTGCG TGTTCGCTTC
CGCATTCTCG ACCCAGCACT CGCGCTGCTG CCGACGGTCA GCGATCACGA TATTACCGTA
ACGCAAGCAA CCTTTGCCTG CTTAGAAATA GCGGTTACAA AAGTCTTCAT CACCGAAAAC
GAAATCAACT TTCTCGCCTT TCCCAATGTT CCGCAAGCAA TGGTGATTTT TGGAGCTGGC
TATGGTTTTG AAAATTTAGC CTCAGTCAAA TGGTTGCATG ATTGCGCTAT CCATTACTGG
GGCGACCTTG ACACCCACGG CTTCGCCATC CTCAACCAAT TGCGCAGATT CTTTCCACAT
GCAACCTCGT TTCTAATGGA TAGCAAAACG CTGATGGAGC ATCAAGCGCT TTGGGGCATT
GAACCGTCTC CCGAAACCGG CGAACTCACG CGACTGACCG CTGAAGAGAG TGCGCTGTAC
GATCAGTTGC GGCAGAATGA GTTAGGTCAT CACATTCGTT TAGAGCAGGA GAGGATTGGG
TTTGAGTGGC TGGTTGGGGC GCTGGGGAGG GGGACAGAGA AAGCGGCTGT TTAA
 
Protein sequence
MSWTTPAELK RQVQKLWDRG MLLATFCNGK ALFPRRLMLK APDARQLSTS FPEVREWIAQ 
LSNAAKHYRI VWRTINHRIL GANELPAEIW IDSLDNALLL IGKQREAQQF AAMVTLTRTM
QPALLPWLEK RPLRALELAP EWHRLLSIVA WRITHPKPAI YLRQIDLPGI HSKFIEQHRG
VLGELFDLVL PPEEIDTTAI GVGGFCRRYG FQDKPLRVRF RILDPALALL PTVSDHDITV
TQATFACLEI AVTKVFITEN EINFLAFPNV PQAMVIFGAG YGFENLASVK WLHDCAIHYW
GDLDTHGFAI LNQLRRFFPH ATSFLMDSKT LMEHQALWGI EPSPETGELT RLTAEESALY
DQLRQNELGH HIRLEQERIG FEWLVGALGR GTEKAAV