Gene Cag_1562 details


Gene Information

Locus tag: Cag_1562
Symbol:
ID: 3746562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2047657
End bp: 2048772
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 40%
IMG OID: 637774102
Product: hypothetical protein
Protein accession: YP_379860
Protein GI: 78189522
COG category: [S] Function unknown
COG ID: [COG4804] Uncharacterized conserved protein
TIGRFAM ID:
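
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information record.
START_BP = 2047657
END_BP = 2048772
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2048772 - 2047657 + 1 gives 1116 bp, and 1116 / 3 - 1 gives 371 aa, which matches the listed values.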


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAAA AAGAAGTAGT TGCTCATGCA ACGTTTGTGC AGCTTGTTGA AAGCATTCGT 
AATGTTCACC AAGAGCTTAT TGCACAAGCC AATAGGGCGG TTAATGTGAG CCTTACCTTG
CGTAATTGGT TGATTGGTTA TTACATCGCT GAATATGAGT TGCAAGGCAA AGATAGAGCG
GAGTATGGCG ACCGCTTATT TAGTGAACTT GCTCGTGCGT TAAAGTCGTT GAGTAACTGT
AACCGTCGTC AACTCTATCG TTATTATCGT TTTTATACAT TTTATCCTAT AATTGTAGAA
TTACTCCCCC CACAATTCAA GTCGTTATCG TTATTGTCGT CAATAGAAAT AGTGGGGACA
GTGTCCCCAC TATCCCGGCC ATCATCCACA GCGTCATTAA ATATCGCAAA AAAGCTTAGT
TACAGCCATT TTGAAGAACT TATCGCTCTT GACGATCCAA CCAAACGAGC TTTTTACGAA
GTGGAGTGCA TTCGAGGCAA TTGGTCGGTG CGTGAGCTAA AACGTCAAAT TGGTAGCCTT
TATTATGAGC GCACAGGGCT TTCATTCAAT AAAACAAAAC TTGCGGAGCT TACCCTGCAA
GAGAGGGAAA TGCAACCTCT TTTTAATATT CGTGATCCTT ACATTTTTGA GTTTCTTGGT
TTAAAACCTG TTGAGGTAAT GAGTGAATCT CATGTAGAGC AACAGCTTAT TGAAAAGCTA
CAAGATTTTT TGCTTGAGCT TGGTCACGGC TTTTGTTTTG AAGCACGTCA AAAGCGTCTG
CTTATTGGCG ATGAATATTT TTTTATTGAT TTGGTTTTTT ACCATCGTCT TTTAAAATGC
CATGTATTGG TTGAGCTAAA GTTGGATCAT TTTAAACATG AGCATCTTGG GCAACTTAAT
ACGTATGTTA GTTGGTATCG TCAGCATGTT ATGAGCAAGG GTGATAATCC TCCTATTGGA
ATGTTGCTTT GTACCAGCAA AAATAATTCG CTTGTTGAGT ATGCCTTGGC AGGTATGGAT
AATCAGCTAT TTGTTTCGCA ATATCAGCTT GAACTACCCA AAAAAGAAGA GATGCAAGAA
TTTATAGCAA CGCAGTTACG GGAGCTTGGT GAATGA
 
Protein sequence
MEQKEVVAHA TFVQLVESIR NVHQELIAQA NRAVNVSLTL RNWLIGYYIA EYELQGKDRA 
EYGDRLFSEL ARALKSLSNC NRRQLYRYYR FYTFYPIIVE LLPPQFKSLS LLSSIEIVGT
VSPLSRPSST ASLNIAKKLS YSHFEELIAL DDPTKRAFYE VECIRGNWSV RELKRQIGSL
YYERTGLSFN KTKLAELTLQ EREMQPLFNI RDPYIFEFLG LKPVEVMSES HVEQQLIEKL
QDFLLELGHG FCFEARQKRL LIGDEYFFID LVFYHRLLKC HVLVELKLDH FKHEHLGQLN
TYVSWYRQHV MSKGDNPPIG MLLCTSKNNS LVEYALAGMD NQLFVSQYQL ELPKKEEMQE
FIATQLRELG E