Gene Cag_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0021 
Symbol 
ID3747891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp21262 
End bp22461 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content48% 
IMG OID637772545 
Producthypothetical protein 
Protein accessionYP_378343 
Protein GI78188005 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0629415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTCTG AAGAATTGCA ACAGCTTCTC ACGCCTGAAG CGCAAGCAAT GCTGCAAGCG 
CACCAGCACG ACAATCCTAC AACGTTTGCC TTGCGTTATT CCAATCGCCA CGACTTGCCA
ATTCGTGCGC TTGCGGAGCA ACTTGCCTGC CGTCGCAAAG CTGAACGTAA GCTTCCCACG
CTTTCGCGCC ACAACCTTCT CTACACAACG CTCTCGCTTG AGCAAGCTTC AAGCGAACGC
ACGGCACGTT TTAAATGCAC CTTCATGCAA GGAAAGCGCT GCATTGATTT GAGCGGTGGT
TTGGGGATTG ATGCCATCTT TTTAGCCGCT CATTTTGAGG AGCTACTTTA TTGCGAACGC
AATGAACTGT TGTGCAACGT GGTTCGGCAC AATATGGTGC GTTGCGGGAT TGGCAACGTT
CGATTGCAGC AAGGCGATAG TCTCAGCTTT TTAGCAAGTC AGCCCGATAA TGCCTTTGAT
TGGATTATGG TTGATCCCGC TCGTCGTGAG GAGGGGAAAC GCTCCATTGG GTTGGAGGCA
GCAAGTCCCA ATGTGGTGGC ATCTCAGGAA TTGTTGCTTG CCAAAGCGCC ACACATTTGC
ATTAAAGCCT CGCCAGCCCT TGAAATCAGC AATCTTAAAA TGCTCTTACC TGCGCTCCAT
ACCATTTTGG TAGTTTCGGT TTCGGGTGAA TGCAAAGAAA TTTTATTGCT CTTAAAGCGA
GGGGCTGAAG CTGAACATCC AATTACGAAA GCAATCTGTT TGCAAGCCGA CAATAATGCG
GTTGTAGAGA TTGTTGGAAC GCATGAACAG CATCGTTCAC TTGCTGAATC TCTGCAATGT
TACTTGTATG AACCTGATGC GGCAATTATT AAAGCGCGAC TTAGCGGAGT GGTCGCTAAG
CAAGAGGGGT TAGAATTTCT TAATAAGAGC GTTGATTATT TAACAAGCAA TCATGTTGTT
GCAAGTTTTG CAGGTAAAGT ATTTCAAGTG ATTGAAAGCG TGCCCTACAA GCCAAAAGAG
TTTCGGAAGT TTTTGGATCG CCACGCTATC AGCGCCGCCA GCATTCAGCG GCGTGATTTT
CCCCTTTCAG CCGATGAGTT ACGCAAGAAG TTCCGCTTGC GCGAAGATGA AAAGCATTTT
CTCATTTTTA CCCGCAACCG CAACGCTGAG CCTATTTGCA TTTACGCTGA GCGCTGTTGA
 
Protein sequence
MTSEELQQLL TPEAQAMLQA HQHDNPTTFA LRYSNRHDLP IRALAEQLAC RRKAERKLPT 
LSRHNLLYTT LSLEQASSER TARFKCTFMQ GKRCIDLSGG LGIDAIFLAA HFEELLYCER
NELLCNVVRH NMVRCGIGNV RLQQGDSLSF LASQPDNAFD WIMVDPARRE EGKRSIGLEA
ASPNVVASQE LLLAKAPHIC IKASPALEIS NLKMLLPALH TILVVSVSGE CKEILLLLKR
GAEAEHPITK AICLQADNNA VVEIVGTHEQ HRSLAESLQC YLYEPDAAII KARLSGVVAK
QEGLEFLNKS VDYLTSNHVV ASFAGKVFQV IESVPYKPKE FRKFLDRHAI SAASIQRRDF
PLSADELRKK FRLREDEKHF LIFTRNRNAE PICIYAERC