Gene Cag_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0020 
Symbol 
ID3747890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp19790 
End bp21253 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content50% 
IMG OID637772544 
Producthypothetical protein 
Protein accessionYP_378342 
Protein GI78188004 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACGCC CGATGAGTGC CCCAAATAGT GCCCTAATGA GCTCCAAAAT TAGTTCCCCC 
ATCAGTCCAA CAACTCTCCA TCATCTGCTT GATATTCCGC TTACTACGCC GCTTACGCTG
GAGGCAATGT GCCGCAATCT TCGTCCAATT TACTATCCTG CGCCGCACCT ATCGCCACGC
TTAGAGCGCT TTAGCTTGGC GCTGGTAGAG GCAATGCAAG GCTTGGGCAT TCAGGTGCAT
TCGCCAGAGG AGTTGGCACT ACACGATGGG CGTTTTCCTG CTGGCACGGT GATTGTTGCG
CCGGGTATTT TTGATGATGA TGCGCTGCCG ATTAACCGCG TCAGTACGCT TTACAACAAC
ATTATTGTTG GCATTTACGA TGAAGCTGCA CCCGTATCGA ACAGCTCATT GCCGCAAGAG
CGGCTTGATG CGATTGTGGG GCGTTTAGCT CGCGATATGG TGCATATTTT AATTTTTGTA
ACCGATGAGT CGTGGACAAT TTGCACCATG AATGGCGGCA TTGCAACCTT TGCTACGCCA
CTGCCACACG TTGCGGATGT GCGCTCTACG TTGGTTCCAA AACTAACGGC GCAAGTAGTT
CCACCCAGAA ATGAGGCGTT TACTTTTGTT GATGGAGCGT TGGATATTGC CTCACCAACG
TTTAGCGCAA TTGCAGAGGA TTTTGTGCAG TGCAGTGCCT TGTGGAGCCA AAGCAGTGCG
CTCCTTACGC ACACTTCAAC CGAAGGGTTA CACTACCGCA ATTCATTTTA TAAACGTATT
GTTGCTCGCT ACCTTGATGA GCGCAGTGGT ATGAGCTATG GCTTTTTTGC ACGCCAATTG
CCTATTCCTA CGCTTCAACC CGCTCAAAAA AAGAAGGCTG ATGGATTGAT GGAAGTACAG
CTTGCAGGTG AGCAATGGTT TGTAGCAATT CCAGAGGTAA GCATTATTAC CACGCGCTCG
GGATGCCGCA AGCATTGCTT AAATCCGCTG GAAGATTTAG TAGCTCTTGG CTTAAAAGAG
GAGCAGGGGA AGCGAGTTGC CTCCATTACC ACACCGTCAA CTTCGTGCAA CACCGTTATT
AAGCCCTCGT TTGATACGTT GGCAATTCTT GCCCATGCGT TGGGCAATGC TATTGTGGGG
AGCATTTTGT TGGTACTTCA GCCCAATGCG CCTTTTTCCC GCCATCTTGC ACGTAACGGT
GCTACCATTA CCCATTGGCA CGGTTATCCG CAAAAGAGCG ATCTTCCCGA TGGCTATTGG
TTGCATGGTG CCGAAAATCC GCCCGTAGCC TGCTCAACCC CGCAATCTGC CGCTTACAGT
TTACTTGGCA AACTTAGTGC TCTTGAGCAA GCGCTCACGC AACAGGGCAT CTATCACGGC
GATGTTCACA CCGAACCGCA TCACGGCACC AACATTGTCG GCATTCTTTC CCTTACCGAG
GTTGCTCGCC ATTTTGCAAG ATAG
 
Protein sequence
MERPMSAPNS ALMSSKISSP ISPTTLHHLL DIPLTTPLTL EAMCRNLRPI YYPAPHLSPR 
LERFSLALVE AMQGLGIQVH SPEELALHDG RFPAGTVIVA PGIFDDDALP INRVSTLYNN
IIVGIYDEAA PVSNSSLPQE RLDAIVGRLA RDMVHILIFV TDESWTICTM NGGIATFATP
LPHVADVRST LVPKLTAQVV PPRNEAFTFV DGALDIASPT FSAIAEDFVQ CSALWSQSSA
LLTHTSTEGL HYRNSFYKRI VARYLDERSG MSYGFFARQL PIPTLQPAQK KKADGLMEVQ
LAGEQWFVAI PEVSIITTRS GCRKHCLNPL EDLVALGLKE EQGKRVASIT TPSTSCNTVI
KPSFDTLAIL AHALGNAIVG SILLVLQPNA PFSRHLARNG ATITHWHGYP QKSDLPDGYW
LHGAENPPVA CSTPQSAAYS LLGKLSALEQ ALTQQGIYHG DVHTEPHHGT NIVGILSLTE
VARHFAR