Gene Cag_1341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1341 
SymboluvrC 
ID3746856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1808936 
End bp1810825 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content45% 
IMG OID637773879 
Productexcinuclease ABC subunit C 
Protein accessionYP_379644 
Protein GI78189306 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCAC TTGATGCGCT CGAAAAGCAT GGTGATATAA AAAAAGTATT AACCGAAAAA 
CTTGCAACAC TACCAACTTC ACCCGGCATC TATCAATTTA AAAATAGTGC AGGACGCATT
ATTTACGTTG GCAAAGCTAA AAACCTCCGC AATCGCGTTC GCTCCTACTT TAGAAATAGC
CATCAGCTTT TTGGCAAAAC CTTAGTGTTA GTTAGCCATA TTGACGACCT CGAAGTAATT
ATTACCTCAT CAGAAGTTGA AGCACTTATT CTTGAAAACA ATCTTATAAA AGAACTTAAA
CCCCGCTACA ACGTTAACCT TAAGGACGAC AAAACCTATC CCTACCTGGT AATTACCAAC
GAACCTTATC CACGCATTCT TTTTACCCGA CATCGCCGCA ACGATGGCTC CATTGCTTTT
GGACCTTACA CCGAAGCACG GCAGTTGCGC TCCATCCTCG ATTTAATTGG CTCCATTTTT
CCTGTACGCA GTTGCAAACT TCGCCTTACA CCCGACGCAA TAGCTTCAGG CAAGTACAAA
GTGTGCCTCG ATTACCACAT CCACAAGTGC AAAGGCGCTT GCGAAGGCTT ACAGCCTGAA
GATGAGTATC GGCAGATGAT TGATGAAATT ATTAAGCTGC TCAAAGGTAA AACCTCAGCG
CTTATTCGCT CGCTCACTGA AAATATGCAC CTCGCTGCAA CTGAACTACG CTTTGAGCAA
GCTGCTGAAA TTAAAGCGCA AATTGAAAGC CTCAAGCGCT ACGCCGAGCG GCAAAAGGTA
GTTGCTGCCG ACATGGTGGA TCGCGATGTG TTTGCCATAG CCGCTGGCGA AGATGATGCG
TGTGGGGTAA TTTTTAAAAT TCGAGAAGGC AAATTGCTTG GCTCACAACG CATTTACATT
AACAATACCA ATGGCGAAAG CGAAGCCTCC ATGCAATTGC GCATGTTGGA AAAATTTTAT
GTAGAGAGTA TTGAACCTGT GCCCGATGAA ATTCTGTTAC AAGAGGCACT AAGCGAAGAG
GAGGAGGAGA CGTTACGGGC GTTTTTGCTT GTAAAAGCAA AAAATGAAGG GCAGGAGAAA
AAAGGTATTC GTCTGGTTGT GCCACAAATT GGCGATAAAG CGCATTTGGT TGGCATGTGC
CGCCAAAATG CACGCCACCA TTTGGAAGAG TACCTCATTC AAAAGCAAAA ACGGGGTGAA
GCCGCTCGTG AGCACTTTGG GCTAACCGCA CTTAAGGAGC TTCTACACCT CCCTACGCTA
CCACAGCGCA TTGAGTGTTT CGACAACTCA CACTTTCAAG GCACCGATTA CGTTAGCTCA
ATGGTTTGTT TTGAAAAAGG CAAAACGAAA AAGTCGGATT ACCGAAAGTT TAAAATTAAA
ACCTTTGAAG GCTCCGATGA CTATGCAGCA ATGGATGAAG TGCTTCGGCG GCGCTACAGT
GGTTCATTAA CCGAATCGTT AGCGTTGCCT GATTTAATTG TGGTGGATGG CGGTAAAGGG
CAGGTAAACA CGGCTTATAA AACATTGCAA GAGCTTGGCG TAACCATTCC CGTGATTGGC
TTAGCAAAAC GCATTGAGGA AATTTTTACC CCTCACTCCT CCGATCCTTT TAATTTGCCA
AAAACCTCAC CAGCGCTGAA GCTTCTTCAA CAATTGCGCG ACGAAGCGCA CCGCTTTGCC
ATTACCTATC ATCGTAAGCT ACGGAGCGAC CGCACCTTAC AAACCGAGCT CACTACCATT
GCAGGCATTG GCGAAAAAAC AGCTTTTAAG CTCCTTGAAC ACTTTGGCTC AGTTGAAAGC
GTTGCCCAAG CATCGCGTGA AGAGTTGCAG GCAGTAATAG GCGCTAAAGC AGGTGAAACG
GTTTACACCT TTTATCGCCC TGAAGGGTAA
 
Protein sequence
MEPLDALEKH GDIKKVLTEK LATLPTSPGI YQFKNSAGRI IYVGKAKNLR NRVRSYFRNS 
HQLFGKTLVL VSHIDDLEVI ITSSEVEALI LENNLIKELK PRYNVNLKDD KTYPYLVITN
EPYPRILFTR HRRNDGSIAF GPYTEARQLR SILDLIGSIF PVRSCKLRLT PDAIASGKYK
VCLDYHIHKC KGACEGLQPE DEYRQMIDEI IKLLKGKTSA LIRSLTENMH LAATELRFEQ
AAEIKAQIES LKRYAERQKV VAADMVDRDV FAIAAGEDDA CGVIFKIREG KLLGSQRIYI
NNTNGESEAS MQLRMLEKFY VESIEPVPDE ILLQEALSEE EEETLRAFLL VKAKNEGQEK
KGIRLVVPQI GDKAHLVGMC RQNARHHLEE YLIQKQKRGE AAREHFGLTA LKELLHLPTL
PQRIECFDNS HFQGTDYVSS MVCFEKGKTK KSDYRKFKIK TFEGSDDYAA MDEVLRRRYS
GSLTESLALP DLIVVDGGKG QVNTAYKTLQ ELGVTIPVIG LAKRIEEIFT PHSSDPFNLP
KTSPALKLLQ QLRDEAHRFA ITYHRKLRSD RTLQTELTTI AGIGEKTAFK LLEHFGSVES
VAQASREELQ AVIGAKAGET VYTFYRPEG