Gene Cag_1301 details

Gene Information

Locus tag: Cag_1301
Symbol:
ID: 3747449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1766885
End bp: 1769713
Gene Length: 2829 bp
Protein Length: 942 aa
Translation table: 11
GC content: 49%
IMG OID: 637773838
Product: excinuclease ABC subunit A
Protein accession: YP_379604
Protein GI: 78189266
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
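
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions that happen to match the values listed here. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1766885, 1769713)
print(length)                  # 2829
print(protein_length(length))  # 942
```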


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0773804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCTC ACGGGCAACT GACCGACACC TCCTTACCGG ATATTGTGCT GAAAGGCATT 
AACACCCACA ACCTCCGCAA CATTTCCGTT CGCATTCCTC GCAATAAATT TATTGTTATA
ACGGGCGTTA GTGGCTCAGG CAAATCAAGC CTTGCTTTCG ACACCCTTTA CGCCGAAGGG
CATCGCCGTT ATGTGGAATC GCTCTCGGCG TATGTTCGCC AATTTCTTGA GCGAATGCCT
CGCCCCGATA TTGAGCACGT TGAAGGCATT GCGCCTGCCA TTGCTATTGA GCAAAAAGCA
CTCCCTAAAA ATCCCCGCTC AACCGTTGGC ACCGTGTCGG AAATTTATGA CTACCTCCGC
TTGCTCTATG CCCGCATTGG TAAAATTTAC TCGCGCGACA CCAACGAGTT AGTGCTCAAG
CACACACCCG ATGACGTCAG CTTGCAAGCA GGTTTTATTG AGGATGGCAA AAAATTTTAT
GTGGGATTTT TTTTTCCTCA CCATCATACC GCTCAACAGC TCGACTGCTC GCCCGAAGAG
GAAATTGCAA ATCTCCTGAA AAAAGGCTTT TTCCGCTTGC TTGCAGGCGA TGAGCTGCTT
GACCTAAACC AAGAAGCTGA CTACCAAAAA GTGCTCGACA TGCCCGCTAA GGTTCGCGCT
GAACTCTTAG TGGTGGTTGA CCGCTTTGTT GCCCGCAATA ACGACAAACT CTTTAGCCGC
ATTTCGCAAG CTGCCGAAAG CAGTTTTATG GAATCGGGCG GACACGCAGT GCTAAAAGTA
GTTGACGGCA AAACCTACCG CTTTAGCGAT CGCCTTGAGC TGCACGATAT TGAGTATCAA
GAGCCTTCGC CCCAACTCTT TGCCTTTAAC TCCCCCATTG GCGCTTGCAC CACCTGCCAA
GGTTTTGGGA GAATTATGGG AATTGATGAA GATGCCGTTA TTCCCGATAA ATCACTTTCC
ATTGAAGAGG GAGCAATTGC TTGCTGGAAT TCTGAAAAAT ATCGCTGGAA TTTATTGGAG
CTGATGCACT ATGCGCCGAA GTTTGGTGTT CCACTACGAG AGCCTTACGA AAAGCTCACC
TTTGAACAAA AAGAGATTAT TTGGAAAGGA ACTCCTGACG GAAGCTTTAA TGGCATTCGC
GCTTTTTTTG CGGAAATAGA AAAAGATGCC GGTTACAAAA TGCACTACCG CGTTTTTTTA
AGCCGCTACC GAGGCTACGC CATCTGCCCC GATTGCGAAG GAAGCCGCTT AAACCCCGAT
GCTCTTCAGG TAAAAATTTC AGGACGCCAC ATTGGCGAAG TAACTCGCAT GAGCATTGGC
GAAGTGGCTG AATTTTTCCG CAACCTCAAC ATCTCCCCCT TTGACCGCTC GGTAGCTGAA
GTGATTTTGC AAGAAATTAA TCGCCGACTT GGCTACTTGC TCGACGTAGG ACTTGATTAC
CTCACGCTTG ACCGCTTAAC CCACACCCTA AGCGGCGGCG AATTCCAACG CATCAACCTC
TCCACCTCGC TTGGCTCACC GCTTGTAGGC ACCATGTACA TTCTTGACGA ACCAAGCATT
GGGCTACACC AAAGCGACTC CGCACGCTTG ATTGCGCTGC TCCGCAAATT ACGCGACCTT
GGCAACACCG TTGTGGTAGT TGAGCACGAC CGCGAAATTA TTGAAGCCGC CGATGAGGTG
ATTGATCTTG GACCATTTGC TGGACGGCTT GGTGGCGAAG TAGTATTTCA AGGCAGCATG
GAAGCCATGC GCTCATCGGG CACCTCGCTC ACTGCACAAT ACATGAATGG CGAACAACAA
ATTGAGGTAC CCCAACAGCG CCGCACGGTT GATTTCTCCG CCTGCATTAC CATCAGCGGT
GCCATGCAAA ACAACCTCAA AAACATTGAT GTTCAAATTC CGCTTAAAGT AATGACCTGC
ATAACCGGCG TTAGTGGTTC AGGCAAATCA ACCCTCATTA ACGATATTCT TTGCAAAGGC
ATTCTCCGCG AAAAACATGG AAGCCGTGGC ACCGTAGGCA CCCACCGCTC GCTAACAGGC
GCATGGCTCA TTGACCGCAT TGAGCACGTT GATCAATCGC CCATTGGCAA GTCAAGCCGT
AGCAATCCGG TTACCTACAT GAAAATTTTC GACGACATCC GCACCCTTTT TGCCAACACG
CCCGATGCTC GCAAGAAAAA AGTAAAAGCA GGCTACTTCT CTTTTAACAT TCCCGGCGGT
AGGTGCGAAG TGTGCTCGGG CGAAGGCAGC GTGCATATTG AAATGCAATT TCTTGCCGAC
ATTGAAGCCG TATGCGAAGC CTGCAACGGA CTTCGCTACC AACCCGAAGC GCTTGCCATT
AAGTTCAACG GTAAATCCAT TGCCGAAGTG CTCGACATGA CGGTAAGCGA AGCACTGAGC
TTTTTTAAAG GCGAAAAAAA CATTGTAAAA AAACTCAGCG TTCTCGATCA AGTAGGACTT
GGCTACATAC GTCTTGGGCA ATCCTCCAGC ACCTTCTCAG GCGGCGAAGC ACAACGCTTG
AAGCTTGCCA CCTTTATTGC CCACGCCGAC ACCACTCACA CGCTTTTCGT GTTTGATGAA
CCAACCACAG GACTACATTT TGAGGATATT AAAAAGCTCA TCCTTTGCTT TGAAAAGCTC
CTTGAGCAAA ACAACAGCCT TATTATTATT GAGCACAATC TCGATATTAT TAAGCAAGCT
GATTGGGTAA TTGATTTAGG ACCAGGCGCA GGCGATAAAG GTGGGCACTT GGTAGAACAA
GGCACACCCG AAGAGGTTGC TCAATGCACT GAATCACTGA CGGGGCAATA TTTGCGAGGG
GTGGTATAA
 
Protein sequence
MNAHGQLTDT SLPDIVLKGI NTHNLRNISV RIPRNKFIVI TGVSGSGKSS LAFDTLYAEG 
HRRYVESLSA YVRQFLERMP RPDIEHVEGI APAIAIEQKA LPKNPRSTVG TVSEIYDYLR
LLYARIGKIY SRDTNELVLK HTPDDVSLQA GFIEDGKKFY VGFFFPHHHT AQQLDCSPEE
EIANLLKKGF FRLLAGDELL DLNQEADYQK VLDMPAKVRA ELLVVVDRFV ARNNDKLFSR
ISQAAESSFM ESGGHAVLKV VDGKTYRFSD RLELHDIEYQ EPSPQLFAFN SPIGACTTCQ
GFGRIMGIDE DAVIPDKSLS IEEGAIACWN SEKYRWNLLE LMHYAPKFGV PLREPYEKLT
FEQKEIIWKG TPDGSFNGIR AFFAEIEKDA GYKMHYRVFL SRYRGYAICP DCEGSRLNPD
ALQVKISGRH IGEVTRMSIG EVAEFFRNLN ISPFDRSVAE VILQEINRRL GYLLDVGLDY
LTLDRLTHTL SGGEFQRINL STSLGSPLVG TMYILDEPSI GLHQSDSARL IALLRKLRDL
GNTVVVVEHD REIIEAADEV IDLGPFAGRL GGEVVFQGSM EAMRSSGTSL TAQYMNGEQQ
IEVPQQRRTV DFSACITISG AMQNNLKNID VQIPLKVMTC ITGVSGSGKS TLINDILCKG
ILREKHGSRG TVGTHRSLTG AWLIDRIEHV DQSPIGKSSR SNPVTYMKIF DDIRTLFANT
PDARKKKVKA GYFSFNIPGG RCEVCSGEGS VHIEMQFLAD IEAVCEACNG LRYQPEALAI
KFNGKSIAEV LDMTVSEALS FFKGEKNIVK KLSVLDQVGL GYIRLGQSSS TFSGGEAQRL
KLATFIAHAD TTHTLFVFDE PTTGLHFEDI KKLILCFEKL LEQNNSLIII EHNLDIIKQA
DWVIDLGPGA GDKGGHLVEQ GTPEEVAQCT ESLTGQYLRG VV
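
The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the sequences are stored in the 10-character groups shown above and that the gene begins with ATG. It uses the standard amino acid assignments, which translation table 11 shares with table 1; table 11 additionally allows alternative start codons, which this sketch does not handle. The names `clean` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string
# (the same assignments are used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAACGCTC ACGGGCAACT GACCGACACC"))  # MNAHGQLTDT
print(translate("GTGGTATAA"))                          # VV
```

The two outputs match the first ten and last two residues of the protein sequence listed above.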