Gene Cag_0314 details

Gene Information

Locus tag: Cag_0314
Symbol:
ID: 3748109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 349424
End bp: 351469
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 52%
IMG OID: 637772841
Product: excinuclease ABC subunit B
Protein accession: YP_378630
Protein GI: 78188292
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
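
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 349424
end_bp = 351469
gene_length_bp = 2046
protein_length_aa = 681

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and gives the listed protein length.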


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGAACC GAACCGATAA TGAGTACCAG TTAGTAAGCC CGTATCAGCC CGCAGGCGAT 
CAGCCAAAAG CTATTGAAGC GCTGGTGCAA GGCGTGCGCG ATGGGCGCCA TTGGCAAACC
TTGCTGGGCG TTACGGGTTC GGGCAAAACC TTCACCATTT CCAACGTTAT AGCGCAGCTT
AATCGCCCCG TGTTGGTAAT GAGCCACAAT AAAACCCTTG CGGCACAGCT TTATGGGGAA
CTCAAGCAGT TTTTTCCCCA CAATGCGGTT GAATACTTTA TTAGCTACTA CGACTTTTAC
CAGCCCGAAG CTTATCTCCC TTCGCTCGAT AAATACATTG CTAAAGACCT CCGCATTAAC
GATGAAATTG AACGCCTCCG CTTGCGGGCA ACCAGCGCTT TGCTGAGTGG GCGTAAGGAT
GTGATTGTGG TAAGCTCCGT GAGTTGCATT TACGGACTTG GTTCGCCTGA GGAGTGGAAA
GCGCAAATCA TAAAATTGCG AGCTGGCATG GAAAAAGATC GCGATGAATT TTTGCGCGAA
CTGATTTCAT TGCACTACCT CCGCGACGAT GTGCAACCAA CGTCGGGCAG ATTCCGCGTG
CGGGGCGATA CTATTGACCT TGTGCCTGCC CACGAAGAGC TTGCGTTGCG CATTGAATTT
TTCGGCTCCG AAATTGAAAG CCTTCAAACC TTCGATATTC AAACGGGCGA AATTCTTGGC
GACGATGAGT ACGCCTTTAT TTACCCCGCA CGGCAGTTTG TGGCAGATGA GGAGAAGCTG
CAAGTGGCAA TGTTGGCAAT TGAAAACGAG TTAGCAGGGC GCTTAAACTT GTTACGCTCC
GAAAATCGCT TTGTGGAAGC ACGCCGCCTT GAAGAGCGCA CCCGTTACGA CTTGGAGATG
ATGAAAGAGC TTGGCTACTG TTCGGGCATT GAAAACTACT CGCGCCATAT TTCAGGACGT
CCTGCGGGAG AGCGCCCCAT TTGCCTGCTC GACTACTTCC CTGAAGATTA CATGGTGGTG
GTTGATGAAT CGCACGTAAC ACTGCCCCAA ATTCGAGGTA TGTATGGCGG CGACCGTTCT
CGCAAAACCG TGCTTGTAGA GCACGGCTTT CGCCTTCCTT CCGCTCTCGA CAACCGTCCC
CTCCGCTTTG AAGAGTATGA AGAGATGGTG CCGCAAGTAA TTTGCATTAG CGCCACACCG
GGCGAGCATG AGCTCATGCG CTCAGGTGGC GAAGTGGTTG AATTATTAGT TCGCCCCACC
GGTTTGCTTG ATCCACCTGT GGAAGTGCGC CCCGTTAAAG GGCAAATTGA TAACCTCTTA
GCCGAAATTC GCCACCACAT TAGCATTGGG CACAAAGCCC TTGTAATGAC GCTCACCAAA
CGCATGTCGG AAGATTTGCA CGACTTTTTT CGCAAAGCAG GCATCCGCTG CCGCTACCTC
CACTCAGAAA TTAAAAGTCT TGAGCGAATG CAAATTTTGC GCGAACTACG AGCAGGCGAT
ATTGACGTAC TTGTTGGGGT GAACCTTCTC CGTGAAGGGC TGGACCTCCC CGAAGTTTCC
CTTGTTGCAA TTCTTGATGC CGATAAAGAG GGCTTTTTGC GTAACACCCG CTCACTCATG
CAGATTGCAG GACGCGCCGC CCGCAACCTC GACGGCTTTG TGGTGCTCTA CGCCGACGTT
ATTACCCGCT CCATTCAAGA GGTGCTTGAC GAAACCGCCC GCCGCCGCGC CATTCAGCAA
CGCTATAACG AAGAGCACGG TATTACGCCA CGCTCCATTG TAAAATCAGT TGACCAAATT
CTCGACACCA CAGGCGTAGC CGATGCCGAA GAGCGCTATC GTCGTCGCCG CTTTGGCTTA
GAACCCAAGC CCGAGCGCGT GCTTTCAGGT TATGCCGATA ACCTTACGCC CGAAAAAGGC
TATGCCATTG TTGAAGGATT ACGGCTTGAA ATGCAAGAAG CCGCCGAGCA CATGGAGTAC
GAAAAAGCGG CTTACCTCCG TGATGAAATC ACAAAGATGG AGCAAGTGTT GAAAAAGGAT
GGGTAG
 
Protein sequence
MENRTDNEYQ LVSPYQPAGD QPKAIEALVQ GVRDGRHWQT LLGVTGSGKT FTISNVIAQL 
NRPVLVMSHN KTLAAQLYGE LKQFFPHNAV EYFISYYDFY QPEAYLPSLD KYIAKDLRIN
DEIERLRLRA TSALLSGRKD VIVVSSVSCI YGLGSPEEWK AQIIKLRAGM EKDRDEFLRE
LISLHYLRDD VQPTSGRFRV RGDTIDLVPA HEELALRIEF FGSEIESLQT FDIQTGEILG
DDEYAFIYPA RQFVADEEKL QVAMLAIENE LAGRLNLLRS ENRFVEARRL EERTRYDLEM
MKELGYCSGI ENYSRHISGR PAGERPICLL DYFPEDYMVV VDESHVTLPQ IRGMYGGDRS
RKTVLVEHGF RLPSALDNRP LRFEEYEEMV PQVICISATP GEHELMRSGG EVVELLVRPT
GLLDPPVEVR PVKGQIDNLL AEIRHHISIG HKALVMTLTK RMSEDLHDFF RKAGIRCRYL
HSEIKSLERM QILRELRAGD IDVLVGVNLL REGLDLPEVS LVAILDADKE GFLRNTRSLM
QIAGRAARNL DGFVVLYADV ITRSIQEVLD ETARRRAIQQ RYNEEHGITP RSIVKSVDQI
LDTTGVADAE ERYRRRRFGL EPKPERVLSG YADNLTPEKG YAIVEGLRLE MQEAAEHMEY
EKAAYLRDEI TKMEQVLKKD G
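
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The following is a minimal sketch of both calculations in plain Python. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and do not come from the database. The codon table uses the amino acid assignments of NCBI translation table 11, which match the standard code. Alternative start codons are not treated specially, which is harmless here because the gene begins with ATG. Characters other than A, C, G and T would raise a `KeyError`.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to display the sequence in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence: six complete codons.
print(translate("ATGGAGAACC GAACCGATAA"))  # MENRTD
# Last block of the gene sequence: one codon followed by the stop codon TAG.
print(translate("GGGTAG"))  # G
```

The two printed fragments match the start (MENRTD) and the final residue (G) of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein length fields to be checked in the same way.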