Gene Cag_0154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0154 
Symbol 
ID3747719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp171329 
End bp174184 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content49% 
IMG OID637772681 
Productexcinuclease ABC subunit A 
Protein accessionYP_378475 
Protein GI78188137 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.676306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTCA GCCACATTAG CATACGAGGC GCCCGCGTTC ATAACCTCAA GAACATTTCG 
CTTGATATTC CCCGCAACCA ATTTGTGGTT ATTACAGGGC TTTCAGGTTC AGGAAAATCG
AGTCTTGCTT TCGACACCAT TTATGCCGAA GGGCAGCGCC GCTTTATGGA AACGCTCTCC
CCTTATGCAC GCCAATATAT TGGCAACATT GAGCGCCCTG ATGTGGATTT TATTGAAGGA
CTGTCGCCCG TTATTGCTAT TGATCAAAAA AGTACCAGCC GCTCCCCTCG CTCAACGGTT
GGCACTATTA CCGAAATCCA CGACTTTATT CGGTTGCTGT ATGCAAAAGC GGGACGCCGT
TACAATCCCG AAACGGGTGC CATGGTGCAA GCACAAAGCG CCGACAACAT TCTTGCAACC
ATTCTTGCCC TACCCGAAGG AAGCAAGGTG CAAATTCTTT CACCACTTGT TACAGGGCGA
AAAGGGCATT ATCGGGAGCT ATTTGAGCGC TTACGCAGCA AAGGCTTTTT GCGGGTGCGT
GTTGATGGCG AATTGCAAGA AATGGTGCCC AACATGCAGC TTGAGCGTTA CAAAAGCCAC
ACCATTGAGT TGGTGGTTGA TCGGCTTGTT CTTGCGCCTG AAAGCGAAGC ACGAGTGCGC
GAAGCCGTCA TGCTGGCTAT TAGTATCTCG GAGCACAAGT CGTCGGTTAT TTGCACGCCC
TTTGAGGGTG GCTTTACCGA GCTTGCTTTT ACGCTCAGCA AAGGGGATAA TGAGGATGCC
CTGCCAACAT CAACCTTGGC ACCGAACCAC TTTAGCTTTA ATTCCCCTTA TGGCGCTTGC
CCAACCTGTA ACGGATTGGG TGAATTGATG CAGCTTTCGG GTGAATTGAT GATTCCCGAT
CCTTCGCTGT CGCTCAATCA AGGTGGGCTT GACCCTTTTG GAAAAGCTGG CAAACGCAAC
CATTGGCAGG TAATTCGCGC TATTGCAAAA GAGTTCGATT TTACGCTCGA TACTCCCATG
AGCAAAATTC CCAAAAGCGC ACTTAAAATA TTGCTCAATG GCTCAGGCAA GCGCACCTTT
GAGGTAGCTT ACACCTCTTC AGGACACACC AGCTTATATC CACAGCCTTT TCAAGGTGCC
GTAGCATATG TGCAAGAAAT TCTCAATAAC GCCACAACCT CGAAAGTGCG GGAGTGGGCT
GAAGCCTACA TGCTCCACCA ACCCTGCCCC GTATGCCTTG GCGCACGCTT AAAACCCGAA
AGCTTGCAGG TTAAAATTCA TGGCTTAAAC ATTGCTGAAC TCGAAGCTTT GCCACTACCT
GAAACCCTTG CCTTTTTTAA TAATCTACCG CCCAATCTTA GCCAAAAAGA GTTGATAATT
GCCACTCCCG TGTTGCATGA AATCACCAAA CGGCTCCAAT TTTTATTGGA TGTTGGGTTA
GGCTATCTCT CGCTTGACCG TAGCTCGCAC ACACTTTCGG GCGGCGAAGC ACAGCGCATT
CGGCTTGCCT CGCAGCTTGG CTCGCAACTG AGCGGCGTGC TCTATGTGCT TGACGAGCCG
AGTATTGGAT TGCATCAGCG CGACAACCAC AAGCTCATTA CCTCATTGAA GCATTTGCGC
GACCTTGGCA ACACCGTGTT AGTGGTTGAG CACGATAAAG ATACCATGCT GGAAGCTGAT
ACCATTGTGG ATCTTGGTCC GGGTGCGGGC GCTTACGGAG GCGAAATTGT GGCTTTTGGC
GCAGCCCGTG AGCTTGACCC TTCGTCGCTA ACGGCAGGCT ACCTCAATGG CACCAACCGC
GTTTTTTATG CAAGCGAAGC TTCATCCGAA AAAACTGATG CCGATGCCGA TGCCACACCA
CTTTTTCTTA CGCTGAAAGG ATGTAAAGGC AACAATCTTA AAAACATTGA CGCACAAATT
CCGCTCCGCA AATTAGTAAG CATTACGGGT GTAAGTGGCT CAGGTAAATC AACCTTGATT
AATGAAACCC TTTACCCAAT CCTTGCACGC CACTTCTACC GCTCAAAAGT AGTAACCGCA
CCATTCGACG CTATTGAAGG GATAGAGCTG CTTGACAAGG TGGTAAATGT TGACCAATCA
CCCATTGGAC GCACACCGCG CTCCAATCCC GCAACCTACA CGGGAGCCTT TACCTTTATT
CGCGACTTCT TTACCCGCTT GCCCGAAGCG CAAATTCGTG GCTACAAAGC GGGACGTTTT
AGCTTTAACG TAAAAGGGGG GCGCTGCGAA GTGTGCCAAG GCGCAGGCAC GCGCAAAATT
GAGATGAATT TTTTGCCCGA CGTTTACGTG CAGTGCGAAA ATTGCAAAGG CGAACGCTAC
AACCGCGAAA CGCTGATGGT AAAGTATCGC GGTAAATCCA TTGCCGACGT ATTGGAAATG
AGCATTACCG AAGCCGCTGA ATTTTTTACC GACTTCCCTC GCATTCGCCG CATTCTCAAT
ACCATGCAAA GCGTTGGGCT TGGCTATCTC AAGCTGGGGC AACCCTCGCC CATGCTTTCA
GGCGGCGAAG CACAACGCAT TAAATTATCG GCAGAGTTGG CTAAAATTCA AACAGGCAAA
ACGCTCTATA TTTTAGATGA ACCAACCACG GGACTTCATT TTCAGGATAC GCAACATTTG
CTGGAAGTGC TCCGCAAATT AGTAGAGAAA GGCAATAGCG TCATTATTAT TGAGCACAAT
CTCGATATTA TTAAAAACAG CGACTGGGTT ATTGATTTAG GAGCAGAAGG GGGATTTGAA
GGGGGAACAA TTATTGCAGA AGGCACACCT CAGCAAATTG CCGATACGCC TCATTCGCAT
ACAGGTAGAT TTTTAAAGAT GGAGATGGGG GGTTAG
 
Protein sequence
MSFSHISIRG ARVHNLKNIS LDIPRNQFVV ITGLSGSGKS SLAFDTIYAE GQRRFMETLS 
PYARQYIGNI ERPDVDFIEG LSPVIAIDQK STSRSPRSTV GTITEIHDFI RLLYAKAGRR
YNPETGAMVQ AQSADNILAT ILALPEGSKV QILSPLVTGR KGHYRELFER LRSKGFLRVR
VDGELQEMVP NMQLERYKSH TIELVVDRLV LAPESEARVR EAVMLAISIS EHKSSVICTP
FEGGFTELAF TLSKGDNEDA LPTSTLAPNH FSFNSPYGAC PTCNGLGELM QLSGELMIPD
PSLSLNQGGL DPFGKAGKRN HWQVIRAIAK EFDFTLDTPM SKIPKSALKI LLNGSGKRTF
EVAYTSSGHT SLYPQPFQGA VAYVQEILNN ATTSKVREWA EAYMLHQPCP VCLGARLKPE
SLQVKIHGLN IAELEALPLP ETLAFFNNLP PNLSQKELII ATPVLHEITK RLQFLLDVGL
GYLSLDRSSH TLSGGEAQRI RLASQLGSQL SGVLYVLDEP SIGLHQRDNH KLITSLKHLR
DLGNTVLVVE HDKDTMLEAD TIVDLGPGAG AYGGEIVAFG AARELDPSSL TAGYLNGTNR
VFYASEASSE KTDADADATP LFLTLKGCKG NNLKNIDAQI PLRKLVSITG VSGSGKSTLI
NETLYPILAR HFYRSKVVTA PFDAIEGIEL LDKVVNVDQS PIGRTPRSNP ATYTGAFTFI
RDFFTRLPEA QIRGYKAGRF SFNVKGGRCE VCQGAGTRKI EMNFLPDVYV QCENCKGERY
NRETLMVKYR GKSIADVLEM SITEAAEFFT DFPRIRRILN TMQSVGLGYL KLGQPSPMLS
GGEAQRIKLS AELAKIQTGK TLYILDEPTT GLHFQDTQHL LEVLRKLVEK GNSVIIIEHN
LDIIKNSDWV IDLGAEGGFE GGTIIAEGTP QQIADTPHSH TGRFLKMEMG G