Gene Acid345_4773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4773 
Symbol 
ID4073367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5639770 
End bp5642322 
Gene Length2553 bp 
Protein Length850 aa 
Translation table11 
GC content64% 
IMG OID637986817 
Productexcinuclease ABC, A subunit 
Protein accessionYP_593846 
Protein GI94971798 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.235666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0143445 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGCC CCCACGATCC CGCGAACCGC AACGGCTTCG TCCGCGTCCG CGGTGCGCGC 
GAACACAATC TCAAGAACAT CGACGTTCAG ATTCCCCGCG ACGCTCTCGT CGTCTTCACT
GGTGTCTCCG GCTCTGGGAA ATCGTCGCTC GCATTCGGCA CGCTCTACGC CGAGGCCCAA
CGCCGCTATC TCGAATCTGT ATCGCCCTAC GCGCGCCGCC TCTTTCATCA GATGGCCGTC
CCCGAAGTCG ATGCCATCGA CGGCCTTCCG CCCGCCGTCG CCTTGCAGCA ACAGCGCGGA
TCACCAACCA CCCGCTCCTC CGTCGGCAGC GTCACCACGC TCTCGAACCT GTTGCGCATG
CTCTACTCGC GCGCCGGTGA CTATCCGAAA GGTCAACCGC TGCTCTACGC CGAGTCATTC
TCGCCCAACA CACCGGAAGG CGCTTGCCCG CGGTGCCATG GCCTCGGCCG CATCTACGAG
GCCACGGAAG ACTCCATGGT GCCCGACCCG TCGCTCACTA TTCGCGAACG CGCCATTGCC
GCATGGCCGC CCGCGTGGCA GGGCCAGAAC CAGCGCGACA TCCTCACCAC CCTCGGCTAC
GACGTCGACA TTCCGTGGCG CCAGCTCTCG AAGAAAGACC GCGACTGGAT CCTCTTTACC
GACGATCAGC CCGTAGTGCC GGTGTACGCG GGCTTCACCC GCGCGGAAGT GAAGCAGGCC
CTCAAGCGCA AAGAAGAGCC CAGCTACATG GGGACGTTCA CCAGCGCGCG TCGCTACGTG
CTCCATACCT TCGCCACAAC CGAGAGCGCG ATGATGAAGA AACGGGTCGC GCAATACGTG
CGCGGTGCGG AGTGCCCGTT GTGCCGCGGA AAGCGGCTGC GCCGCGAGTC GTTGTCCGTC
AAGTTCGCCG GTTACGACAT TGCCGAACTC TCCGGTCAAA CTCTTTCGAA GCTCGGTGAC
ATCTTTCATC CCTACGTTGA GCGCTCAACC GGGAAGCTGG CAAAAGAGCA TCCTGAGAAA
GCGGAGGTTG TGCGGCTCAT CGCGCAGGAT CTCTGCGGGC GCCTCACCGT CCTCCTCGAT
CTCGGGCTTG GTTATCTCAC CCTCGATCGC AGCACGCCTA CGCTCTCGCC CGGCGAGCTG
CAGCGCCTGC GCCTCGCCAC GCAAGTGCGT TCGAATTTGT TTGGCGTGGT TTACGTGCTC
GACGAACCCT CAGCCGGCCT GCATCCAGCC GATACTGAAG CCCTGCTGCG CGCGCTCGAC
AAGCTCAAGG CCTCCGGCAA TTCGCTCTTC GTCGTCGAGC ATGAACTCGA TGTCGTGCGT
CATGCCGATT GGATCGTGGA TGTCGGCCCG GCAGCGGGAG TGCACGGCGG CGAAGTCCTC
TTCAGCGGCA CTCCCGATGA TTTGCGCAGC GTCGCCAAGT CGCAAACCCG CCGCTACCTC
TTCGATCCGC CCCCGCTTCC CGTGCGTCCC GCGCGCCCGC CCATTGGCTG GCTGAAATTG
CGCGGCGTTA CCCGCCACAA CCTCCATAGT CTTGACGTCG AGTTCCCCGT TGGCACCTTC
ACTTCCGTCA CCGGCGTCTC GGGTTCCGGC AAATCGAGTC TCGTCAGCCA GGTGCTCGTG
GAGCTTGTCT CCGAACAACT GGGAACCACG GTCGCGGAAG AGCCAGTTGA AGGTGACACG
CTCGAACGCG ACACTCCTGC CACCACCTCA GGCAAGATCG TCTCCGGTCT CGAGCTGATT
CAGCGTCTCG TGGTGGTCGA TCAGAAAGCC ATCGGTCGCA CTCCGCGCTC AAACCTCGCC
ACCTACACGG GCCTGTTCGA CCACGTGCGC AACCTCTTCG CGGCGACGAA GGAAGCTCGC
GCTCGCCGCT ACAACGCTGG ACGCTTCTCC TTCAACGTCG CAGCCGGACG CTGCCCCAAC
TGCAAGGGCG AAGGCTTCGT CATGGTGGAG CTGCTCTTCC TGCCCAGCGT GTACTCACCA
TGCCCGGTCT GCAAAGGCGC GCGCTACAAC CCCAGGACGC TGGAGATTCA CTATCGCGGC
AAGAACATCG CCGAGGTCCT CAACCTCACC GTGGATGCCG CGTACGAGTT CTTCGCCGAC
GATCTCCCGA CCCGCCGCGC GCTGCACGTC CTGCGCGAAG TTGGACTCGG CTACCTGCGC
CTCGGCCAGC CCGCCACCGA ACTTTCCGGC GGCGAAGCGC AGCGCATCAA GCTCGCAACC
GAGCTGCAGC GGGGCCAGCA AGGCAACACC CTCTACATCC TCGACGAACC CACCACCGGC
CTTCATCCCG CCGACGTCGA GCGCCTCGTT GCCCAGCTCG ATCGCCTGGT GGATGCGGGC
AATACCGTCA TCGTCGTCGA ACATGACATG CGCGTCGTCG CCGGCAGCGA TTGGGTCGTC
GACATCGGCC CCGGCGCCGG CGAAGAGGGC GGAAAGATCG TCATCGCCGC GCGGCCTCAG
GAACTGGCGC GGGATTCGAA GAGCCGCACC GCTCCTTATC TCGCCAGCTT CCTCGCGTTG
GCGAGCCCGG AACCCGCCGA GCGCTCCGCC TGA
 
Protein sequence
MSRPHDPANR NGFVRVRGAR EHNLKNIDVQ IPRDALVVFT GVSGSGKSSL AFGTLYAEAQ 
RRYLESVSPY ARRLFHQMAV PEVDAIDGLP PAVALQQQRG SPTTRSSVGS VTTLSNLLRM
LYSRAGDYPK GQPLLYAESF SPNTPEGACP RCHGLGRIYE ATEDSMVPDP SLTIRERAIA
AWPPAWQGQN QRDILTTLGY DVDIPWRQLS KKDRDWILFT DDQPVVPVYA GFTRAEVKQA
LKRKEEPSYM GTFTSARRYV LHTFATTESA MMKKRVAQYV RGAECPLCRG KRLRRESLSV
KFAGYDIAEL SGQTLSKLGD IFHPYVERST GKLAKEHPEK AEVVRLIAQD LCGRLTVLLD
LGLGYLTLDR STPTLSPGEL QRLRLATQVR SNLFGVVYVL DEPSAGLHPA DTEALLRALD
KLKASGNSLF VVEHELDVVR HADWIVDVGP AAGVHGGEVL FSGTPDDLRS VAKSQTRRYL
FDPPPLPVRP ARPPIGWLKL RGVTRHNLHS LDVEFPVGTF TSVTGVSGSG KSSLVSQVLV
ELVSEQLGTT VAEEPVEGDT LERDTPATTS GKIVSGLELI QRLVVVDQKA IGRTPRSNLA
TYTGLFDHVR NLFAATKEAR ARRYNAGRFS FNVAAGRCPN CKGEGFVMVE LLFLPSVYSP
CPVCKGARYN PRTLEIHYRG KNIAEVLNLT VDAAYEFFAD DLPTRRALHV LREVGLGYLR
LGQPATELSG GEAQRIKLAT ELQRGQQGNT LYILDEPTTG LHPADVERLV AQLDRLVDAG
NTVIVVEHDM RVVAGSDWVV DIGPGAGEEG GKIVIAARPQ ELARDSKSRT APYLASFLAL
ASPEPAERSA