Gene Lcho_0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0303 
Symbol 
ID6161500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp320880 
End bp323849 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content67% 
IMG OID641663047 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001789343 
Protein GI171056994 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCACGT CCGATGACGT CAACCCCGTC GTTATGATCG TCCGCTTGGC CTCGACAGAT 
TGCATCCTCA TGTCTACCGG TGTTTCCACC GTGCATCCGC ACCCGAAGAT CAGCGTGCGT
GGTGCCCGCA CCCACAACCT CAAGAACATC GACCTCGACC TGCCCAAGCA TGCGCTGGTC
GTGATCACCG GACTGTCCGG CTCGGGCAAG TCCAGCCTGG CGTTCGACAC CCTCTATGCG
GAGGGCCAAC GCCGCTACGT GGAAAGCCTG TCGGCCTACG CGCGGCAGTT CCTGCAACTG
ATGGACAAAC CCGATGTCGA CGTGATCGAG GGCCTGAGCC CGGCGATCTC GATCGAGCAG
AAGGCCACCA GCCACAACCC GCGCTCGACC GTCGGCACGA TCACCGAGAT CCACGACTAC
CTGCGTCTGC TCTACGCCCG CGCCGGCACG CCGCATTGCC CCGACCACGG CCAGCCGCTC
GAAGCTCAGA GCGTCAGCCA GATGGTCGAC GCCGTGCTGG CCATGCCTGC CGAAACCCGA
CTGATGGTGC TGGCACCCGT GGTGCGTGAT CGCAAGGGCG AGTTCGTCGA ACTGTTCGAG
AGCATGCAGG CGCAAGGCTA CGTGCGCTTC CGGGTCGACG GCCAGGTGGT CGAGGCCGCC
GACCTGCCCA AGCTCAAGAA AGCCGAGAAA CACGACATCG ACGTCGTCAT CGACCGCCTC
AAGAGCCGCG CCGACGTCAC CCAGCGCCTG GCCGAGAGCT TCGAAGCCGC GCTGCGCATC
GCCGAAGGCC GCGCCATCGC GCTCGAGATG GACACGGGCA GGGAACACCT CTACTCCAGC
AAGTTCGCCT GCCCGGTGTG CAGCTACTCG CTGTCCGAAC TGGAACCGCG CTTGTTCTCG
TTCAACTCGC CGGTCGGCGC CTGCCCCAGC TGCGACGGCC TCGGCATGGT CACCGTGTTC
GACCCCGAGC GCGTGGTCGC CTTCCCGTCG CTCAGCCTGG CCAGCGGCGC CGTCAAGGGC
TGGGACCGCC GCAACTCGTA CACGTTCTCG CTGCTCGAGA GCGTGGCCGC GCACTACGAC
TTCGACCTCG ACACCGCCTT CGAAGAACTC GCCCCCGAAC ACCGGCAGGT GCTGCTGCGT
GGCTCGGGCG AGGAGGAGAT CGCCTTCACG TATGAAGCCG AAGGCGCAGG CGGCAAGAAA
CGCAGCGTCA AGCGCAAGCA TCCCTTCGAA GGCATCCTGC CGAGCCTGGA ACGGCGCTTC
CGCGAGACCG ATTCCCCCGC CGTGCGCGAA GACCTGACGC GCTACCAGAG CGTCAAGCAC
TGCCCCGACT GCGACGGCGC CCGGCTGCGA CGCGAGGCCC GCCACGTCCT GCTGGTCGAC
GACCAGGACG ACGCGCTCGG CCGCAAGCCG CTGGCGATCT ACCAGGTCGA ACACGCCACG
CTGGCCGACT GCCTGGCCTA CTTCGAGACC CTGCACCTCA AGGGCGCCAA GGCCGAGATC
GCCGACAAGG TGGTGCGCGA GATCCGCGCG CGGCTGCGTT TCCTCAACGA CGTCGGCCTC
AATTACCTGA GCCTCGACCG CAGCGCAGAC ACCCTGTCGG GCGGCGAAGC CCAGCGCATC
CGCCTGGCCA GCCAGATCGG TTCGGGCCTG TCGGGCGTGA TGTACGTGCT CGACGAACCC
AGCATCGGCC TGCACCAGCG CGACAACGAT CGCCTGATCG GCACCCTCAA GCACCTGCGT
GATCTCGGCA ACAGCGTGCT GGTGGTCGAA CACGACGAGG ACATGATCCG CGCCGCCGAC
CACGTCATCG ACATGGGCCC GGGTGCCGGC GTACACGGCG GGCAGGTGAT GTCGCAAGGC
ACGCCCGAAC ACGTGGCGGC CGACCCCAAC TCGCTGACCG GTCGCTACCT CGGCCAGGTA
CTCAAGATCG CCATCCCCCA GCGCCGCAAC AAGCTGTCGG ACCAGAAAGA TCCGCGCGTG
CTGCGCATCG TCAACGCCCA CGGCAACAAC CTGCGCGGCG TGACGGCCGA GATCCCGGTC
GGCCTGTTCA CCTGCGTGAC CGGCGTGTCG GGCTCGGGCA AGAGCACGCT GGTCAACGAC
ACGCTCTACG CCGCGGTGGC ACGCAAGCTC TACCAGAGCC ACCTCGAGCC GGCGCCGCAC
GACGAGATCG AAGGCCTGGA CGCCTTCGAC AAGGTCATCA ACGTCGACCA GAGCCCGATC
GGCCGCACGC CGCGCAGCAA TCCGGCCACC TACACCGGCC TGTTCACGCC GATCCGCGAG
ATGTTCGCCG AGGTGCCGAT GGCGCGCGAA CGTGGCTACG GGCCGGGGCG CTTCAGCTTC
AATGTCGCCG GCGGGCGCTG CGAGTCCTGC CAGGGCGACG GTGTGCTGAA GGTCGAGATG
CACTTCCTGC CCGACGTCTA CGTCGCCTGC GACGTCTGCC ACGGCAAGCG CTACAACCGC
GAAACGCTGG AGGTGCTCTA CAAGGGCAAG AACATCACCC AGGTGCTGGA GCTGACGGTC
GAGGACGCAC ACGCCTACTT CAACGCCGTG CCCAGCATCG CGCGCAAGCT GCAGACGCTG
CTCGACGTCG GACTGGGCTA CATCAAGCTC GGCCAGGCGG CGACCACGCT GTCGGGCGGC
GAGGCGCAGC GCGTCAAGCT GGCGCTGGAG CTGTCCAAGC GCGACACCGG CCGCACGCTC
TACATATTGG ACGAGCCGAC CACCGGCCTG CACTTCCACG ACATCGGCCT GCTGCTGAAA
GTGCTGCACC AGCTGCGCGA CGCCGGCAAC ACCATCGTCG TGATCGAGCA CAACCTCGAC
GTCATCAAGA CCGCCGACTG GCTGCTCGAC ATGGGCCCTG AAGGCGGCTC GGGCGGCGGG
CGGCTGCTGA TCGCCGGCAC GCCCGAAGAG ATCGCCGAGT GCGCCGAGAG CCACACCGGG
CGCTTCCTCA AGCCGCTGCT GGCGCGATGA
 
Protein sequence
MCTSDDVNPV VMIVRLASTD CILMSTGVST VHPHPKISVR GARTHNLKNI DLDLPKHALV 
VITGLSGSGK SSLAFDTLYA EGQRRYVESL SAYARQFLQL MDKPDVDVIE GLSPAISIEQ
KATSHNPRST VGTITEIHDY LRLLYARAGT PHCPDHGQPL EAQSVSQMVD AVLAMPAETR
LMVLAPVVRD RKGEFVELFE SMQAQGYVRF RVDGQVVEAA DLPKLKKAEK HDIDVVIDRL
KSRADVTQRL AESFEAALRI AEGRAIALEM DTGREHLYSS KFACPVCSYS LSELEPRLFS
FNSPVGACPS CDGLGMVTVF DPERVVAFPS LSLASGAVKG WDRRNSYTFS LLESVAAHYD
FDLDTAFEEL APEHRQVLLR GSGEEEIAFT YEAEGAGGKK RSVKRKHPFE GILPSLERRF
RETDSPAVRE DLTRYQSVKH CPDCDGARLR REARHVLLVD DQDDALGRKP LAIYQVEHAT
LADCLAYFET LHLKGAKAEI ADKVVREIRA RLRFLNDVGL NYLSLDRSAD TLSGGEAQRI
RLASQIGSGL SGVMYVLDEP SIGLHQRDND RLIGTLKHLR DLGNSVLVVE HDEDMIRAAD
HVIDMGPGAG VHGGQVMSQG TPEHVAADPN SLTGRYLGQV LKIAIPQRRN KLSDQKDPRV
LRIVNAHGNN LRGVTAEIPV GLFTCVTGVS GSGKSTLVND TLYAAVARKL YQSHLEPAPH
DEIEGLDAFD KVINVDQSPI GRTPRSNPAT YTGLFTPIRE MFAEVPMARE RGYGPGRFSF
NVAGGRCESC QGDGVLKVEM HFLPDVYVAC DVCHGKRYNR ETLEVLYKGK NITQVLELTV
EDAHAYFNAV PSIARKLQTL LDVGLGYIKL GQAATTLSGG EAQRVKLALE LSKRDTGRTL
YILDEPTTGL HFHDIGLLLK VLHQLRDAGN TIVVIEHNLD VIKTADWLLD MGPEGGSGGG
RLLIAGTPEE IAECAESHTG RFLKPLLAR