Gene Sde_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2053 
SymboluvrC 
ID3967437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp2632491 
End bp2634329 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content49% 
IMG OID637921143 
Productexcinuclease ABC subunit C 
Protein accessionYP_527525 
Protein GI90021698 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.747011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGC ACCAACCCCC TCCAGTATTT GATTCTACAA GTTTTCTAAA AAACGTAACA 
AAATTGCCCG GCGTCTACCA AATGTACGAC GCGGATGGGG CGATTTTGTA CGTTGGTAAG
GCCAAAAACC TCAAAAACAG GCTTAGTAGC TATTTTAGGG CAACAGGCCT TACCCCCAAA
ACGCACGCTT TAGTAAAGCG TATTCAGGCC ATAGAGGTAA CGGTTACGCC TAGTGAGGCA
GAGGCGCTGG TACTTGAGCA CAACCTAATC AAATCGCAGA AGCCGCCGTT TAATATTCTG
CTGCGCGATG ACAAGTCCTT CCCCTATATC TTTATCTCAG AAGGGGAGCC ATACCCAAAG
TTGGCTTTTC ACCGTGGTCC CAAAAAGAAA AAAGGCCAGT ATTTTGGCCC ATTCCCGAAT
GCATCTGCTG TAAAAGAAAC GCTTAACTTC TTGCAGCGCA CGTTCCGCGT AAGGCAGTGT
GAAGACTCTG TATTTAGAAG CCGAACGCGG CCCTGCTTGC AGTACCAAAT AGGGCGCTGC
ACAGGCCCGT GTGTAGAGGC CATAAGCGTA GAAGATTACG CGGTAGATCT TGCTCACACC
GCCATGTTTT TAGACGGTAA AAGCGAGGTT TTGCAGCAAG AGCTGCAAGT TGAGATGGAA
CAAGCCTCGC AGGCGCTGGA TTTTGAGCGC GCGGTGGTTG TGCGCGACCA AATTACCGAT
TTGCGGCAAG TACAAGCGCA GCAGGTGATG GAGGCGGGCT ATAGCAACCA AGATGTTGTT
GCATGTGCTT CTGAATCTGG GGTGCATTGC ATTCATATAC TCTATGTGCG CCAGGGACGC
ATTGTGGGTA GCAAAAGCTA TTTGCCAAAA ACCAAGCTCG ATAGCACCGA GGAAGACGTG
CTAAGCGCAT TTCTCGCTCA CCACTACCTT GGTGGGGCCG CTATGGATGT ACCGCCACAC
ATTATTATTA GCCATAAGCT GGCCGACCAA TTAATTATTG GTGAGGCAGT AGAAAAGGCC
ACTGGCAAGC AGCTTAAGCT AACGCATAAC GTGCGTACCT ACCGCGCTAA GTGGCTAGCA
ATGGCCTTAG AAGCTGCGCG TCAAAACCTC AAGAACCATC TAAATAACAA GCAAACCTTG
GTGGCTAGGT TTGAATCCCT GCAAGACATA CTTGGTTTGG ATGAAACGCC TAATCGTATT
GAATGCTTCG ATATAAGCCA TAGCAGCGGC GAGTTAACTG TGGGTAGCTG TGTGGTGTTT
GACCAAAATG GCGCTAAAAA GTCAGATTAT CGCCGGTTTA ACATAGAAGG TATAAAAGCA
GGGGACGACT ATGCGGCGAT GGAGCAGGTA TTAACCCGCC GGTACACGCG CCTGCAGAAA
GAATCCAGCA GCATGCCTGA TTTGGTGTTA ATTGATGGCG GCAAAGGCCA GTTGTCTAAG
GCAAAAGCCG TTGTGGAGGA GCTGGGTATC CACGATATGA TGCTCATAGG TGTGGCCAAG
GGCACCACGC GTAAACCGGG TTTCGAAACC TTGGTGCTAA CCAGCGGGGC AGAGCGTGTA
CTAAAGGCCG ATAGCGCAGC CCTGCATCTT ATTCAGCAAA TACGCGATGA GGCCCACCGT
TTTGCAATTA CGGGTCACAA GCAGCGTCGA GACAAAAAGC GTCGTACTTC TGTGTTAGAA
GGAATACCGG GCGTGGGGCC GAAGCGCAGA AAAGAGTTGT TGGTGCATTT TGGCGGTTTA
CAAGAAGTAT TGCGTGCTAA TGTCGATGAT TTGGCCAAAG CGCCTTCAAT TAGCAAAAAA
ATGGCACAAG AGATTTACAA TGTACTGCAT AGTGAGTAA
 
Protein sequence
MSQHQPPPVF DSTSFLKNVT KLPGVYQMYD ADGAILYVGK AKNLKNRLSS YFRATGLTPK 
THALVKRIQA IEVTVTPSEA EALVLEHNLI KSQKPPFNIL LRDDKSFPYI FISEGEPYPK
LAFHRGPKKK KGQYFGPFPN ASAVKETLNF LQRTFRVRQC EDSVFRSRTR PCLQYQIGRC
TGPCVEAISV EDYAVDLAHT AMFLDGKSEV LQQELQVEME QASQALDFER AVVVRDQITD
LRQVQAQQVM EAGYSNQDVV ACASESGVHC IHILYVRQGR IVGSKSYLPK TKLDSTEEDV
LSAFLAHHYL GGAAMDVPPH IIISHKLADQ LIIGEAVEKA TGKQLKLTHN VRTYRAKWLA
MALEAARQNL KNHLNNKQTL VARFESLQDI LGLDETPNRI ECFDISHSSG ELTVGSCVVF
DQNGAKKSDY RRFNIEGIKA GDDYAAMEQV LTRRYTRLQK ESSSMPDLVL IDGGKGQLSK
AKAVVEELGI HDMMLIGVAK GTTRKPGFET LVLTSGAERV LKADSAALHL IQQIRDEAHR
FAITGHKQRR DKKRRTSVLE GIPGVGPKRR KELLVHFGGL QEVLRANVDD LAKAPSISKK
MAQEIYNVLH SE