Gene SNSL254_A2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2109 
SymboluvrC 
ID6484935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2041368 
End bp2043200 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content54% 
IMG OID642737465 
Productexcinuclease ABC subunit C 
Protein accessionYP_002041212 
Protein GI194445436 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0217985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value2.91711e-15 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGTGAAA TATTTGACGC AAAGGCGTTT TTGAAAACCG TTACCAGCCA ACCCGGTGTC 
TATCGTATGT ATGACGCCGG CGGTACGGTT ATTTATGTCG GTAAAGCAAA AGACCTGAAA
AAGCGTCTTT CCAGCTACTT TCGCAGCAAC CTGGCTTCGC GTAAAACCGA AGCGCTAGTT
GCGCAAATTC AACACATTGA TGTCACGGTA ACGCATACCG AAACGGAAGC GCTGCTGCTT
GAGCATAATT ACATCAAGTT GTATCAACCG CGTTACAACG TGTTGTTGCG TGATGATAAA
TCCTATCCTT TTATTTTTCT GAGCGGCGAT ACCCATCCGC GTCTGGCGAT GCACCGTGGC
GCCAAACATG CGAAAGGCGA GTACTTCGGC CCATTTCCTA ATGGCTATGC CGTGCGGGAA
ACGCTGGCGC TGTTACAGAA AATTTTCCCT ATCCGCCAGT GCGAAAACAG CGTATATCGC
AACCGCTCGC GTCCTTGCCT GCAATATCAG ATTGGCCGCT GTTTAGGCCC CTGCGTTGCA
GGACTGGTCA GCGAGGAGGA GTATGCGCAA CAGGTGGAGT ATGTCCGGTT GTTTTTGTCC
GGTAAGGACG ACCAGGTCTT AACACAGCTG ATTGCCCGGA TGGAAAAGGC CAGCCAGGAT
CTGGCATTTG AAGAGGCGGC GCGTATTCGC GATCAGATCC AGGCGGTACG CCGGGTCACG
GAAAAACAGT TTGTCTCCAA TGCGGGCGAC GATCTCGACG TGATCGGCGT GGCTTTTGAT
GCGGGTATGG CCTGTGTGCA TGTGCTGTTT ATTCGCCAGG GTAAGGTGCT GGGCAGCCGC
AGCTATTTTC CAAAAGTACC TGGCGGTACG GAACTGGGCG AAGTGGTGGA AACCTTTGTC
GGGCAGTTTT ACCTGCAGGG TAGCCAGATG CGCACGCTAC CGGGCGAGAT ACTGCTCGAT
TTTAATCTGA GTGATAAAAC GCTGCTGGCC GATTCGTTGT CGGAACTTGC CGGACGGCGT
ATTCATGTCC AGACAAAACC GCGCGGCGAT CGCGCCCGTT ATCTCAAGCT GGCGCGGACC
AACGCGGCAA CGGCCTTAAT CACTAAACTC TCGCAGCAGT CCACCATCAC GCAGCGTTTG
ACCGCGCTGG CGGCGGTATT AAAACTTCCT GCGATCAAGC GGATGGAATG TTTTGACATC
AGCCATACCA TGGGGGAGCA AACGGTCGCA TCCTGTGTGG TATTTGACGC TAACGGGCCG
TTACGCGCCG AGTATCGTCG TTATAATATC GCGGGCATCA CGCCGGGTGA TGACTACGCG
GCGATGAATC AGGTGCTACG TCGGCGTTAT GGCAAGGCCA TAGAAGAAAG TAAGATTCCG
GATGTTATTC TGATCGACGG CGGAAAAGGG CAGCTCGCCC AGGCGAAAGC CGTTTTCGCT
GAGCTGGATG TCCCCTGGGA TAAGCATCGT CCTTTGCTGC TTGGCGTCGC CAAAGGCGCG
GACAGAAAGG CCGGTCTGGA AACACTCTTT TTTGAACCGG AAGGCGAGGG GTTTAGCCTG
CCGCCGGACT CGCCGGCGCT GCATGTTATT CAGCATATTC GCGATGAGTC GCACGATCAC
GCGATCGGCG GGCACCGTAA AAAACGCGCG AAGGTTAAAA ATACCAGTAC GCTGGAAACT
ATTGAAGGCG TTGGGCCTAA ACGTCGCCAG ATGCTGCTGA AATATATGGG CGGTTTGCAA
GGACTACGTA ACGCCAGCGT AGAAGAAATT GCAAAAGTGC CGGGTATTTC GCAAGGTCTG
GCAGAAAAGA TCTTCTGGTC GTTGAAACAT TGA
 
Protein sequence
MSEIFDAKAF LKTVTSQPGV YRMYDAGGTV IYVGKAKDLK KRLSSYFRSN LASRKTEALV 
AQIQHIDVTV THTETEALLL EHNYIKLYQP RYNVLLRDDK SYPFIFLSGD THPRLAMHRG
AKHAKGEYFG PFPNGYAVRE TLALLQKIFP IRQCENSVYR NRSRPCLQYQ IGRCLGPCVA
GLVSEEEYAQ QVEYVRLFLS GKDDQVLTQL IARMEKASQD LAFEEAARIR DQIQAVRRVT
EKQFVSNAGD DLDVIGVAFD AGMACVHVLF IRQGKVLGSR SYFPKVPGGT ELGEVVETFV
GQFYLQGSQM RTLPGEILLD FNLSDKTLLA DSLSELAGRR IHVQTKPRGD RARYLKLART
NAATALITKL SQQSTITQRL TALAAVLKLP AIKRMECFDI SHTMGEQTVA SCVVFDANGP
LRAEYRRYNI AGITPGDDYA AMNQVLRRRY GKAIEESKIP DVILIDGGKG QLAQAKAVFA
ELDVPWDKHR PLLLGVAKGA DRKAGLETLF FEPEGEGFSL PPDSPALHVI QHIRDESHDH
AIGGHRKKRA KVKNTSTLET IEGVGPKRRQ MLLKYMGGLQ GLRNASVEEI AKVPGISQGL
AEKIFWSLKH