Gene Tfu_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_2021 
SymboluvrC 
ID3580894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp2361510 
End bp2363480 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content68% 
IMG OID637685714 
Productexcinuclease ABC subunit C 
Protein accessionYP_290077 
Protein GI72162420 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.387091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTCC GCCCGACTTT GCGTCCGAAG CCCGGATCGA TCCCCACCGA TCCGGGGGTC 
TACCGTTTCC GGGACGAGCA CGGCCGTGTG ATCTACGTCG GCAAGGCGAA GAACCTGCGG
GCCCGCCTGT CCTCGTACTT CCAGGATTTC AGCGCGCTGC ACCCCCGCAC CCAGACCATG
ATCTCCACCG CCGCCGACGT CGACTGGACG GTGGTGAACA CCGAGGTGGA GGCCCTGCAA
CTGGAGTATT CCTGGATCAA GGAGTACTCT CCGCGGTTCA ATGTCCGCTA CCGCGACGAC
AAAAGCTACC CCTACTTGGC GGTGACCCTC AACGAAGAGT TCCCCCGGGT GCAGGTGATG
CGCGGGGCCC GCCGCCGCGG GGTGCGCTAC TTCGGGCCCT ACTCCTATGC GTGGGCGATC
CGCGACACCG TCGACCTGCT GCTCCGCGTG TTCCCGGTGC GCACCTGCTC GGCTGGGGTG
TTCAAACGCG CTCGGTCCAG TGGCCGCCCT TGCCTGCTGG GCTATATCGA CAAGTGCTCT
GCCCCGTGCG TGGGGCGGAT CGGCGTGGAG GAGTACCGGG CGCTCGCCGA GGATTTCTGC
GCTTTCATGG CAGGGGAGAC CGGCCGGTTC CTGCGGCAGT TGGAAGCCGA GATGAAACAG
GCGGCGGCCG CGCAGGAGTA CGAGCGTGCC GCCCGGATCC GTGACGATAT CCAGGCGCTG
CGCACGGTCA TGGAGAAGCA GGCGGTGGTG CTCGGGGACA GCACCGACTG CGACGTGATC
GCGATCGCCG AGGACCAGTT GGAAGCCGCG GTCCAAGTGT TCTACGTGCG CGGCGGCCGG
ATCCGCGGGG AGCGCGGCTG GGTGGTGGAC AAAGTCGAGG ACGTCTCCAC GGGGAAACTC
GTCGAGCAGT TCCTGGCCCA GACGTACGGG GGTGCCGACG ATGAGGAGTC CACCACGGCG
ATTCCCCGCG AAGTGCTGGT GTCGGCCGAG CCCGCCGACC GGGACGCGGT CGTCGCCTGG
CTGTCGAAGC GGCGCGGCGC GGCCGTGGAT GTGCGGGTCC CCCAGCGCGG CGACAAACGG
GCCCTCATGG AGACCGTGCT CAAAAACGCG GAGCAGACCC TGGCCCGCCA CAAGAGCCAG
CGCGCTTCCG ACTTGACGAC CCGGTCCAAA GCCCTGGCGG AGATCGCGGA GGCGCTCGGT
CTGGCGGAGG CGCCGCTGCG CATCGAATGC TTCGACATCT CCACCCTGCA GGGGGAGCAC
ACTGTGGCGT CCATGGTGGT GTTCGAAGAC GGTCTGGCCC GCAAGTCCGA GTACCGGCGG
TTCAGTATCC GCGGGGCGGA AGGCGCGGAC AGTGACGTGG CCGCGATGTA CGAGGTCATC
AGCCGGCGGT TCACCCGTTA CCTGGAGGAG AGCCAGCGTG TCGGCGAACT GGACACGCTG
GGGGAGAGCG GTGCGCCGCA GGGGGCGGAG CGGAAGGCCC CCCGTTTCGC CTATCCCCCT
AACCTCGTCG TCGTGGACGG CGGCCGTCCG CAGGTCGCGG CTGCCCAGCG GGCCTTGGAC
GATCTGGGGA TCGAGGATGT GGCGGTCTGC GGGCTGGCGA AACGCTTGGA GGAAGTGTGG
TTGCCCGGGG AAGAGGACCC GATCATCCTG CCGCGGACGA GTGAGGGGCT CTACCTGCTG
CAGCGTGTGC GAGACGAGGC GCACCGGTTT GCGATCTCCT ACCATCGACG CAAGAGAGCG
AAGGCGCTGA CAGCCAGCGT GCTGGATGAC ATCCCCGGGC TGGGGCCGGT CCGCCGCGCC
GCTCTGCTGA AGCATTTCGG GTCGGTGCGG CGGCTGGCGC AGGCCACGGC CGCGGAGATC
GCTGAGGTGC CGGGGATCGG GGAGCGGACC GCGCAGACTA TCTACGAGCG GCTCACGAGC
GTGGAGGGCG GACAGCGGAC ACAACCGGAG AACAGCAAGG CAGACGAGTG A
 
Protein sequence
MTVRPTLRPK PGSIPTDPGV YRFRDEHGRV IYVGKAKNLR ARLSSYFQDF SALHPRTQTM 
ISTAADVDWT VVNTEVEALQ LEYSWIKEYS PRFNVRYRDD KSYPYLAVTL NEEFPRVQVM
RGARRRGVRY FGPYSYAWAI RDTVDLLLRV FPVRTCSAGV FKRARSSGRP CLLGYIDKCS
APCVGRIGVE EYRALAEDFC AFMAGETGRF LRQLEAEMKQ AAAAQEYERA ARIRDDIQAL
RTVMEKQAVV LGDSTDCDVI AIAEDQLEAA VQVFYVRGGR IRGERGWVVD KVEDVSTGKL
VEQFLAQTYG GADDEESTTA IPREVLVSAE PADRDAVVAW LSKRRGAAVD VRVPQRGDKR
ALMETVLKNA EQTLARHKSQ RASDLTTRSK ALAEIAEALG LAEAPLRIEC FDISTLQGEH
TVASMVVFED GLARKSEYRR FSIRGAEGAD SDVAAMYEVI SRRFTRYLEE SQRVGELDTL
GESGAPQGAE RKAPRFAYPP NLVVVDGGRP QVAAAQRALD DLGIEDVAVC GLAKRLEEVW
LPGEEDPIIL PRTSEGLYLL QRVRDEAHRF AISYHRRKRA KALTASVLDD IPGLGPVRRA
ALLKHFGSVR RLAQATAAEI AEVPGIGERT AQTIYERLTS VEGGQRTQPE NSKADE