Gene Svir_21990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSvir_21990 
Symbol 
ID8387523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharomonospora viridis DSM 43017 
KingdomBacteria 
Replicon accessionNC_013159 
Strand
Start bp2367669 
End bp2369027 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content65% 
IMG OID644976252 
Productputative proteasome component/protein of unknown function, DUF275 
Protein accessionYP_003134034 
Protein GI257056202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0167932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGGC GGATCTTTGG GATCGAAACC GAGTTCGGGG TTACCTGCAC TTTCCACGGA 
CAGCGCAGGT TGTCACCCGA CGAAGTGGCG CGGTACCTGT TCCGGCGGGT GGTGTCATGG
GGTCGGTCCT CGAACGTGTT CCTGTCCAAC GGTTCCCGGC TCTATCTGGA CGTGGGGTCG
CATCCCGAGT ACGCGACCGC CGAATGTGAC GACCTTGCCC AGTTGGTGAC GCACGACAAG
GCAGGGGAGC GGATCCTGGA GGATCTGCTG ATCGACGCGG AACGTCGGCT CGCCGAGGAG
GGGATCGGCG GCGACATCTT CCTGTTCAAG AACAACACCG ACTCGGCGGG GAACTCGTAC
GGGTGTCACG AGAACTACCT GGTGACGCGT GCGGGTGAGT TCTCGCGGGT GGCCGACGTG
TTGCTGCCGT TTCTGGTGAC GCGGCAGCTG GTGTGCGGGG CGGGAAAGGT GCTGCAGACC
CCCCGTGGTG GGGTGTATTG CCTGTCGCAG CGTGCCGAAC ACATCTGGGA GGGCGTGTCC
AGCGCGACCA CGCGGTCACG GCCGATCATC AACACGCGGG ACGAACCGCA CGCCGACGCG
GAACGCTACC GCCGGCTGCA TGTGATCGTC GGCGATTCGA ACATGGCGGA GCCGACGACC
TTGCTGAAGG TCGGCTCGGT GCACCTGGTC CTGCAGATGA TCGAAGAGGG TGTGCAGTTC
CGGGACTTCA CCCTGGACAA CCCCATCCGA GCGATCCGGG AGATCAGTCA CGACCTGACG
GGGCGGCGTC AGGTTCGGCT GGCCGGTGGC CGGGAGGCCT CGGCCCTGGA GATCCAGCGG
GAGTACTACG CGCGTGCGGT GCAGCACGTG GAGTCGGGCG ATCCGTCGCC GACCACGCAA
TACCTGATCG ACCTTTGGGG ACGGGCACTG GATGCGGTGG AACAGCAGGA CTTCTCGAGT
ATCGACACCG AGATCGATTG GGCGATCAAG CACCGCCTGG TGGAGCGTTA CCGCAGTAAG
CACAACTTGA CGTTGTCGGA CCCGCGGGTG GCGCAGCTGG ACCTGGCCTA CCACGACATC
CGGCGGGGGC GTGGGGTGTT CGATCTGCTG CAGCGCAAGG GCATGGTGCG GCGGATCACC
GACGACGGGG AGATCGAGCT GGCCAAGGAC AGTCCACCTC AGACCACGCG GGCGAAGTTG
CGAGGTGACT TCATCGCGGC GGCGCAGGAG GCGGGGCGGG ACTTCACGGT GGACTGGGTC
CACCTGAAGC TGAACGACCA GGCGCAGCGA ACAGTGCTGT GCAAGGACCC GTTCCGGTCG
GTGGACGAGC GGGTGGAGCG GTTGATCAAC TCGCTGTGA
 
Protein sequence
MQRRIFGIET EFGVTCTFHG QRRLSPDEVA RYLFRRVVSW GRSSNVFLSN GSRLYLDVGS 
HPEYATAECD DLAQLVTHDK AGERILEDLL IDAERRLAEE GIGGDIFLFK NNTDSAGNSY
GCHENYLVTR AGEFSRVADV LLPFLVTRQL VCGAGKVLQT PRGGVYCLSQ RAEHIWEGVS
SATTRSRPII NTRDEPHADA ERYRRLHVIV GDSNMAEPTT LLKVGSVHLV LQMIEEGVQF
RDFTLDNPIR AIREISHDLT GRRQVRLAGG REASALEIQR EYYARAVQHV ESGDPSPTTQ
YLIDLWGRAL DAVEQQDFSS IDTEIDWAIK HRLVERYRSK HNLTLSDPRV AQLDLAYHDI
RRGRGVFDLL QRKGMVRRIT DDGEIELAKD SPPQTTRAKL RGDFIAAAQE AGRDFTVDWV
HLKLNDQAQR TVLCKDPFRS VDERVERLIN SL