Gene Sare_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3809 
Symbol 
ID5705304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4342511 
End bp4343686 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content71% 
IMG OID641273231 
ProductnifR3 family TIM-barrel protein 
Protein accessionYP_001538593 
Protein GI159039340 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0210631 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGACCA CGACGAAGGC TGCACCGTAC CGTCCGCTGA CCATCGGGCG GCATCAGGTG 
TGGCCGCCGG TGGTGCTGGC GCCGATGGCG GGCATCACCA ACGTCGGGTT CCGCCAGCTT
TGCCGGGAGC AGGGCGGTGG CATCTACGTC TGCGAGATGA TCACGACGGT TGCGCTGGTC
GAGCGGAATC CCAAGACCCT GCGCATGATC GCCTTCGGTG GGGGAGAGTC CCCCCGTAGC
CTCCAGCTCT ACGGCACCGA TCCGGAGGTC ACCGCCGCCG CTGTACGGAT CGTCGTCGAG
CGGGACCTCG CCGACCATAT CGACCTGAAC TTCGGCTGCC CGGTCCCGAA GGTCACCCGC
AAGGGTGGTG GGGCGGCCCT GCCGTGGCGG CGTCGGCTCT TCGCCCGGCT GGTGACGGCT
GCGGTGGACG CCGCGTCACC GGCCGGGGTG CCGGTCACGG TCAAGATGCG TAAGGGCATC
GACGACGACC ACCTGACGTA CGTCGAGGCC GGTCTCGCCG CCCAGGACGC GGGTGTCGCG
GCGGTGGCCC TGCACGGGCG TACGGCTGCC CAGCGTTATT CGGGTACCGC CGACTGGGAC
GCCATCGCGA CGCTGAAAGC GGCGTTGGAC GTGCCGGTAC TGGGTAACGG CGACATCTGG
GAGGCCGACG ACGCGCTGCG GATGGTGGCG CACACCGGCG TCGACGGGGT CGTGGTGGGG
CGTGGCTGCC TCGGCCGACC GTGGCTCTTC GCCGACCTGG AGGCCGCCTT CTCTGGTTCG
CCGCAACGGC GGCTGCCGAC GTTGGGCGAG GTGGCGACAA CCATGCGCCG GCACGCGGAG
TTGCTGGTCG AGCAGTTCTC CGCGGGTGCC CGCAGTGCGG CGCGCGGTGA GCGGGATGGC
TGCACCGACT TCCGGAAGCA CGTCGCCTGG TATCTGAAGG GCTTCCCGGT CGGTGGTGAG
CTCCGCCGGT CGCTCGCGAT GGTCGAAAGC CTGGTGCAGC TCGACGACCT GCTCGGCAAG
CTCGATCCGG CGGTGCCGTT CCCGGTGGAG GCGCTGGGCC AGCCACGTGG CCGGACCAAC
TCGCCGGGCA AGGTCTTCCT CCCCAACGGG TGGCTGGACA GCCGCGACGA CGACGCCGTG
CCGCAGGGCG CGGAACTGGC CGATTCGGGC GGCTGA
 
Protein sequence
MVTTTKAAPY RPLTIGRHQV WPPVVLAPMA GITNVGFRQL CREQGGGIYV CEMITTVALV 
ERNPKTLRMI AFGGGESPRS LQLYGTDPEV TAAAVRIVVE RDLADHIDLN FGCPVPKVTR
KGGGAALPWR RRLFARLVTA AVDAASPAGV PVTVKMRKGI DDDHLTYVEA GLAAQDAGVA
AVALHGRTAA QRYSGTADWD AIATLKAALD VPVLGNGDIW EADDALRMVA HTGVDGVVVG
RGCLGRPWLF ADLEAAFSGS PQRRLPTLGE VATTMRRHAE LLVEQFSAGA RSAARGERDG
CTDFRKHVAW YLKGFPVGGE LRRSLAMVES LVQLDDLLGK LDPAVPFPVE ALGQPRGRTN
SPGKVFLPNG WLDSRDDDAV PQGAELADSG G