Gene Sare_2054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2054 
Symbol 
ID5704726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2349512 
End bp2351317 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content65% 
IMG OID641271541 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_001536912 
Protein GI159037659 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000790139 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGTAAGC GGAGCCTCAG CGACTGGCAA CGCCAGGTAC TGGCCGAGCT TGATGCCATC 
GCGGAAGCGT TCCCGGGAGA AGTCGAAGTG ATGGGACGCC ATCGCATCGA CAAATCCGGG
GCGATACGGT TGCGTCTGCG GCTTCGGACC GCCGACCTTC CTCGCGTTGC TCGTGGCATG
CCGTTGGGCG AGCATGAGGA GTTCATCGTC ACCGTCGGCG CCTCCAGTCT GACGCCGCCA
CGTGTAGAGG TCGATCACCT TCGGTTTCTT CATCATGCGC ACGTACTCCA GGGACATCGC
CTGTGCCTCT ACCTCGACCC GTCGAGGGAG TGGGACCCGA TCAACGGCTT TGGCGGTTTC
CTCGACCGCC TTGTCGACTG GCTCGCCGAC GCGGCGGGCG ATCGTTTCGA CGCCCAAACC
GCGCTGTACC ACGCCGTCGG CGGCGTGCTT CACGCCGCCG ACGGCGCTCC TACCGTCGTC
GTGCGAGACA CGATTCCAAC CGGCAAGCGC GCGCACCACG CATTGCTGAT GGCTCGGGCG
CCGCACCGAT TCGACCTGCA CCTCAACCGT CCCGCAGAGC CCGACCAAGG CGACCACATC
CCCGTAGTCA TACTCGACGC CGATCTCCCC TTCGGCGCAG GGATTGATCT CGGCGCGCTT
CTGCAGACGG TCGACGACCG CTACCATGGC CGTCCTGCCC CCGACGAGGC GTTCCTGAAG
CCGCGCAGCA CCTCCTGCAC TTCAGCGACG ATGTTGACCG TGCTCGGGGC AAGCGCCATC
CGAAAGGCCA CCGGCACACC GCAACGAATA CTCATCGCTG TCCCTCACCC GACTGGTGGC
CCTCCGCACC TGCTCGTCGC AAGCATCCCG GGCATCGGAG CCGATCACAT CCGAACACTC
GTCAAGGCGG GCAGGAAGAA GAGTTCCATG ATCGACATCG ATCCGGCGAA GATTTATCAG
GCGACGCCGC TGGAGTGGTT GCCGGTCTCT GATGAGCGTA AGGAGGTCAC GACGCGTCGC
GACTCGGCGC GGCCCGTCGC CGCTTACGCG GGTAGGACTG CCCACGTCTG GGGCTGCGGT
GGCCTCGGCT CATGGATCGC AGAGTATCTC GTCCGCGCTG GAGCAAAGAA GGTCATCCTG
TGCGACCCGG GCACCATCTC AGGCGGCCTC TTAGTCCGAC AGAACTTCGT GGAGGCCGAT
GTCGGCAACA CGAAGGTCGA AGCGCTCGCT CAGCGTCTAC GCGCCATCAG TGACACGGTC
GAGGTAGTGG CGACACGGGC CCTGGTCCCC GGCGAAGGGA ATCTCGCCGA GGCTGACCTG
ATCATCGATG CCACCGTCAA TATTGCGATC AGCCGACTTC TCGACAGTTT CGCTGCTGCT
TCTGACCAGC GCCCGGTGAT GGCCCAGGTC GCTGTCGATG CCCGGACAGG CACGCTAGGC
ATCATGACTG TTTCGATGCC GCCACTGGAG GCTGGCCCAC TGACCATCGA CCGGAAAGCT
GGAGCGCAAA TCCTCCAGGA TGGAGCGCAC GAGGCATTCC ATTGCCTGTG GAACACTTCG
GCATCTGGCG ACGAGATCAT CCCAACGCGA GGCTGCTCGA CACCTACGTT CCACGGGTCT
GCCGCCGACC TGGCCGGGGT GGCCGGGAGT CTGACCAGCA TTCTCGGCGC TCATCTGAAC
GCCGGAACGT CCGTCTCCGG CACGCACCTG ATCTCGCTCC CGCACAGCGA AGCCGGGCCA
TTGCGCGCCT TCGTGCCGGC CCTGGCGCCA CCCCTCGGAA GGCCGCCGGC CGACGACGCT
GATTGA
 
Protein sequence
MSKRSLSDWQ RQVLAELDAI AEAFPGEVEV MGRHRIDKSG AIRLRLRLRT ADLPRVARGM 
PLGEHEEFIV TVGASSLTPP RVEVDHLRFL HHAHVLQGHR LCLYLDPSRE WDPINGFGGF
LDRLVDWLAD AAGDRFDAQT ALYHAVGGVL HAADGAPTVV VRDTIPTGKR AHHALLMARA
PHRFDLHLNR PAEPDQGDHI PVVILDADLP FGAGIDLGAL LQTVDDRYHG RPAPDEAFLK
PRSTSCTSAT MLTVLGASAI RKATGTPQRI LIAVPHPTGG PPHLLVASIP GIGADHIRTL
VKAGRKKSSM IDIDPAKIYQ ATPLEWLPVS DERKEVTTRR DSARPVAAYA GRTAHVWGCG
GLGSWIAEYL VRAGAKKVIL CDPGTISGGL LVRQNFVEAD VGNTKVEALA QRLRAISDTV
EVVATRALVP GEGNLAEADL IIDATVNIAI SRLLDSFAAA SDQRPVMAQV AVDARTGTLG
IMTVSMPPLE AGPLTIDRKA GAQILQDGAH EAFHCLWNTS ASGDEIIPTR GCSTPTFHGS
AADLAGVAGS LTSILGAHLN AGTSVSGTHL ISLPHSEAGP LRAFVPALAP PLGRPPADDA
D