Gene Information |
Locus tag | Sare_2054 |
Symbol | |
ID | 5704726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2349512 |
End bp | 2351317 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641271541 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_001536912 |
Protein GI | 159037659 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
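The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon which is not counted in Protein Length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Sare_2054 record
print(cds_lengths(2349512, 2351317))  # (1806, 601)
```

With these assumptions, 2351317 - 2349512 + 1 = 1806 bp, which is 602 codons, or 601 amino acids once the stop codon is removed.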
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000790139 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGTAAGC GGAGCCTCAG CGACTGGCAA CGCCAGGTAC TGGCCGAGCT TGATGCCATC GCGGAAGCGT TCCCGGGAGA AGTCGAAGTG ATGGGACGCC ATCGCATCGA CAAATCCGGG GCGATACGGT TGCGTCTGCG GCTTCGGACC GCCGACCTTC CTCGCGTTGC TCGTGGCATG CCGTTGGGCG AGCATGAGGA GTTCATCGTC ACCGTCGGCG CCTCCAGTCT GACGCCGCCA CGTGTAGAGG TCGATCACCT TCGGTTTCTT CATCATGCGC ACGTACTCCA GGGACATCGC CTGTGCCTCT ACCTCGACCC GTCGAGGGAG TGGGACCCGA TCAACGGCTT TGGCGGTTTC CTCGACCGCC TTGTCGACTG GCTCGCCGAC GCGGCGGGCG ATCGTTTCGA CGCCCAAACC GCGCTGTACC ACGCCGTCGG CGGCGTGCTT CACGCCGCCG ACGGCGCTCC TACCGTCGTC GTGCGAGACA CGATTCCAAC CGGCAAGCGC GCGCACCACG CATTGCTGAT GGCTCGGGCG CCGCACCGAT TCGACCTGCA CCTCAACCGT CCCGCAGAGC CCGACCAAGG CGACCACATC CCCGTAGTCA TACTCGACGC CGATCTCCCC TTCGGCGCAG GGATTGATCT CGGCGCGCTT CTGCAGACGG TCGACGACCG CTACCATGGC CGTCCTGCCC CCGACGAGGC GTTCCTGAAG CCGCGCAGCA CCTCCTGCAC TTCAGCGACG ATGTTGACCG TGCTCGGGGC AAGCGCCATC CGAAAGGCCA CCGGCACACC GCAACGAATA CTCATCGCTG TCCCTCACCC GACTGGTGGC CCTCCGCACC TGCTCGTCGC AAGCATCCCG GGCATCGGAG CCGATCACAT CCGAACACTC GTCAAGGCGG GCAGGAAGAA GAGTTCCATG ATCGACATCG ATCCGGCGAA GATTTATCAG GCGACGCCGC TGGAGTGGTT GCCGGTCTCT GATGAGCGTA AGGAGGTCAC GACGCGTCGC GACTCGGCGC GGCCCGTCGC CGCTTACGCG GGTAGGACTG CCCACGTCTG GGGCTGCGGT GGCCTCGGCT CATGGATCGC AGAGTATCTC GTCCGCGCTG GAGCAAAGAA GGTCATCCTG TGCGACCCGG GCACCATCTC AGGCGGCCTC TTAGTCCGAC AGAACTTCGT GGAGGCCGAT GTCGGCAACA CGAAGGTCGA AGCGCTCGCT CAGCGTCTAC GCGCCATCAG TGACACGGTC GAGGTAGTGG CGACACGGGC CCTGGTCCCC GGCGAAGGGA ATCTCGCCGA GGCTGACCTG ATCATCGATG CCACCGTCAA TATTGCGATC AGCCGACTTC TCGACAGTTT CGCTGCTGCT TCTGACCAGC GCCCGGTGAT GGCCCAGGTC GCTGTCGATG CCCGGACAGG CACGCTAGGC ATCATGACTG TTTCGATGCC GCCACTGGAG GCTGGCCCAC TGACCATCGA CCGGAAAGCT GGAGCGCAAA TCCTCCAGGA TGGAGCGCAC GAGGCATTCC ATTGCCTGTG GAACACTTCG GCATCTGGCG ACGAGATCAT CCCAACGCGA GGCTGCTCGA CACCTACGTT CCACGGGTCT GCCGCCGACC TGGCCGGGGT GGCCGGGAGT CTGACCAGCA TTCTCGGCGC TCATCTGAAC GCCGGAACGT CCGTCTCCGG CACGCACCTG ATCTCGCTCC CGCACAGCGA AGCCGGGCCA TTGCGCGCCT TCGTGCCGGC CCTGGCGCCA CCCCTCGGAA GGCCGCCGGC CGACGACGCT 
GATTGA
|
Protein sequence | MSKRSLSDWQ RQVLAELDAI AEAFPGEVEV MGRHRIDKSG AIRLRLRLRT ADLPRVARGM PLGEHEEFIV TVGASSLTPP RVEVDHLRFL HHAHVLQGHR LCLYLDPSRE WDPINGFGGF LDRLVDWLAD AAGDRFDAQT ALYHAVGGVL HAADGAPTVV VRDTIPTGKR AHHALLMARA PHRFDLHLNR PAEPDQGDHI PVVILDADLP FGAGIDLGAL LQTVDDRYHG RPAPDEAFLK PRSTSCTSAT MLTVLGASAI RKATGTPQRI LIAVPHPTGG PPHLLVASIP GIGADHIRTL VKAGRKKSSM IDIDPAKIYQ ATPLEWLPVS DERKEVTTRR DSARPVAAYA GRTAHVWGCG GLGSWIAEYL VRAGAKKVIL CDPGTISGGL LVRQNFVEAD VGNTKVEALA QRLRAISDTV EVVATRALVP GEGNLAEADL IIDATVNIAI SRLLDSFAAA SDQRPVMAQV AVDARTGTLG IMTVSMPPLE AGPLTIDRKA GAQILQDGAH EAFHCLWNTS ASGDEIIPTR GCSTPTFHGS AADLAGVAGS LTSILGAHLN AGTSVSGTHL ISLPHSEAGP LRAFVPALAP PLGRPPADDA D
|
| |
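The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 (as listed in the record) uses the standard amino acid assignments and that an alternative initiation codon such as GTG is read as methionine at the first position. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers written for this example, and only the first 30 bases of the record are used.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons accepted for bacterial table 11
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS given 5'->3' on the coding strand.

    Whitespace (such as the 10-base grouping in the record) is ignored.
    Translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the Sare_2054 gene sequence
print(translate_cds("GTGAGTAAGC GGAGCCTCAG CGACTGGCAA"))  # MSKRSLSDWQ
```

The output matches the first block of the protein sequence in the record, which begins with M even though the gene starts with GTG rather than ATG.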