Gene Caci_0566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0566 
Symbol 
ID8331894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp650697 
End bp652721 
Gene Length2025 bp 
Protein Length674 aa 
Translation table11 
GC content72% 
IMG OID644953718 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_003111344 
Protein GI256389780 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1179] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.297383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.30294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCCC CGTGGATCTT CACCGCCTTC CGCGCCCCGG CCGACCCGGG CGCGGCGCCG 
GACGCCCTCC AGGAGATACG GCATCTCCGC GCCCGGATCC TCTTCGACGA AGGCCGCCGC
CCGGCCTTCC GCAGCCGCGC CGGCGACCAC GCCGACGACC AGCCGCAGGA TCACGGCGCC
TGGCACTTCG CCGCCCGGCG CGCCCGCGAC GGCGCGCACG GCCCGCCGCT GGGCTACGTC
CGCCTCCTGA CACCCGTGAC CGCTTCCCTT TACCAGTCCA GGGAATTCCT CGGCAGCACC
CACTACGAGG AGGTCCTGAG CACCGAGGGC CTGAACCCCG CGGCAGTGTT CGAACACAGC
CGGCTCGTCG TCGAGCACCG GGCCCGCAAG CTCGGCCTCG GACTGCACCT GAACGCCCTG
GCCGTGGCCG CCGCCCACGA GCTCGGCGCG GCGGCGATGA TCGGCACGTC CGGCACCGCC
GACGGCCAGG ACCGCTTCCA CGCGCGCTTC GGCTTCCACC CGGTCCCCGG CACCCGGCGC
TACGTCGAGA AGTACACCGA AGACGTCGTC ATCCTGCTGC ACCGCACCGA GGAAGGCGCT
GGCGAGCACA CTGCCCTGGT CGACGAGTAC CGGCTGCTGT TCCGCTACGA CCTGATGGAG
GACGCCGGCC TCGCCGAGCC GTCCGTCCCG CTCGCCCGCT CCGCTCAGTC GTCGCTGTCG
TCGCTGTCTT CCCTGTCCTC GCGATCCGCC GCCAAAGCAG AACCCTCATT CACCGTCCCC
GACCGCGGAC CGGTCTGGCG TGACGCCCCG CCGCGCCGCC TCACCGCGAT CGGCCCGTCG
GACCCGCACC GCTGGAAGCC CGTCCTGTTC GAGCCGGCCC AGCCCGACGC CCGCGCGGCG
ATGACCTCGC TGCTCAACAC CGGCGCGGTC CGCGAGGTCC TCGACACCGT CGACGCCCAG
CTCGAGGAAC TGATCCGGGC CCGCGACCCC GGCCGCGTAT TCGACGCCGA CGCCCTCGCC
GACGCCCGCA GCGACCAGCT GGAGGGCGCG CGCCCCTGGG CCTACGGCAC GTGGGCCTGG
TACCCGTGGT CCGGCCGCCT GGTCCACGTC CTGCCCCGCG AAGAGTTCCG CCTCGTCCGC
ACCGACCGCA ACCGCGGCAA GATCGACCGT CCCGAACAGC GCAGGCTGCT GGACAAACGG
ATCGGCGTCA TCGGCCTGTC GGTCGGCAAC AGCGCCGCGC TGACCCTGGC GCAGGAAGGC
GTCGCAGGAG CCTTCAAACT CGCCGACTTC GACACCCTGA GCGTGTCCAA CCTGAACCGC
CTGCGCGCCG GCCTGCACCA GATCGGCGTG AACAAGGCGG TGCTGGCAGC CCGGCAGATG
TTCGAGATCG ACCCGTATCT GGACATCGAG ATCTTCCCCG CCGGTCTGAC CGAGGACACG
GTGAGGGAGT TCTTCCTCGG CGGCCACGGT CCGATCGACC TGCTCGTCGA AGAATGCGAC
ACCCCGTGGG TGAAGCTCGC CGCCCGCGAA GCCGCGCGGG ACCTGGGCGT GCCGGTGGTG
ATGGACGCCA ACGACCGCGG TCTGCTGGAC ATCGAGCGCT TCGACCGCGA ACCGCACCGG
CCGCTGCTGC ACGGTCTGCT CGGCGCGCTG ACCCCCGACG ACTGCCTGGA TCTGACCTTC
GCCGAACGCG TGGACCTGAT CCTGGCCATG GTGGACGCCG ACCGCGTCTC CCCGGCGCTG
GCGGCCAGCA TCCCGGAGAT CGGCCGCACG CTCAGCAGCT GGCCGCAGCT CGCCTCCGGG
GTGGCTCTCG GGGGCGCGCT GGTGACCGAG GCCGCCCGGC GGATCCTGCT CGGCAGCCCG
TGTCCTTCCG GCCGGTTCTA CGTCGATCTG GAGGAGCTGA TCCGCGCCGA CAAAAACGTG
GTGGGCGCTA CGGATTCGCC GGGCGGTACC GCGTCAGGCG ATCCCGGTCC CGATCCTGAT
CCTGATCCGG CTCAGGCCGC GGTCAGCTTG CCCAGCGCCC CGTAG
 
Protein sequence
MDSPWIFTAF RAPADPGAAP DALQEIRHLR ARILFDEGRR PAFRSRAGDH ADDQPQDHGA 
WHFAARRARD GAHGPPLGYV RLLTPVTASL YQSREFLGST HYEEVLSTEG LNPAAVFEHS
RLVVEHRARK LGLGLHLNAL AVAAAHELGA AAMIGTSGTA DGQDRFHARF GFHPVPGTRR
YVEKYTEDVV ILLHRTEEGA GEHTALVDEY RLLFRYDLME DAGLAEPSVP LARSAQSSLS
SLSSLSSRSA AKAEPSFTVP DRGPVWRDAP PRRLTAIGPS DPHRWKPVLF EPAQPDARAA
MTSLLNTGAV REVLDTVDAQ LEELIRARDP GRVFDADALA DARSDQLEGA RPWAYGTWAW
YPWSGRLVHV LPREEFRLVR TDRNRGKIDR PEQRRLLDKR IGVIGLSVGN SAALTLAQEG
VAGAFKLADF DTLSVSNLNR LRAGLHQIGV NKAVLAARQM FEIDPYLDIE IFPAGLTEDT
VREFFLGGHG PIDLLVEECD TPWVKLAARE AARDLGVPVV MDANDRGLLD IERFDREPHR
PLLHGLLGAL TPDDCLDLTF AERVDLILAM VDADRVSPAL AASIPEIGRT LSSWPQLASG
VALGGALVTE AARRILLGSP CPSGRFYVDL EELIRADKNV VGATDSPGGT ASGDPGPDPD
PDPAQAAVSL PSAP