Gene Caci_3782 details

Gene Information

Locus tag: Caci_3782
Symbol:
ID: 8335135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4273664
End bp: 4275031
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 69%
IMG OID: 644956922
Product: nitrilotriacetate monooxygenase
Protein accession: YP_003114525
Protein GI: 256392961
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
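
The lengths listed above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated 1368 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
start_bp = 4273664
end_bp = 4275031


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1368
print(protein_length(length_bp))  # 455
```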


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAG ACAAGACCGG CAAGAAGCTC CATCTCAACG CGTTCCTCAT GGACACCGGG 
CATCACGAGG CGTCGTGGCG GCTGCCGGAG TCCGATCCGT ACGCGGTCTG GGACGTCGAG
TACTACAAGC GCGTGGCGCG GATCGCCGAG CGCGGCAAGC TGGACTCGAT CTTCTTCGCC
GACAGCCCGG CTCAGGGCCA CGATCCCACG CGGCGTCCGC CAGGCAAGCT CGAACCGACC
GTGCTGCTGA CCGTCATCGC CGGCGCCACC GAGCACATCG GGCTGATCGC CACCGCGTCC
ACCTCGTACA ACGAGCCGTT CAACCTGGCC CGGCGCTTCG CCTCGGTGGA CATCGCCTCC
CGCGGCCGGG TGGGCTGGAA CATCGTCACC ACGGCCGGCG CCGACGCCGC CCGCAACTTC
GGCCTGGACG ACGTGCCGCT GCACAAGGAG CGGTACGACC GGGCCGATGA GTTCCTCGAC
GTCGTCACCA AGCTGTGGGA CAGCTGGGCC GACGACGCGA CGGTCGCCGA CAAGGAGGCC
GGCGTCCACA CCCTGCGGGA GAAGGTGCGC GCGATCAACC ACCGCGGCCG GTTCTTCCGC
GTCGACGGTC CGCTGAACTC GCCGCGCCCG CCCCAGGGCT GGCCGCTGCT GGTCCAGGCC
GGTTCCTCGC AGGACGGCAA GGAGTTCGCC GCGGCCTGGG CCGAGGCGGT GTTCACCGCG
CAGCAGACGC TGGAGGATTC GCAGGCCTTC TACTCCGACC TGAAGGCCCG GACGGCCGCG
CACGGACGCG ACCCCGAAAG CATCAAGATC CTGCCCGGCA TCGTGCCGGT CATCGGCGAC
ACCGAAGCCG AGGCCCGCGA GATCGAGGAC CACCTGGAGC GGCTGATCGA CCCCGAACAC
CAGAAGCGCA ACCTCGCCGC CCGTTTCAAG CTGGACCCGG ACGCCCTCGA CCTGGACCAG
CCGCTGCCGC TGCACCTGCT GCCGAAGGAG GACGAGATCG AGGACGCCAA GAGCCGGTAC
ACGCTCATCG TCGACCTGGC CCGGCGGGAG AACCTGACCG TGCGCCAACT GATCGCCCGC
CTCGGCGGCG GCCGCGGCCA CCGCACCTTC ACCGGAACCC CGGTGCAGAT CGCCGACACC
CTCCAGCACT ACTTCGAGAA CGGCGCCGCC GACGGCTTCA ACATCATGCC GGCCGTCCTC
CCCTCCGGGC TGGAGGCGTT CGTCGACAAG GTCGTCCCGA TCCTTCAAGA ACGCGGCCTG
TTCCGCACGG AGTACGAAGG CGCCACGCTG CGCGAACGCT ACGGACTGGC GCGCCCGGCC
AACCGGATCC ACGATGTGGT CGACCGACGG GTGGGTACCG CTTCGTGA
 
Protein sequence
MSTDKTGKKL HLNAFLMDTG HHEASWRLPE SDPYAVWDVE YYKRVARIAE RGKLDSIFFA 
DSPAQGHDPT RRPPGKLEPT VLLTVIAGAT EHIGLIATAS TSYNEPFNLA RRFASVDIAS
RGRVGWNIVT TAGADAARNF GLDDVPLHKE RYDRADEFLD VVTKLWDSWA DDATVADKEA
GVHTLREKVR AINHRGRFFR VDGPLNSPRP PQGWPLLVQA GSSQDGKEFA AAWAEAVFTA
QQTLEDSQAF YSDLKARTAA HGRDPESIKI LPGIVPVIGD TEAEAREIED HLERLIDPEH
QKRNLAARFK LDPDALDLDQ PLPLHLLPKE DEIEDAKSRY TLIVDLARRE NLTVRQLIAR
LGGGRGHRTF TGTPVQIADT LQHYFENGAA DGFNIMPAVL PSGLEAFVDK VVPILQERGL
FRTEYEGATL RERYGLARPA NRIHDVVDRR VGTAS
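
The relationship between the two sequences above, and the GC content reported for the gene, can be checked with a short script. This is a hedged sketch: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is built by hand, and it relies on translation table 11 sharing its amino acid assignments with the standard genetic code. It does not handle the alternative start codons that table 11 permits, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, matching the usual layout of
# genetic code tables; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGAGCACAG ACAAGACCGG CAAGAAGCTC CATCTCAACG CGTTCCTCAT GGACACCGGG"
print(translate(first_line))  # MSTDKTGKKLHLNAFLMDTG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field.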