Gene Caci_4620 details

Gene Information

Locus tag: Caci_4620
Symbol:
ID: 8335974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5257352
End bp: 5258899
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 70%
IMG OID: 644957720
Product: hypothetical protein
Protein accession: YP_003115322
Protein GI: 256393758
COG category:
COG ID:
TIGRFAM ID:
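
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. Both readings are assumptions, not stated in the record. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(5257352, 5258899)
print(length, protein_length_aa(length))  # prints "1548 515"
```

Under these assumptions the result matches the listed Gene Length (1548 bp) and Protein Length (515 aa).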


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.456962
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00062706
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGCGCG ACTGGACCAG CTTCGACTGG CAGACCGCCT CCGTCACCCA GGGCGACCTG 
ATCCCCGGAG ACCCCTACCA GGTGGCCGCG CTCGGCAAGC GCTTCCGCGA CACCGCCGAC
GCCATCAACA AGCAGGCTGC GAACCTGCGC AACCTGGTCG ACGGCCAGGG CTGGGACGCC
GACTCCGGCC GCGCCTTCGC CAAGAAGGTC GGCGATACCG CGGACCTGCT GCAGAAGGCC
TTCGGCCGGT ACAACGAGGC CGCCGAAGCC CTGGGCAGCA CCGTGCGCCC GTCGGGACCG
CACGACGAGG TCTCCTGGCG CTCCGGCACC GAATGGGCCT CCACCCTGGA GCTGGCGCAG
GTGAAGGCTC AGGACGCGCT GAACCGCGGC CGTGCCGCCG ACGCCGACTC CCAGTCCGCG
GCGCGCCAGC TCAGCACCGC GCAGGGCAAC GCGCTGCCCA AGCCGGTCGC CGGCGCCCCG
GCGACGCCGA ACACGCACCC GGACCCCACC AACACCCCCG GTGCCAACGG CCTGACGCCC
GCGGAGCAGA AGGCGCACAA CGACAAGGCC GCCGCCGACG CCGTGGTGAA CAAGGCCGGC
GCCGACATCC AGCACGCCAT CGACATCCGC GACAGGGAAG GCAAAGCCGT CGCCGGCGCC
ATCAACGACT TCATCAACGG CGACGGCCTG AAGAACCCCA CCCACCACTG GTGGGACGTG
GACTGGAAGG ACCTGGTCGC CGACATCGGC CACATCGCCG GCGCGATCGC CGGCGTGTGC
GGGATCCTGG CGCTGGCGCT GGCCTGGGTG CCGATCCTCG GCGAGGTGCT GGGCGCGATC
GCCCTGATCG CCGGTGCCGT GGCGTTGATC AGCGACACCA TCTCCGCCCT GGACGGCAAA
GGCAACTGGT TCGACGTCGC CATCGACGTC GTCGGCCTGC TGTCCTGCGG CGCCGGCCGG
ATGCTGGGCA CCGCGGCGAA GCTGTCCAAG GGTGCTGAGG CGTTCAACGC CGCGCGCGCC
GGCGGCAAGG GCATCAGCGA GGCGCTGGAG CTCTCGGACA TGTCCGCCAA GGACGTGCTG
GCGCTGAAGA ACGGCAACAG CGTGTTCAAG GTCGCGCGCT CGGAGTTCGG CAAGGGTCTG
ACCACCGGGC CGTTCAAGGA CGTCCTGGGC AAGGTCGGGG ACCTGAAGGC CGGGAACCTG
AAGTTCGAGG CGCCGAGCCT GACCAGCTTC GGGCCGAACT TCGCCAAGGA AGCCGGCTGG
AAGTTCCACC TCGCCGGCTG GTCCAACAGC ACCTTCCCGC TGGGTCTGGG GCTGGCGAAC
CTGCAGATCC CGGAGAACAT GAAGTCCTGG GAGCCGGGGT TCATGAACGT CAACCTGTTC
AAGAGCGACC AGATCCCCAG CTGGGTGCCC GGTGTCGGCG GCGACCACGC GGGCGTGGGC
TGGCTGCACG CCGGCGACTG GAACGCCACC AACTCCGGCA TGGAGAACTA CGACGCCCAG
CCGCTGCGTC CCATCGCCTC CGCTGTGGGC GCCGACCCGG GCAACTGA
 
Protein sequence
MSRDWTSFDW QTASVTQGDL IPGDPYQVAA LGKRFRDTAD AINKQAANLR NLVDGQGWDA 
DSGRAFAKKV GDTADLLQKA FGRYNEAAEA LGSTVRPSGP HDEVSWRSGT EWASTLELAQ
VKAQDALNRG RAADADSQSA ARQLSTAQGN ALPKPVAGAP ATPNTHPDPT NTPGANGLTP
AEQKAHNDKA AADAVVNKAG ADIQHAIDIR DREGKAVAGA INDFINGDGL KNPTHHWWDV
DWKDLVADIG HIAGAIAGVC GILALALAWV PILGEVLGAI ALIAGAVALI SDTISALDGK
GNWFDVAIDV VGLLSCGAGR MLGTAAKLSK GAEAFNAARA GGKGISEALE LSDMSAKDVL
ALKNGNSVFK VARSEFGKGL TTGPFKDVLG KVGDLKAGNL KFEAPSLTSF GPNFAKEAGW
KFHLAGWSNS TFPLGLGLAN LQIPENMKSW EPGFMNVNLF KSDQIPSWVP GVGGDHAGVG
WLHAGDWNAT NSGMENYDAQ PLRPIASAVG ADPGN
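
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following Python sketch does this under stated assumptions. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in the start codons it permits). The sequence is assumed to be given 5' to 3' on the coding strand, as the leading ATG suggests.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listing above.
first_row = "ATGTCGCGCG ACTGGACCAG CTTCGACTGG CAGACCGCCT CCGTCACCCA GGGCGACCTG"
print(translate(first_row))  # prints "MSRDWTSFDWQTASVTQGDL"
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the listed GC content.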