Gene Caci_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2049 
Symbol 
ID8333393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2319092 
End bp2320762 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content71% 
IMG OID644955199 
Producthypothetical protein 
Protein accessionYP_003112810 
Protein GI256391246 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.383654 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCTG TGACGAGTTC GATCCTTGAT GATCCCCGTG TTTCGCGGCG TCGGCTGCTG 
ACTTCCGCCT GCGCCGCCGC GGTGGTGGGT GTGACCGGGG TCCGTCCGGC CTCGGCGACG
ACGGCGGCGC CGGTCGCGAA CGGCTTGGAC TACGCCTCGG CGCCGCATCC GTCGGTGGGC
GCGATGGCTG CCGCCGGCTA CGCGTTCGTG GTCCGCTACC TCAGCTACAG CCCGGAGAAG
AACCTCACCG CCGACGAGGC GCGGGCGCTG ACCTCGGCGG GAATCGCAGT GGTCTGCAAC
TGGGAGGCGA CCGCCGACGG TCCGCGCCAG GGCTTCGCGC GGGGCGTCGC GGACGCCACC
GAGGCGGACA AGCAGGCGGC GGCGTGCGGG TCACCGGCGG ATCGGCCGAT CTACTTCAGT
ATCGACTGGG ATGTCCAAGC CGCCGACATG GACGCGGTCA ACGCGTACTT CGACGGCGTC
GCCTCGGTCA TCGGCGTCGC CCGCACCGGC GCCTACAGCA GCTATGACGC GCTCGGCTGG
TTGCTGGCCT CGGGACGGAT CCAGTGGGCT TGGCAGTCGT GCTCGACGGC GTACTCCAAC
GGCCGCAACC GCACGCCGTA TCCCGGCATC CAGCTGTGGC AGAACCGGAC GCCGTTCACG
TTCGACGGCG CCGACGTGGA CGGCGACCAG GCGCTGACGG CGGACTTCGG GCAGTGGGGC
GCCGGGGCGT TCATGGAGCC GCAGGGGAGC GGCGGGCGGA TCGCCGGCGG TGTGCACAGC
GATGGACGGA TCGAGCTGTT CGCGGTGACG CCGAGCGGCG GGATCACGAA TGCCGCCGAG
ACGGCGCCGA ACGGCGTGTG GTCGGGGTGG AGTGATTTCA GTCCGGCTAA AGGCTTCGGG
TTCGCGGCGC GCACGAGTTC GGTGGCGGTC GGACGGCACG CCGACGGTCG GCTGGAGGTC
TTCGCGGTGA TGAGCGACGG CTCGGTGCAG AACCGCTTTC AGGACTCGGC GGGCGGAGCG
TGGTCGGATT GGGGCGTGTT CGCATCGTCG AAGACTGCGA AAGCGTTGAC GGTTGTGGCA
CATGCCGACG GTCGGCTGGA GCTGTTCGCG GCGACGCCGA CCGGTGGGAT CTCGAATAAG
TCCGAGACGA CGCCGAACGG CGCGTGGTCC GGGTGGAACG AGGTCGGACC GCAAGGCGGC
GTCACGGAAA CCGTGAGTGC TGCCCGCCAC GCCGACGGAC GCCTGGAGGT GTTCGCGGTG
ATGAGCGACG GCTCGATGCG CAACCGTGTC GAGACAGCGG CGAACGGCGC GTGGTCCGCT
TGGGGCGTCT ACGGCCCAAC CGGCGGCGCG AACGGGTACG GCGCTCCCGG CACCGTGGCG
GCCGGGGCGC ATCAGGACGG CCGGGTCGAG GTTTTCGCGG TCACGCCGGG CGGCGGCGTC
CGGAACCGGT TCGAGGCGGT GGCGAACGGC GCCGAGTGGT CGAGGTGGGG CGACGGCTTC
GGTCCCGCCG GTCCGGTCAC CGCTGCTTCG GTGACGCGGC ATGCCGACGG TCGGATGGCG
GTCTTCGCCG TGCTGGCCGA CGGGTCGATC TGGAACCGGT CCGAGGCGGT GGCGAACGGC
GCGTGGTCCG AGTGGAACGG GTTCGTTGGG GCTGGAATGG TAAAGCCTTG A
 
Protein sequence
MVPVTSSILD DPRVSRRRLL TSACAAAVVG VTGVRPASAT TAAPVANGLD YASAPHPSVG 
AMAAAGYAFV VRYLSYSPEK NLTADEARAL TSAGIAVVCN WEATADGPRQ GFARGVADAT
EADKQAAACG SPADRPIYFS IDWDVQAADM DAVNAYFDGV ASVIGVARTG AYSSYDALGW
LLASGRIQWA WQSCSTAYSN GRNRTPYPGI QLWQNRTPFT FDGADVDGDQ ALTADFGQWG
AGAFMEPQGS GGRIAGGVHS DGRIELFAVT PSGGITNAAE TAPNGVWSGW SDFSPAKGFG
FAARTSSVAV GRHADGRLEV FAVMSDGSVQ NRFQDSAGGA WSDWGVFASS KTAKALTVVA
HADGRLELFA ATPTGGISNK SETTPNGAWS GWNEVGPQGG VTETVSAARH ADGRLEVFAV
MSDGSMRNRV ETAANGAWSA WGVYGPTGGA NGYGAPGTVA AGAHQDGRVE VFAVTPGGGV
RNRFEAVANG AEWSRWGDGF GPAGPVTAAS VTRHADGRMA VFAVLADGSI WNRSEAVANG
AWSEWNGFVG AGMVKP