Gene Caci_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3809 
Symbol 
ID8335162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4308806 
End bp4310836 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content73% 
IMG OID644956948 
Producthypothetical protein 
Protein accessionYP_003114551 
Protein GI256392987 
COG category[S] Function unknown 
COG ID[COG4289] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.502089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.51688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGA GGGTGCCGAA TCCACCCCTG TCGGACGAGT ACCTGCCAGG CGCCGACATC 
TCGCCTTACA CCGGCTGGAC TCGTGCCGAC TGGACCGCCC TGGCGGACCG CATGCTTCTC
GCGGCCGACC GCTACGCCTC ACCGGCCAAA GCCCTGATAA CGCCGCCCGG CGCGCCCGGC
GGCTACGGAA CGGCGATCGA CGGCCTCGAA GGCTTCGCCC GGACCTTCCT GCTGGCGGGT
TTCCGGGTCG CCGGGGAGCG CGGCCGGGAC CCGCTGAACC TGCTGGAACG CTACGCGAGC
GGCGTGGCGG CCGGCACCGA CCCGTCGTCG CCGGAACGCT GGACGTCGCC GAGCGAGCAC
GGCCAGGCCA AGGTGGAGGC GGCGTCGATC GCGCTGATCC TCGACATGAC GCGGCCGTGG
CTGTGGGACC GCCTCGGCGC GGGCGTTCAG GAGCGCGTCG TGAACTACCT GGCACAGGTG
GTCGGGGACC AGGACTATCC GAGGACGAAC TGGGTTTGGT TCCGCATCGT CGTCGAGCAG
TTCCTGGCGT CGGTCGGCGG TCCGTGGTCC CTGGAGGACA TGGAGTCCGA CCTGGCCGTC
CACGATTCGT TCGTCCGCGA GGGCGGCTGG TACTCCGACG GTGCCGAGCG CAGCTACGAC
CACTACTGCG GGTGGGCTCT GCACCTGTAC CCGATCCTGT GGAGCCGGAT GGCCGGCGCG
AAGCGGCTCG CCGCCCCGCG GTTGCCCGCC TACACCGAGC ACCTCGACCG ATATCTCCTC
GACGCGGTCC GCCTGGTCGG CGCGGACGGC TCGCCGCTGA TCCAGGGCAG GAGCCTGACA
TACCGCTTCG CGGCGGCGGC ACCGTTCTGG GTCGGGGCGT TCGCCGAGAC CGGGGCGCTG
GATCCGGGCC TGCTGCGGCG GGCCGCCGGG GGAATCGTCA AACACTTCGT CGATCGCGGA
GCGCCGGACG AGGACGGCCT CCTGACGCTC GGCTGGCACC ACGCCTGGCG GCCGATCGCG
CAGAACTACT CAGGGACCGG TTCGCCGTAC TGGGCCGCCA AGGGCATGCT CGGCCTTGCG
CTTCCGGCCG ACCACCCCGT GTGGACGAGC ACCGAACAGC CGCTGCCGGT CGAAGAAGCC
GACCAGCTGG CCGTCATCGC CGCCCCGGGC TGGGCCCTGA GCTCGACCAA GGCCGACGGC
GTGGTGCGCG TCTACAACCA CGGCACCGAC CATGCGCGGC CAGGGGACCG GACCGGCGAC
TCGCCGCTGT ACGCGCGGCT GGGGTACTCG ACGGGGACCG CGCCGATCCT CCTCGACGAG
GGCTGGGACG CGCCGCTCGA CCAGGCGGTC GTACTGCTCG ACGGCGCGGG GAACGCCACG
CACCGGAGCG GGTTCGAGAC GCTCGGCGTG GCAGCGCTCG ACGGTGCGGC GGTACTCGCC
TCACGAGCCC GCTGCCACTG GATCACGCCC GGCGCCGCCG GACCGGATCC CGATCACGGC
TCGGGCCGGC ACGGCGAGGC CCGCGACGCG GCCGTCGTCA CGACGGTCTC CATCGTCCGC
GGCGCATGGG AAGTGCGCTG CGTGTACGTC GACCCGTCCG ACGCGCCCGG CTGGTCCGAC
GTGGCCGCCC TGCGCATCGG CGGCTGGCCG ATCTCGGCCG GCGAGCCTCC GGCCGCCACG
ATCGGCGTCT CGCCCGCGAG CGCGAGCGCC GTCGGCGGCG GCCACACCTC GACCGTCGTC
GGCGTCGTCG GCGACGTCGG CTTCGCGGAT GAGTCCGCGA CCAGCGCCAC CGCCGGCGTC
CACCGCCTCG AAGACGCCAC ACCGCTCGGT GAATGGACCG CGACCCCCTG GCTCCAGACA
CCCCCGCGCG CCGCCACCTG GACCATCGCG GCCCTGGCCC TCAACGGCGG CCGCGTCACA
CCGCGCGTCG TCCTCACGGG CTCCGAGCGC GCGCCGACCG TCGCCATCAC CTGGCCCGAC
GGAGTCACCA CCAGTGCACC GCTGCCCGAC CTCCACCTCA TGTCTGCCTG A
 
Protein sequence
MSARVPNPPL SDEYLPGADI SPYTGWTRAD WTALADRMLL AADRYASPAK ALITPPGAPG 
GYGTAIDGLE GFARTFLLAG FRVAGERGRD PLNLLERYAS GVAAGTDPSS PERWTSPSEH
GQAKVEAASI ALILDMTRPW LWDRLGAGVQ ERVVNYLAQV VGDQDYPRTN WVWFRIVVEQ
FLASVGGPWS LEDMESDLAV HDSFVREGGW YSDGAERSYD HYCGWALHLY PILWSRMAGA
KRLAAPRLPA YTEHLDRYLL DAVRLVGADG SPLIQGRSLT YRFAAAAPFW VGAFAETGAL
DPGLLRRAAG GIVKHFVDRG APDEDGLLTL GWHHAWRPIA QNYSGTGSPY WAAKGMLGLA
LPADHPVWTS TEQPLPVEEA DQLAVIAAPG WALSSTKADG VVRVYNHGTD HARPGDRTGD
SPLYARLGYS TGTAPILLDE GWDAPLDQAV VLLDGAGNAT HRSGFETLGV AALDGAAVLA
SRARCHWITP GAAGPDPDHG SGRHGEARDA AVVTTVSIVR GAWEVRCVYV DPSDAPGWSD
VAALRIGGWP ISAGEPPAAT IGVSPASASA VGGGHTSTVV GVVGDVGFAD ESATSATAGV
HRLEDATPLG EWTATPWLQT PPRAATWTIA ALALNGGRVT PRVVLTGSER APTVAITWPD
GVTTSAPLPD LHLMSA