Gene Caci_5779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5779 
Symbol 
ID8337140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6677676 
End bp6679082 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content71% 
IMG OID644958883 
Productprotein of unknown function DUF21 
Protein accessionYP_003116478 
Protein GI256394914 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACCC TCCTGGGCCT GCTGGCCATC GCGGTGCTCA CCGCCGCCAC CGGCTATTTC 
GTGGCGCAGG AGTTCGCCTA CATCGCCGCG GACCGGGGAC GGTTGCGGCA GCTCGCCGAG
GACGGCGACG CCGCCGCCGA GCGCGCCTTC GAGGTCACCG GTCGGCTGTC GTTCATGCTG
TCCGGGGCGC AGCTCGGGAT CACCGTGACC GCGCTGCTGG TCGGCTACGT CGCCCAGCCG
CTGCTCGGCT CCGGCCTGGC CGATCTGCTG GGGTTCACCG GCTGGTCGCA CGACGCGCGG
CTGTCGCTGT CGGTCGTGGT GGCGCTGGCC GTGGCGACCG TGGTGCAGAT GGTGGTCGGC
GAGTTGCTGC CGAAGAACCT GGCGATCGCC AAGCCGATCG AGGCGGCCAA GGCACTCGGC
GGCTCCACCC TCCTGTATTT GAAGGTGGTC GGTCCGGTCA TCCGGCTGTT CGACGGCGCC
GCCGTCCGGC TGGTCCGCGC CGTCGGCATC GAGCCGGTCG AGGAGCTGCC GCAGGGCGCC
AGCGAGGAGG ACCTGCAGCA CATCATCTCC GAGTCGCACA CGCAGGGCTT GCTGGACACC
GAGCTGTCCG AGCTGTTGGA CCGCGCGCTG GACTTCCGAG GGCTGACAGC CGGGCAGGCC
ATGACGCCGC GGGTGAAGGT GCACACCGTG TCGGCCGAAG CGCCGGTCTC CCTGGTGGTG
GAGATGCTGA TCACCGGCAA CGCCCGGTTC CCGGTGACCG GCCATGACAT AGACGATCTG
ATCGGTGTCG CCGGACTGAC CGAGGTCCTG GCGGTGCCGG CGGCACTGCG GGCCACGACG
CCGGTCCGGG ACGCGTGCGC GCCGGCGCTG CTGGTACCGG AGCACCTTCC GCTCCCCGAG
TTGCTGGAGC GGCTGCGCTC CGAGCACCGG CAGCTGGCGT GCGTCATCGA CGAGTTCGGC
GGCTTCGCCG GGGTGGTGAC GTTGGAAGAC GTCACCGAGG AGCTGGTCGG CGACATCTGG
GACGAGGACG ACCTGGACGA CGAGGTGGTC CGGCGACAGC CAGACGGCGC CTGGAGCGTC
CCGGCACGGA TGCGGATCGA CGAGGCCGCC GACGCCACCG GGATCCCGCT GCCGGAAGGC
GAGCACTACA CGACGGTCTC CGGCCTGGTG CTGGACCGCC TCGGCCGCAC CGCGCGCATC
GGCGACGAGG TGGAGCTGGC GGTCCGCGCG CCGTACACGC AGGACGGGCC CGGGATGCTG
TCGGTGCTGA TCCACATCGC GGCGGTCAGC CGGCAGGTAC CGGCGACCGT GCTGATCACG
ATGGACACCG AAGACCACGC CGAAGACTCC GAACACAGCG CAGACCCCGA ACACAGTGCC
GCCCCCGACG CCCGGGAGGC TTCGTGA
 
Protein sequence
MLTLLGLLAI AVLTAATGYF VAQEFAYIAA DRGRLRQLAE DGDAAAERAF EVTGRLSFML 
SGAQLGITVT ALLVGYVAQP LLGSGLADLL GFTGWSHDAR LSLSVVVALA VATVVQMVVG
ELLPKNLAIA KPIEAAKALG GSTLLYLKVV GPVIRLFDGA AVRLVRAVGI EPVEELPQGA
SEEDLQHIIS ESHTQGLLDT ELSELLDRAL DFRGLTAGQA MTPRVKVHTV SAEAPVSLVV
EMLITGNARF PVTGHDIDDL IGVAGLTEVL AVPAALRATT PVRDACAPAL LVPEHLPLPE
LLERLRSEHR QLACVIDEFG GFAGVVTLED VTEELVGDIW DEDDLDDEVV RRQPDGAWSV
PARMRIDEAA DATGIPLPEG EHYTTVSGLV LDRLGRTARI GDEVELAVRA PYTQDGPGML
SVLIHIAAVS RQVPATVLIT MDTEDHAEDS EHSADPEHSA APDAREAS