Gene Caci_4363 details

Gene Information

Locus tag: Caci_4363
Symbol:
ID: 8335717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4950660
End bp: 4953551
Gene Length: 2892 bp
Protein Length: 963 aa
Translation table: 11
GC content: 68%
IMG OID: 644957466
Product: protein of unknown function DUF1680
Protein accession: YP_003115068
Protein GI: 256393504
COG category: [S] Function unknown
COG ID: [COG3533] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
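
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function and variable names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4950660
end_bp = 4953551
gene_length_bp = 2892
protein_length_aa = 963


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (unspliced CDS assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.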


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0125989
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.151908
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCTGC CCCATGCGCC GCGGCCTTCC CTGCCGCTTT CCCGCCGCGC CGTGCTGCGC 
ACCGGCGCCC TCGCGGCGGC CGCACTCACC ACCGGTCCGT ACCTGGCGTC CTCCGCCTCG
GCGGCCACGG CTCGTCTCGC GCCGGCCGGC GGCGGCCTGT ACTCCCCGAA CGCCGCTCCC
CTGGCGCCGA CCGCGCTGCT GCGCCTGCCG CCCGGCGCCG TGCGCGCCTC CGGCTGGCTC
GCCGGACAGC TCCAGCTCCA GGTGGACGGT CTGTGCGGCA AGTACCAGGA CACCTCGCAC
TTCCTGAACA AGTCGACCAC CGGCTGGCTC AACCCGTCGC AGACCGGCTG GGAGGAGGTG
CCCTACTGGC TGCGCGGCTA CGGCGACCTC GGCTACGTCA CCGGCAACGC CGCAGTCCTG
GCCGACACCG CTAACTGGAT CAACGGCATC CTGGCCACAC AAGCCGCCGA CGGCTTCTTC
GGCCCGGCTT ACCTGCGCAC CAACCAGAAC GGCCAAGCCG ACTTCTGGCC GTACCTGCCC
CTGCTCCAGG CGCTGCGCAG CTACCAGGAG TACACCGGCA GCCAGCAGGT CCTGAACGCG
ATGACCGCGT TCCTCCGGTT CATGAACGCG CAGCCCGGCT CGGTGTTCTC CGCCTACTGG
CTCTCCTTCC GCGTCGCCGA CGGCCTGGAC GTCGTCTACT GGCTCTACAA CCGCACCGGG
GAGGCCTTCC TGCTCAACCT GGCCGACACG ATGCACGCCA ACAGCGCGAA CTGGCTGAAC
AACCTGCCCA CGCCGCACAA CGTCAACCTG GCGCAGGGCT TCCGCGAACC GGCGGTATAC
GCGCTGCGCT CGGGCCAGTC CGGCATGACG CAGAACGCGT ATCAGAACTA TGCGTCGATC
ATGGGACGCT GGGGTCAGTT CCCCGGCGGC GGCTTCACCG GCGACGAGAA CGGCCGGATC
GGCTACGCGG ACCCCCGCCA GGGCTTCGAG ACCTGCGGCG TGGTGGAGCT GATGGCCAGC
CACGAGCTGC TGAACCGGCT CACCGGCGAC CCGGTCTGGG CCGACCGCTG CGAGCAGCTG
GCGTTCAACA TGCTGCCGGC CACCCTGGAT CCGCAGGGCA AGGGCACGCA CTACATCACC
TCGGCGAACA GCGTGGACCT GTCGAACACC GCGAAGACCC ACGGCCAGTT CAGCAACGCC
TGGGCGATGC AGGCGTACAT GCCCGGCGTG GACCAGTACC GCTGCTGCCC GCACAACTAC
GGCCAGGGCT GGCCGTACTT CACCGAGGAG CTGTGGGCCG CCACGCCGGA CAACGGTCTG
TGCGCGGTGA TGTACGCCCC TTGCTCGGTC ACCGCAAACG TGTCCGGCGG CCACTCGGTC
ACCATCACCG AATCCACCGG GTATCCGTTC ACGCAGTCCG TGACGCTGAC GCTGACCATG
TCCGCCCCGG CAACGTTTCC GTTGTACCTG CGCGTCCCGG GCTGGTGCTC GGCTCCGGCG
GTCGCGGTCA ACGGCGGGCA CGTGAGCGCA CCGGCAGGAC CCGCCTACAC CTCGATCTCG
CGGACCTGGC ACACCGGGGA CACGGTGACG ATCCAGCTGC CTTCCACTCC CGTCGTCAGG
ACGTGGAGCG CGATCGGCGG CGCGCTGTCG GTGTCGAACG GCGCGCTGGA CTACTCGCTG
AAGATCGGCG AGAACTACGT CCAGTTCGCC GGGAACTCCG AGTTCCCCGA GTACGAGGTG
CACGCCACGA CGCCCTGGAA CTACGGGCTC TCGCTGCCCG CGGCGAACCC GGCGGGCGCT
CTGTCCTTCC ACGCCGCCGG CGGCGCTGTG CCAGCGAACC CGTTCACGCA GCAGAGCGTG
CCGGTCAGCA TCACCGCACC GGCCGCGCAG ATCGCCAAGT GGACCACCGA CGATCAGAAC
GTCGCCACGG AGCTGCCGAC CGGACCCTTC CAAACGTCCG GGACGACCAA CGTCACCCTG
ATCCCGATGG GTGCGGCGCG GCTGCGGATC ACCGCGTTCC CCGCCGCCGG CTCCAGCGGC
AACGCCTTCT CCCAGCCCGG CGGCTACTTC CGGCTGTTGA ACGCCAACAG CGGCAAGGTC
ATGGGCGTGT CGAACATGTC CTGGGGCGAC TCGGCGAACG TCGTGCAGTT CGACGACAGC
GGAACCGCCG ACCACGTCTG GCAGCTGCTG GACAACGGCG ACGGGAACGT CCGCATCCGT
AACGCGAACA GCGGTCTGGT GCTCGGCGTG GACGGCATGT CGACGGCGAA CTCGGCGAAC
GTCGTCCAGT TCGAGAACAC CAACACCCTG GACCATGTCT GGACCCTGAT CGACAACGGC
GACGGCCGGA TGCGCATCCG CAACGTCAAC AGCGGACGGG TCGCCGGCGT CGCCAACATG
TCGACCGCCG ACTCGGTGAA CGTGGTCCAG TACGACGACA ACGGCACGGC GGACCATCTC
TGGACCCTGA TTCCCGACGG GCCGGTACGG ATCGTCAACA AGAACAGCGG TCTGGTCCTC
GGCGTGGCGA ACATGTCCAC CGCGAACTCG GTCAACGTCG TGCAGTACGA CGACAACGCC
ACCGCCGATC ACCGCTGGAC CTTCCTGAGC GATTCCGGCG GCTGGTGGCG GATCCAGAAC
CAGAACTCCG GCAAGGTCAT GGGCGTGTCG AACATGGCGA CCACGGATTC GGCGAACGTC
GTGCAGTACG ACGACAACGG CACCGCCGAC CACCTGTGGC GGCTGCGCCC CGGCGGCGGT
CCGTGGTTCC GCATCCAGAA CAAGAACAGC GGCCTGGTGC TCGGCGTGGC GAACACGTCC
ACGGCCGACT CGGCGAACGT CGTGCAGTTC GACGACAACG GGTCCGCAGA CCACCTGTGG
CGGATTCTCT AG
 
Protein sequence
MSLPHAPRPS LPLSRRAVLR TGALAAAALT TGPYLASSAS AATARLAPAG GGLYSPNAAP 
LAPTALLRLP PGAVRASGWL AGQLQLQVDG LCGKYQDTSH FLNKSTTGWL NPSQTGWEEV
PYWLRGYGDL GYVTGNAAVL ADTANWINGI LATQAADGFF GPAYLRTNQN GQADFWPYLP
LLQALRSYQE YTGSQQVLNA MTAFLRFMNA QPGSVFSAYW LSFRVADGLD VVYWLYNRTG
EAFLLNLADT MHANSANWLN NLPTPHNVNL AQGFREPAVY ALRSGQSGMT QNAYQNYASI
MGRWGQFPGG GFTGDENGRI GYADPRQGFE TCGVVELMAS HELLNRLTGD PVWADRCEQL
AFNMLPATLD PQGKGTHYIT SANSVDLSNT AKTHGQFSNA WAMQAYMPGV DQYRCCPHNY
GQGWPYFTEE LWAATPDNGL CAVMYAPCSV TANVSGGHSV TITESTGYPF TQSVTLTLTM
SAPATFPLYL RVPGWCSAPA VAVNGGHVSA PAGPAYTSIS RTWHTGDTVT IQLPSTPVVR
TWSAIGGALS VSNGALDYSL KIGENYVQFA GNSEFPEYEV HATTPWNYGL SLPAANPAGA
LSFHAAGGAV PANPFTQQSV PVSITAPAAQ IAKWTTDDQN VATELPTGPF QTSGTTNVTL
IPMGAARLRI TAFPAAGSSG NAFSQPGGYF RLLNANSGKV MGVSNMSWGD SANVVQFDDS
GTADHVWQLL DNGDGNVRIR NANSGLVLGV DGMSTANSAN VVQFENTNTL DHVWTLIDNG
DGRMRIRNVN SGRVAGVANM STADSVNVVQ YDDNGTADHL WTLIPDGPVR IVNKNSGLVL
GVANMSTANS VNVVQYDDNA TADHRWTFLS DSGGWWRIQN QNSGKVMGVS NMATTDSANV
VQYDDNGTAD HLWRLRPGGG PWFRIQNKNS GLVLGVANTS TADSANVVQF DDNGSADHLW
RIL
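
As a spot check that the gene and protein sequences above correspond, the following sketch translates the first three 10-base blocks and the final line of the gene sequence. It is a minimal illustration, not the pipeline that produced this record. It uses the standard codon assignments, which translation table 11 shares for internal codons, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the last line of the gene sequence, copied from above.
first_blocks = "ATGTCCCTGC CCCATGCGCC GCGGCCTTCC"
last_line = "CGGATTCTCT AG"

print(translate(first_blocks))  # MSLPHAPRPS
print(translate(last_line))     # RIL
```

Each full sequence line holds 60 bases, a multiple of three, so the final line starts on a codon boundary and can be translated on its own.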