Gene Caci_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1333 
Symbol 
ID8332669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1516326 
End bp1519550 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content73% 
IMG OID644954479 
Productprotein of unknown function DUF214 
Protein accessionYP_003112097 
Protein GI256390533 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGCCG CCGCGGTCGC GGTGGTGATG ACGGCGCTGG TGTTGGTGAC TCTCATTGGG 
CTGCATGCCT CGACGGTGCG GGCGGCGTCG GTGCGCAGTG CGCTTAGGGC CATGTCGGCG
GACACGCGGA CGGTGCGCGT GGTCTATGGG TCCGGTACTG AGTGGCAGGC GACGGCTGAT
GAACGCAGCG CGTTCTTCGC GCGGTTGCGG GCCCGAGTTG ACGCGGCGTT CGCCGGGGTT
GGCGCGCGGG TGGACACCAC GGTCGTGTCC GGGCCGTATG TGCTGCTCGA TGCGGGCCCG
TCCAGTACCG ACGCCGTTGC GTTCTGGTCC GGGGACGCGG TGAAGGACAA GGCTCGGCTG
CTGCGCGGGC AGTGGCCCGC CGCGTCGCCC GCGTCGTCCG CGTCGTCCGC CGGCGCCGTT
CCGGCTGTGG TCTCCGAACA GAGCCTGGCC GCGCATCACT GGAACCTCGG CGAGGAGGTT
CGCACCCGGT CCGGGTGGGA GTCCGCCAGC AGTCTCACGT TCCGTATCGT CGGCGTCTAT
CAGCCGATGG ATCCTACCGA CGCCTTCTGG CTGCACGACG GGGATCAGGG CACTGTTTTC
GTCGACTCCT CGGTCACGGC GCGCGCCGAC ACTGTGCTGG CGCAGGTTGC CGTCGCCGCG
GCTCCCGACT CCGCCGGCTT TCCGGCCGGG CGGCTGCCGC GGTTCGCCGG TCAGCTCGCG
GCTTTCGACG CCTGGGTACC GACCACGTCC TCCCCCGGCG GCGACTTCGC CGCGTCGGTC
GAGGACCGTC TCGCGCCGCG GATGAGGGCG CTGGTCGGGG TGACGGCGGT CAGCGTGCGC
CTGGAGTGGC TGGTGGCCGC CGAACTCGGT GTGTTGTCGG CGGCGGCGCT GGCGCAGATC
GCGCGGGCGC TGGCCGATCG CCGACACGCC ATGGACGGCC TGCTTCGCGC GCGTGGCTCG
TCGATCCGTG CGCTGGGGCG GGTTCAGATC GTCGAAGGGC TCCTCCTCGC CGTGCCCTGT
GCGCTGCTGG CGCTGTGGGC GGCTCGCTGG CTAGAGCGGC GGATGGGGGC GACGTGGGCG
CACGGGGACT CCGGCGCCGC GGCTTTGCTG GACGGCGCGG ACCGCGTCCA GACCGCGCAG
TGGGTGGTCG CGCTGATCGG CGCGGTGCTG GCCGCGGTCC TGCTCGGCGC GGTCGGCACG
CGCTCGGCGC TGACCCACAG CACGGTGGCC CGCGTCCGCC GCCGCTCCAC GCTGCGCGGC
ACGGTGCAGC GGGCCACGAT CGACCTCGCA GTGCTGGCCT TCGCGCTGGC CGCGTTGTGG
GAGCTCCTCC AGGTCGGGGA CGATGCCTCC CGCACCGGTT CGCTGCCCTG GCCGCAGGTG
GTGGCGCCGG CGGTGCTGCT GTTCGCCGGG GCGCTCATCG CTTTGCGGCT CCTGCCCTTC
CTCGCCGCGC CGGCCGGCCG GATCGCGGCG TACAGCAAGG GCCCGCTCGC CGCCCTGGCC
GGGTGGCGCC TCGGCCGTGC CGCGCGGACC CAGGCCCTTC CGATGGCGCT CGTGGTCGTC
GCGGTGGCCA TCGGGGTGCA AGCCGGCGTT CTGCTGTCCT CGGCCGATCG GTCGGACGCG
GACCAGGCCG CCTTCGTGGT CGGCGCCGAC GTGCGGGTCG CAGGGCTCGT GTCCACGGGT
CCCCAGACGC TGCACGCGCT GGCGACCACG CCCGGTACCG GCAAAGGGTT CGCCGCGCTC
CGCGCCTCCT ACCGCCTGGT CCGCGCGGAT CAGGCTGCTG AGACCCATGG AGTGGACAAC
ACCACGCCGA CGGCTGACGT GCTGGGTCTG GATCCGGCCC GCTCCGACGG CGTCCTCATG
CTCCGCAACG ACCTCGCGGG CGGCCGAAGC TGGTCCCGGA TCAGCCCTCT GTTAAGCACC
GATGGCTGGA ACGCGCACGG CCAGTCTGGG ATCGCACTGC CGGGAACACC GACTCAGTTG
TCTGCGAACA TCACCTACCG CCCCGGTACC AGCAGCACGT GCACGGACGG TCCGCTTCAG
GTGTCCGCGC ACTTCACCGA CGCTTACGGC TTTCCAGGCT CGGCGGTCCT CGGCACGATC
GCCGCACCGG ACGGCCTGGT CCATTCGCTG ACCGGGACTT TGATCGGAAG CGGTGCGCAC
GTCGCCTCCC CGGTCAGCAT CACCGGCTTG TCGGTCACGT CGGCCACGCT GTGCCCGATC
GGGCAGGTGA GCGTCGGCGC CCTGCGTGCC GACGGCAAGC CGGTGGCCCT CGCAGCCGGT
ATGGGTTTCG CGGCACCCGC CGTGTCCACC GATGCCGGTC TGGCGGGCGC CGATGCCAGT
GTGAAGGGCA TCACCGGGCA GCCGACCGAG CTTTTGCGGC TCACCGAGAC CGGCCGAACT
GGCCCGCTGG TCGTCCCCGC GATCGTCACC CGACACCTGC TGGCATCCCT GGACAAGCAC
GTCGGCGACA CCTTCGCCAC CGGTGCCTTC TCCCAGCCCG TGACCCTGCG CGTCGTCGCC
GAGATCGCCG GCGCCCCCGG CACCGCCGAC GGCGGCCAGG ACGCCGTGAT CGTGCCGATC
GGGCTGCTGG ACCGCGAGGC GGCCCGGGCC GCGGCCGGTC CCGGGCAGAC CGGCCAGCAG
ATCGGGCCGC TGGCCGGCGA GTGGTGGGCC GACACCGACG GCGGCGTCTC GGCACCCAGA
TTGGCGGACC GGTATCGCAG TACCCTGAAC GGCGCACATC TCGCGCAGCC CGCGACCGTC
GAGGACCGCG CCAGCCAGGA GAAGGCGCTG CGCGACTATC CCTTCGCCGC CGGCTTCAGC
CTGACGCTCA AGCTCGGCTC CGTCGCGGCC CTGGCGTTCG TCCTGCTCGG CCTGGCCCTG
CACACCGTCT CCACGCTGCG CGAACGGGCC GGCGAGCTGG CGGTGCTGGA CGCGCTCGGC
CTGACCCGCC GCCGGGCCGC GTGGCTGCTG CTGGCCGAAC AGGCCGGCGT CGCGGTGCTC
GGCGCGCTCG CCGGGCTCGG CCTGGGCGCC CTCGCCCTGC GATCGGAGTT CAAGCTCATG
GTGTTCACGC CCTCCGGCGC TCCGCCGACG CCCCCGGCGG TCCGGGTCTA CGACTGGCCG
GCGCTGGGTC TGGCCGGGGC GGCCGCGGCG GTGTTCGGGG TGGTGGCGGT GCTGGTCACG
TTCGCGGTCG GCGAGCGCGC CTCATTCCGT GCGGAGGCGG ACTGA
 
Protein sequence
MLAAAVAVVM TALVLVTLIG LHASTVRAAS VRSALRAMSA DTRTVRVVYG SGTEWQATAD 
ERSAFFARLR ARVDAAFAGV GARVDTTVVS GPYVLLDAGP SSTDAVAFWS GDAVKDKARL
LRGQWPAASP ASSASSAGAV PAVVSEQSLA AHHWNLGEEV RTRSGWESAS SLTFRIVGVY
QPMDPTDAFW LHDGDQGTVF VDSSVTARAD TVLAQVAVAA APDSAGFPAG RLPRFAGQLA
AFDAWVPTTS SPGGDFAASV EDRLAPRMRA LVGVTAVSVR LEWLVAAELG VLSAAALAQI
ARALADRRHA MDGLLRARGS SIRALGRVQI VEGLLLAVPC ALLALWAARW LERRMGATWA
HGDSGAAALL DGADRVQTAQ WVVALIGAVL AAVLLGAVGT RSALTHSTVA RVRRRSTLRG
TVQRATIDLA VLAFALAALW ELLQVGDDAS RTGSLPWPQV VAPAVLLFAG ALIALRLLPF
LAAPAGRIAA YSKGPLAALA GWRLGRAART QALPMALVVV AVAIGVQAGV LLSSADRSDA
DQAAFVVGAD VRVAGLVSTG PQTLHALATT PGTGKGFAAL RASYRLVRAD QAAETHGVDN
TTPTADVLGL DPARSDGVLM LRNDLAGGRS WSRISPLLST DGWNAHGQSG IALPGTPTQL
SANITYRPGT SSTCTDGPLQ VSAHFTDAYG FPGSAVLGTI AAPDGLVHSL TGTLIGSGAH
VASPVSITGL SVTSATLCPI GQVSVGALRA DGKPVALAAG MGFAAPAVST DAGLAGADAS
VKGITGQPTE LLRLTETGRT GPLVVPAIVT RHLLASLDKH VGDTFATGAF SQPVTLRVVA
EIAGAPGTAD GGQDAVIVPI GLLDREAARA AAGPGQTGQQ IGPLAGEWWA DTDGGVSAPR
LADRYRSTLN GAHLAQPATV EDRASQEKAL RDYPFAAGFS LTLKLGSVAA LAFVLLGLAL
HTVSTLRERA GELAVLDALG LTRRRAAWLL LAEQAGVAVL GALAGLGLGA LALRSEFKLM
VFTPSGAPPT PPAVRVYDWP ALGLAGAAAA VFGVVAVLVT FAVGERASFR AEAD