Gene Information |
Locus tag | Caci_4046 |
Symbol | |
ID | 8335399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4574747 |
End bp | 4576621 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644957151 |
Product | hypothetical protein |
Protein accession | YP_003114754 |
Protein GI | 256393190 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00161846 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0746205 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGTGA AGCGCGTGAT CACCCATGGA CATGTGAATC AGGAGTGGGC CGACATCGCG TTCGTGCTCC CGGCGGGCGG CGAACTCCAC CCCCACGACC GGATCGCGGT GTTGCAACAC TACGGCCGCG TCGTCATAGC CACCGTCCCG GAAGACGACG CCGACGAGGT GCGCGCCATC GCGCACGGCA GCCTGGTCGC CGCCGGCCGG CGGCACGGCG ACGTGGACGC GGCGACCTGG CACTCGCTGG ATCCTGCTGA GCGCCTCGCC CTCGAGGGCT ACTGGCTCAA CCACTCTGAG GAATACCGGC GGACCAAGGA CGAACGGCCC GGAATGGGGG CCGCCTGGGA TCATCCGGAC TTCCTGCCAC CCGACCCGCC GGGAGGGGTG GACCTCACCG CCGTGGCCGA GATGAACCAC CGGCTGCGCG GATCGGTCGC CATCGGGCTG GTGATGGTCA GCGGGCCCGA CGATCTGGCC TTCAGCGACG ACCAGAAGGC CCGCGTGCTG GCGGAGACCC AGACCGGCCT GTCCTGGTTG GGCTGGATGT GGCAGCTGAA GCGGGTGCGG TACATCGTGG ACGTCTACAC CCCCACCATC ACGACACCCG CCGACCCGTC GGCCCCGGAC AGGGAGAAGG AGGCGGTGTG GCGCAATCCG GCGATGAAGG AGATGAAGCA GAAGGAGGGG AAGGAAGGCG TCGGTGAGTT CGCGGAGTCG TTGCGGAAGA AGCACAGCAC GGACGGCGCG TACGTAGCCT TCTTCATGCA CTACCCCGCC CTGAATTTCG CCTACGCGGA GCCGGGCTGG CCGGAGACGG TCATGCAGTA CTCGAACGGC CCCTGGACGC CGTACGGGAT CGACCGGGTC TTCGCGCACG AGACGTGCCA CATCTTCGGC GCGCCGGACG AGTACCGCGA GTCCCTGTGC CGGTGCGGGG GGAGTCACGG CTACTACCAC ATGCCCAACG ACAACTGCGA CAACTGCGAC GGCCCCCACG AGTTCTGCAT CATGGGCCGC AACTGGTACG ACATGTGCAC CTGGACTCCG CGTCATCTCG GCGCACCCAT CTGGTGGGTC AACGGACAGG ACAAGATCTT CTCCCCGGTG GCGGTCTCCG AGGGCGTCAT CTACTACCAG GGCACGGACA GCTACATCTA CCGGGTCAAG ACCGACGGCA AGCCGGCCGT CAGGTTCGAC AAGGAACAGA CGTCGGCGAC GCCGTGCGTC GTCGGCAAAG TCATTTACTA TCAGGGCACG AACTACCGCC TGTACCAGGT GACGACCGAC GGCACGAGCG GCGGCAGCAT CGGCGAGATC AAGCTGATGG GATCGCCGGT TTACAGCGAC GGCTACCTCT ACTACCGGGG TACGGACAAC TATCTGTACA AGCTGCGCAT CGGCGATACC GAAGGCGAGC GCATCGCCAA CAACAAACTG CTGTCACCGC CGGCCGTCGG CGGAGGATAC ATCTTCTATC AGGGTACGAA CTACTACCTG TACAAGGTGC GCATCGGCGA GACGACCGGC ACGCGCGTCG GTGACGGCGA GTTGCTGTCG TCGCCGTTCG TGAGCGACGG CGCTGTCTAC TTCCAGGGCA CGAACAACGG CCTCTACCGG ATCCCCTTCG ACGGCAAGGT CTCCCAGCAG CTCGGCGAGT GCCGTACCCT GGCGACTCCG TTCGTGGCCG GCGGCTGGGT GTATCACCAG GGCACGGACT GCCTGCCGTG GCGGGTCAAG ACCGACGGCA CGGGCCGGGA GATACTGACC GGCGCACCCA TCCGGTCCAG CCCGGTCGTC 
GCCGACGACG TCGTCTACTT CGAGGGCAAC GACCTGCTGC ACGAGAACCA CCTGTATATG ATCAACCTCA CGTGA
|
Protein sequence | MIVKRVITHG HVNQEWADIA FVLPAGGELH PHDRIAVLQH YGRVVIATVP EDDADEVRAI AHGSLVAAGR RHGDVDAATW HSLDPAERLA LEGYWLNHSE EYRRTKDERP GMGAAWDHPD FLPPDPPGGV DLTAVAEMNH RLRGSVAIGL VMVSGPDDLA FSDDQKARVL AETQTGLSWL GWMWQLKRVR YIVDVYTPTI TTPADPSAPD REKEAVWRNP AMKEMKQKEG KEGVGEFAES LRKKHSTDGA YVAFFMHYPA LNFAYAEPGW PETVMQYSNG PWTPYGIDRV FAHETCHIFG APDEYRESLC RCGGSHGYYH MPNDNCDNCD GPHEFCIMGR NWYDMCTWTP RHLGAPIWWV NGQDKIFSPV AVSEGVIYYQ GTDSYIYRVK TDGKPAVRFD KEQTSATPCV VGKVIYYQGT NYRLYQVTTD GTSGGSIGEI KLMGSPVYSD GYLYYRGTDN YLYKLRIGDT EGERIANNKL LSPPAVGGGY IFYQGTNYYL YKVRIGETTG TRVGDGELLS SPFVSDGAVY FQGTNNGLYR IPFDGKVSQQ LGECRTLATP FVAGGWVYHQ GTDCLPWRVK TDGTGREILT GAPIRSSPVV ADDVVYFEGN DLLHENHLYM INLT
|
| |
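The coordinates, lengths, and sequences in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names (`span_length`, `coding_residues`, `translate`) are hypothetical, and it assumes that coordinates are 1-based and inclusive, that the final codon is a stop codon, and that the first codon is read as methionine because it is an initiator codon (GTG here). Translation table 11 uses the standard amino acid assignments, so the codon table is built from the standard TCAG ordering.

```python
from itertools import product

# Values copied from the record above.
START_BP = 4574747
END_BP = 4576621
GENE_LENGTH_BP = 1875
PROTEIN_LENGTH_AA = 624

# First 30 bases of the gene sequence and first 10 residues of the protein.
GENE_PREFIX = "GTGATCGTGAAGCGCGTGATCACCCATGGA"
PROTEIN_PREFIX = "MIVKRVITHG"

# Standard amino acid assignments in TCAG codon order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_residues(gene_length_bp):
    """Number of residues encoded, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def translate(dna, first_is_start=True):
    """Translate complete codons; simplification: the first codon becomes M."""
    usable = len(dna) - len(dna) % 3
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if first_is_start and protein:
        protein[0] = "M"  # initiator codon (e.g. GTG) is read as methionine
    return "".join(protein)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

Only the first 30 bases are translated here to keep the sketch short; passing the full gene sequence with the spaces removed (and the trailing stop codon excluded) should work the same way.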