Gene Caci_6626 details

Gene Information

Locus tag: Caci_6626
Symbol:
ID: 8337990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 7635217
End bp: 7638252
Gene Length: 3036 bp
Protein Length: 1011 aa
Translation table: 11
GC content: 68%
IMG OID: 644959720
Product: parallel beta-helix repeat protein
Protein accession: YP_003117313
Protein GI: 256395749
COG category:
COG ID:
TIGRFAM ID:
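
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7635217
END_BP = 7638252


def gene_length(start, end):
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3036
print(protein_length(length_bp))  # 1011
```

Both results agree with the Gene Length (3036 bp) and Protein Length (1011 aa) fields.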


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0442384
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.141702
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTTTGT CCATACTGGG AAGACGGCTG ACGGCCTGGA CGGGCACCGC GGCGTTGACG 
GCCGCGGTCG CGGTGTCGGC GGCTGCGCAG ACGTCGGCCG CGACCGCCGT GACGTTCTAC
GCCTCGCCGA GCGGCAGTGG TTCTGCCTGC TCGCAGGCCG CGCCGTGCTC GTTGTCGGGC
GCGCAGGGCG CCGTTCGATC GCAGTTGGCT GCGACGCCGG GCGCCGACGT GACCGTGCTG
GTGCAGGACG GCACGTACCG GCTGGCCGCC ACATGGGCCT TCGGTGCGGC GGATTCGGGA
AGCGCCGGGC ATCCGGTGGT GTGGCAGGCC GCGCCGGGGG CGCATCCGGT GATCTCGGGG
GCGTCTCAGG TCACTGGCTG GACTCAGGTG GGGACGTCCG GGGTGTGGTC GGCCGTGGTT
CCGGCAGGTA GTGCCAGTCG GCAGCTGTAC GTCAGCGGAG CCGAGGCGCC GATCGCGATG
TCCACGCCGT CCGCCTTGGG GTTCGCCGGC GGTTGGAGTG GCACGTCGAC CGGCTATTCG
ATCGGCGGCG ACGCCGCGGC GTCGGCTTGG TTCGGCGCGC TGAGCGCGGC CCAGGTCGCC
GGGGTGGAGT TCGACTACCC GGGCGGCAAC GGAGCGTGGA CCGAGTCCCG GTGCAGGGTG
GCGAGCTACT CGGCCACCGC GAAGACGCTG ACGATGGCCC AGCCGTGTTG GACCGACACG
ACCGCGCGCG CCTCGTTCAG CCAGGGCAGC GGCGGACTGC CGTCGATGTC GACCGGCACC
ATGCCGACGC TGATCGAGGA TGCCAGGGCT CTGATCCACC CCGGGCAGTG GTTCCTCGAC
TCGGCCGCGA ACACCCTCTA CTACCAGCCC GCCACCGGAC AGCAGATGAG CGCGCTGGAC
GTCGAGCTGC CGCGGTTGGA GTCCCTGCTG CAAGGAGCCG GCACGCTGGC CAAGCCGCTG
CACGACGTGA CGTTCCGCGG TCTGCAGTTC TCCTACGCCA CCTGGAACGC GCCCTCGGCT
GCCTCCGGCT TCGCCGACGT GCAGAGCAAC CTGCGGATGA CCGGGGCGAA AAACCAGGGC
ATGTGCACCT TCTCCTCACC GGCCGGGACT TGCCCTTGGG GCGCGCTGAC CCAGCCGACG
GCGAACGTGG CGTTCACCGC CTCGAACAAC GTCACGCTCA CCGGGAACCG GTTCGCCGAG
CTCGGCGGCG CCGGGCTGAG CGTGATGTAC GGGTCGGCGA ACACCCTCAT CCAGGGCAAC
GAGTTCACGG ACATCGCCTC GACGGCGATT CTGCTCGGCT GTACCTACGA TCCGCTGCCG
ACCGACGCGT CGGAGTCCGC GGGCATCAAG CAGAACTGCA CACCGAACGG CTCCGCGGTC
AGCGCGGACG TGATCGGGAC CAACGAGATC CTCACCGGCA CCACCGTTTC GGACAACATC
GTCCACCACA TCGGCACCGA CTACTCCTCA GCCTGCGGCA TCACGCTGTT GTTCTCGCGC
GGCACCACCA TCACCCACAA CGACCTGTAC GACCTGCCCT ATACCGGCAT CACCGCCGGC
GTCATCCAAG GACACGTCGA CCAGGCCAGC GCGCCGCAGA ACTCGACCAA CATCAACGAG
AACAACACCC TGAGCGACAA CGTCTTCCAC AACTACCTGT CGGTGCGCAG CGACGGCGGC
GCGATCTACG CCGAAGGGCA TCAGACGCAG TACGTCTACC AAAGCGGCGG CACGACGATC
GACCCGGTCC AGACCCTGGC CCATGGTCTC CAGGTGACCG GCAACATCGC TTATCACGGC
CCGACGACCA ACTTCACCTA CTACGACGAC GCGGGCTCGG AGTGGATCAA CTGGCAGGGC
AACGTCGCCT TCGGCGCGGG CTCGGCGTCG CAGGGCGGCT GTAGCCCGAC CGGCCATTTC
TGGATCGTCG GCAACTACTT CTCCAACCAG ACGCAGTACT ACCCGTGCAA CGCGCCGGTC
GATTCCAACG TCAGCGGTAC GACCACCATT TCCGCCACGC CGGCGCCGGG CGACGTTCCC
AACGGCCTGT TCAGCGCGGC CGGTGTGCGG GCTGCCAACT CGGCACTGGC CGTCGCCGCC
GGCCCGAAGA TCTACTACGC CTCGCCGACC ACGAGCACGA GCACGCAGGT GCTCATCGGC
GGCGAAGGAT TCAGCTCCAG CACGCCGGTA TTCGTGGGCT CGACGCAGGT CAGCGGCGTC
CAGTACCTGT CCGGCGGCTT CCTGATCGTG CCCGTCCCGG CCGGAACCCC GTCCTCCCAG
ATCTCCGTCG GCGCGCCCGC CGGCACGAGC CGGCTCAACG ACACCGATCC GTCGATCACC
TACAGCGGCT TCAGCTACTC GTCGAACCGC GGTCTCGGCG ACTACGACGA CGATCTGCAC
TACGCCACGG CGAACGGCTC CACGGCGAAG TTCTCGTTCT CCGGAACCGG CGTCCAGGTC
TTCGGCGAGC AGAACACGGA CCAGGGCAAC ATCGGAATCA GCATCGACGG CGGTACCCAG
CAGACCGTCA GCACCGTTCC CGCTGACGGG CAGCGTCACT CCAACGTGGT CGTGTATGCC
GCCAGCGGGC TCGCGGCCGG GAGTCACACG ATCGTGGTGA CGAAGCTTTC CGGCCAGTAC
GCCACGCTCG ACGGCTTCCA AGCGCTGAAC TCGCGCCTCA ACGACACTGA CCCGTCGATC
GCCTACAGCA GCTTCAGCTA TGCGGCGAAC CGTGGCTTCG GCGACTATGA CGACGACGTG
CACTACGCCA CGGCGAACGG CTCCACGGCG AAGCTGTCGT TCTCCGGAAC CGGCGTCCAG
GTCTTCGGCG AGCAATACAC GGACCAGGGC AACATCGGAA TCAGCATCGA CGGTGGCACT
CAACAGACGG TCAGCACAGT GCCGGCCGAC GGCCAGCGCC ATGCGAACGT CGTCGTATAC
GCGGCGACCG GACTCGCTCG CGGGAGCCAC ACCGTTGTCG TGACGAAACT GTCCGGGCAG
TACACGACCC TCGACGGCTT CGTCATCATT CAGTAG
 
Protein sequence
MGLSILGRRL TAWTGTAALT AAVAVSAAAQ TSAATAVTFY ASPSGSGSAC SQAAPCSLSG 
AQGAVRSQLA ATPGADVTVL VQDGTYRLAA TWAFGAADSG SAGHPVVWQA APGAHPVISG
ASQVTGWTQV GTSGVWSAVV PAGSASRQLY VSGAEAPIAM STPSALGFAG GWSGTSTGYS
IGGDAAASAW FGALSAAQVA GVEFDYPGGN GAWTESRCRV ASYSATAKTL TMAQPCWTDT
TARASFSQGS GGLPSMSTGT MPTLIEDARA LIHPGQWFLD SAANTLYYQP ATGQQMSALD
VELPRLESLL QGAGTLAKPL HDVTFRGLQF SYATWNAPSA ASGFADVQSN LRMTGAKNQG
MCTFSSPAGT CPWGALTQPT ANVAFTASNN VTLTGNRFAE LGGAGLSVMY GSANTLIQGN
EFTDIASTAI LLGCTYDPLP TDASESAGIK QNCTPNGSAV SADVIGTNEI LTGTTVSDNI
VHHIGTDYSS ACGITLLFSR GTTITHNDLY DLPYTGITAG VIQGHVDQAS APQNSTNINE
NNTLSDNVFH NYLSVRSDGG AIYAEGHQTQ YVYQSGGTTI DPVQTLAHGL QVTGNIAYHG
PTTNFTYYDD AGSEWINWQG NVAFGAGSAS QGGCSPTGHF WIVGNYFSNQ TQYYPCNAPV
DSNVSGTTTI SATPAPGDVP NGLFSAAGVR AANSALAVAA GPKIYYASPT TSTSTQVLIG
GEGFSSSTPV FVGSTQVSGV QYLSGGFLIV PVPAGTPSSQ ISVGAPAGTS RLNDTDPSIT
YSGFSYSSNR GLGDYDDDLH YATANGSTAK FSFSGTGVQV FGEQNTDQGN IGISIDGGTQ
QTVSTVPADG QRHSNVVVYA ASGLAAGSHT IVVTKLSGQY ATLDGFQALN SRLNDTDPSI
AYSSFSYAAN RGFGDYDDDV HYATANGSTA KLSFSGTGVQ VFGEQYTDQG NIGISIDGGT
QQTVSTVPAD GQRHANVVVY AATGLARGSH TVVVTKLSGQ YTTLDGFVII Q
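
The sequences above are printed in space-separated blocks of ten characters. Before a value such as the sequence length or GC content can be recomputed, the blocks have to be joined. The following is a minimal sketch, not part of the original record; the function names are hypothetical, and the example string is only the first line of the gene sequence rather than the full gene.

```python
def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGGGTTTGT CCATACTGGG AAGACGGCTG ACGGCCTGGA CGGGCACCGC GGCGTTGACG"

seq = join_blocks(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing the full gene sequence to join_blocks should yield a string whose length can be compared with the Gene Length field, and gc_fraction of that string can be compared with the GC content field.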