Gene Caci_5034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5034 
Symbol 
ID8336388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5770936 
End bp5773743 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content72% 
IMG OID644958133 
ProductATP-dependent transcriptional regulator, MalT- like, LuxR family 
Protein accessionYP_003115735 
Protein GI256394171 
COG category[K] Transcription 
COG ID[COG2909] ATP-dependent transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.314145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGATCA CAGCTGCGAC GTCTGCACGG AACGCCGCGC GTTACCCCAC GGAGCGCCCC 
AACCGTCTCC CCCTGATCGC CCGGGACGGT CTGGTTGCGG AATTGGACCA GGCCGCGGCC
GGCAAGGTGA CGGTCATCAC CGCACCCGCC GGCAGCGGGA AGTCCTCATT GCTGCGGGCG
TGGGCCGACC GGCAGGACCG GCAGCACCGG CTGGCGATCG TCCAGGTCCC CCGCGACCAG
CCCGACGGCC AGCTGTTCTG GCTCTCGCTG CTGACCACGG TGCGCCGCCT GAGCGGCGAG
CCGATCGCCG CCGATCCGCC CGCCGCCGCG CCGGAGTTCA GCGCCGCGAC CGCCGTCGAG
CGCATCCTGG AGAAGCTGGC GGAGATCCCC GAGCGTGTCA GCCTGCTGAT CGACGACGTC
CACGAACTGA CGGCGCCGGA GACCTTCGCC GATCTCACCC GGCTGCTCGC CGACCTGCCC
GACCACGTCT CGGTGATCCT GGCCACCCGG CGGAACCTGC CGCTGCGGCT GCACCGCCTG
CGCCTGACCG GGCAGCTGAC CGACCTGCGC GCGGCCGATC TGCGCTTCAG CGAGGACGAG
ACCCGCGCGC TGCTGGCGGC GTCGGGGATC ACGCTGTCGG ACGCCGGCGT GGCCCGGCTC
CTGGAGCGCA CCGAGGGCTG GGCCGCCGGG CTGCGGCTGG CGGTGCTGTC CCTGACCGGG
CACCCCGATC CGGAGCGGTT CGTCGCCGAG TTCGGCGGGA GCAACCGCAC GGTCGCGGAG
TACTTGCTCG CGGAGATGCT GGAGCGGCAG CCGCAGGATA TCCAGGACCT GCTGCTGCGC
ACCTCGATCC TGACCCGGAT CAGCGGGGAG CTCGCCGACG CGCTCACCGG CCGCGTCGGC
TCCGAGCCGA TCCTGCTCGA CCTGGAGGAC GCCAACGCCT TCGTGGTCTC CCTGGACGCC
GAGCGCACCT GGTTCCGCTA CCACCACCTG TTCGCCGACC TGCTGCAGCT GGAGCTGCGC
CGAGTCCACC CCGACGAGGG GCGCGGGCTG CACCGCCGGG CCGCGGACTG GTTCGCCGAA
GCCGGGATGA CCGTCGAGGC CATCCGGCAC ACCCAGGCTG CCGGCGACTG GTCCGCCGCG
GCGAACCTGT TCTCCCACCA CGCGTTCGGC ATGATGCTCG ACGGCCAACT GGAGACCATG
CAGTCGCTGC TGGAGGCGTT CCCGACCGGT CCGGCGGCCG ACCTGCCGGC GCTGAGCGTG
GTCCGCGCGA TGGTCGACGT CTGGCACGGC CGCCTGGACG AGGCCGCCGC GCACCTGGCG
GCGGCCAAGG CCGGGCCGGA CCCGGCGCCG CCCTCGGCGC GCGCCGAGGC TCAGATCAGC
GCGCTGGAGC TGGTGCTGGC GCGGCGCCGG GGCGACGTCG ACACGGTGCT GCGGCTGGCT
CGCGTGCTCG CCGTCTCGAC CACTTCCGCT TATGCCTGCC CGCCCGCGTC TGCCTGCGCC
TTAGCCGTCG AAGCCGGTTC TGACTCCGAG TCCGAGTCCG AGTCCGGATC TGACTCGGTC
TCGACGCCTA CCTCTACCTC CGCCGCCGTC TCCGCTGACG AGCTCGCGCT GGGCGCCGAC
CTGCGGGTGC TGGTCCTGAT GAACCTCGGC ATCGTCGAGG CCTGGGCCGG CGGACTCGCC
GACGGCGCCA CGTACCTGAC GCACGCCGTC GAGCTGGCCC GCGTGATCGG GCGGCCGTAT
CTGGAGGTGA CCTGTCTGGC CCAGCTCGCG TTCGCCACCA AGCTGCACAG CTTCGCCGAG
ACCCGGCGCG TGTGCGAGCA GGCGGTCGCG CTGGCGGCGC GGCACGGCTG GGACACCGCC
CCGGTCCTAG CGCCGGCGCT GGTCACCCTG GCGTGCCTGC AGGTGTTCAC CGGCGAGTTC
GAGCACGGTC AGGACACCAT CGACCGTGCG GAGGAAGCGC TGCTCCTGGA CGCCGGAGCC
GACATCGGGC TGTTGCACCG GACCACCAAG GGGATGCTGC ACGCCACTCA AGGCCGCCTG
GCCGAAGCCG AGGCCGAGTT CGCCGCCGCC TACGACCTGC AGTCGCAGAT GCGGTACCCG
CACGCGCTGA GCGGTTACGT CGCGGGCTGG TTGGCCGCCG CCAAGGCGCG GCGCGGGTCG
ACGGACGCCG CGCGCGCCGT CATCGACAAG CTCGATGCCA CGCTGTTCGA CTCCGGGGAG
ATCCGCAACG CCCGCGCGGT GATCGCGCTT CGAGAGGGGG ACGTGGCGGG GGCGCTGGAC
GTCGTCACAC CGGTCACCAA CGGCTCCGCC AGCGCGGTCC ACGCGACCAC CCTGGTCGAG
GCGAACGCGC TGGAGGCGCT CGCATATCAC CGGAGCGGCA TTCCGGCGTC CGCAGCCAGG
GCTCAGGAGG CTGTCGAACG CGCCCTCGCC GCCGCCGAGC CCGAACGGCT GATCCTGCCG
CTGGTCATGG TCGGCGCGGG CGAGGTGCTC GAGACGGTGC CACGGCAGCG ATCGGCGCAC
GCCTCGCTGC TCACGGACAT CCTCGACATC ATCCACGGAT CAGCACCGGC GGCGACGGCT
TCGGACCAGC CGGCAGTCCC GGAGCTCAGT CCCACGGAAC TACGGATCCT GCGTTACCTC
CCCACCAACA TGTCCCGCCC CCAGATCGCC GGCGAGCTGT CGGTATCGGT GAACACGATC
TCCACACACG TCCGCAGCAT CTACGCCAAG CTCCAGGCGA CCGACCGCGC CTCAGCCGTA
CAGCGAGCAC GGGAGCTCCG ACTGCTGGCC GCCGGACCCT CGCGCTAG
 
Protein sequence
MPITAATSAR NAARYPTERP NRLPLIARDG LVAELDQAAA GKVTVITAPA GSGKSSLLRA 
WADRQDRQHR LAIVQVPRDQ PDGQLFWLSL LTTVRRLSGE PIAADPPAAA PEFSAATAVE
RILEKLAEIP ERVSLLIDDV HELTAPETFA DLTRLLADLP DHVSVILATR RNLPLRLHRL
RLTGQLTDLR AADLRFSEDE TRALLAASGI TLSDAGVARL LERTEGWAAG LRLAVLSLTG
HPDPERFVAE FGGSNRTVAE YLLAEMLERQ PQDIQDLLLR TSILTRISGE LADALTGRVG
SEPILLDLED ANAFVVSLDA ERTWFRYHHL FADLLQLELR RVHPDEGRGL HRRAADWFAE
AGMTVEAIRH TQAAGDWSAA ANLFSHHAFG MMLDGQLETM QSLLEAFPTG PAADLPALSV
VRAMVDVWHG RLDEAAAHLA AAKAGPDPAP PSARAEAQIS ALELVLARRR GDVDTVLRLA
RVLAVSTTSA YACPPASACA LAVEAGSDSE SESESGSDSV STPTSTSAAV SADELALGAD
LRVLVLMNLG IVEAWAGGLA DGATYLTHAV ELARVIGRPY LEVTCLAQLA FATKLHSFAE
TRRVCEQAVA LAARHGWDTA PVLAPALVTL ACLQVFTGEF EHGQDTIDRA EEALLLDAGA
DIGLLHRTTK GMLHATQGRL AEAEAEFAAA YDLQSQMRYP HALSGYVAGW LAAAKARRGS
TDAARAVIDK LDATLFDSGE IRNARAVIAL REGDVAGALD VVTPVTNGSA SAVHATTLVE
ANALEALAYH RSGIPASAAR AQEAVERALA AAEPERLILP LVMVGAGEVL ETVPRQRSAH
ASLLTDILDI IHGSAPAATA SDQPAVPELS PTELRILRYL PTNMSRPQIA GELSVSVNTI
STHVRSIYAK LQATDRASAV QRARELRLLA AGPSR