Gene Caci_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3970 
Symbol 
ID8335323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4503084 
End bp4504694 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content65% 
IMG OID644957085 
Producthypothetical protein 
Protein accessionYP_003114688 
Protein GI256393124 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00762799 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.606587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGAGC AGTCGACGTT CACTGTTGCC CAGTTGATCG ATCAGGCGGA GCGGTTCGAC 
GTCCCGGAGG TACAGCGCCT TTTCACCGCT CGACCCGAGT GGGTCGTCCT GCTCATTGAC
TCCCTTTACC GAGGGATCGG GGTTGGCGCG CCTCTCCTGT GGAGTCCGCG CGAGGGTGCC
CCAGATTCCC GCTATCGGTG TGAATCGACA GCCGATTATT GGATTATCGA CGGGCAGGGG
CGCCTTACTG GAACGCTGGC CGCCTTCGGG ATCCGGCCGC CGTGGATCGC CGGCGAGCAG
TGGGAAGCCA TGGGTGGTCC GGAGCGCGAG GTGGCGGTGG CGTTCACCCC GCTCGGCCAA
ACTAATTTCG TACAGTACAA GCCTGGTGAG CGTTGCCAGA TCCGGCTTCG CGATCTGCTT
GATCCGGGGC CAGGGGGACT GTCAAAGCTG CTGCGTGAGA CGACTGGGAC TATGCCGGAC
GCTGTGATGA TCGAGACGCT CGCCGTACTC GCTCAACGGT TGCGCGATGC GGCGTTCCAT
GTGTACTGGC AGGACGGTGG TCTGCGCAAT GTCGTCGACG CGTTCATCCG GCACAATCAG
AGGGGCTCGG GGCGATTCCT GTCCCGTGAA GAGTGTGACC TGGCAGTTCT GGCCCTATCG
TGCCCAGGGC TGCAACGGGA CATCATCGAC CCGGCTGTGG CTGATGTCGC CGCTGCCGGC
TTCCCCCTGA CTCTGGATCG GCGCCGCATC TTCGCAGTCA TGAAGGTCCT GACGCCGGTG
AAACTACGCA TGTGCATGGC GGACAACCCC GATCGGCTGC GGGCTGTCGC ATATACCGCC
GTAGCCGGGG CCCGAGCTGT AGCCGAGTAC CTATCGCGCT GCGGGATCGC GGGCGACGAA
CTGTTCGCCC GCCGTCCACT GGCGTTGGTA CTCGCGACGT TGTTCGCCCG CTTCCCGCAG
TCTGCCTCAC GCGACTTCGC CCGACGCTGG CTGGCCCAGG CGCTGGCCTC AGGACGATAC
GACTTCGGAG GCAACCAGTT CGCCGACAGC GACGCCTCCG CGGTCGCCCG CTGTACGACT
CTGGACGACG CCGAAACCGT CCTGGCGGCT CGGATCGCGC AATTCACTGA ACCGCAGCTT
GACCCGGAGG ACCTGACCAC CAGCCACTCC GCCGCCGGCA AGGCCTGGAC ACTTTACGCT
CTGGCCTGCC ACGCACAGAC CTGCGGCCCG GTCAGCGATC TGGCCGATCC CACGATCGGC
GCCGGCGACC CGGCACTGCA GTTGCACCCC TTGTGGCCAC ACACAGCCAG CAGGACTCGC
CGCACCTTGG CCGCCTACGC GATGATGACC GAGGCCAGCG CGGAGCGCAT CGCGGCGGTC
GGAGGGTTCA CCGTAGATGC CTACCTGGAC CTTCGTTGCT CGGACCAATC ACTACACGCC
CAACAAATCT GTCGCCCCAG CTCCGATACC GACGTCGAGG AAGTGGTTCG TCACCGGACG
GTCGCCCTCG TCGACATGAT CGGCGGCTTT CTAGCGCGAC TTGAACCGCT GGCACCCCCA
CCCTTGGTCG GCGCGGACGT TGCACTGCCA CGCGCGCTGG AAACCGCGTG A
 
Protein sequence
MAEQSTFTVA QLIDQAERFD VPEVQRLFTA RPEWVVLLID SLYRGIGVGA PLLWSPREGA 
PDSRYRCEST ADYWIIDGQG RLTGTLAAFG IRPPWIAGEQ WEAMGGPERE VAVAFTPLGQ
TNFVQYKPGE RCQIRLRDLL DPGPGGLSKL LRETTGTMPD AVMIETLAVL AQRLRDAAFH
VYWQDGGLRN VVDAFIRHNQ RGSGRFLSRE ECDLAVLALS CPGLQRDIID PAVADVAAAG
FPLTLDRRRI FAVMKVLTPV KLRMCMADNP DRLRAVAYTA VAGARAVAEY LSRCGIAGDE
LFARRPLALV LATLFARFPQ SASRDFARRW LAQALASGRY DFGGNQFADS DASAVARCTT
LDDAETVLAA RIAQFTEPQL DPEDLTTSHS AAGKAWTLYA LACHAQTCGP VSDLADPTIG
AGDPALQLHP LWPHTASRTR RTLAAYAMMT EASAERIAAV GGFTVDAYLD LRCSDQSLHA
QQICRPSSDT DVEEVVRHRT VALVDMIGGF LARLEPLAPP PLVGADVALP RALETA