Gene Caci_4304 details


Gene Information

Locus tag: Caci_4304
Symbol:
ID: 8335658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4883528
End bp: 4884556
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 644957407
Product: twin-arginine translocation pathway signal
Protein accession: YP_003115009
Protein GI: 256393445
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
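
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4883528
END_BP = 4884556
GENE_LENGTH_BP = 1029
PROTEIN_LENGTH_AA = 342


def span_length(start, end):
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codon count of an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the 1029 bp span corresponds to 343 codons, one of which is the stop codon, which matches the listed protein length of 342 aa.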


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0262518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00234368
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCTCGT TCGCCACCGT CCCGTACAGC CGCCGCGGAT TCCTCGGCCT GGTCGGCACC 
GCCGCGCTCG CCGCCGGCTG CGGCTCGGGC GCAGCCGGCT CGGCCAAGAA GACCACCAAA
CTTCGCTACC AGGGCTCGGT GGGCACGGTC ACGCCGCCGG AACTCGCCGC AGACCTCGGA
TATCTGGGCC CTGTGACGCT CGACTGGGTC GGCAACACCA CCAGCGGTCC GCAGGACATC
CAGTCGGCGG CCACCGGGCA GACCGACTTC GGCGGAGCCT TCAACGGCGC GGTCGCCAAA
CTGCACTCCG CCGGCGCCCC GATCACCGCC GTCATCAGCT ACTACGGCGT CGATCAGTAC
TCCTACAACG GCTTCTACAC CCTGGAAGGC AGCCCGATCG CCTCCGCGCC CGACCTGTTC
GGCAAGAAGG TCGGCATGAA CACCCTCGGC GCGCACTACG AGGCGGTCCT GGACATCTAC
CTGAGCCGCA ACGGCGTCTC GGACAGCGAC GCCAAGAAGG TCGAGCCGCT GGTCGTGCCG
CCGGTCAACA CCGAGCAGTC GCTGCGCGCG CACCAGATCG ACGTCGCCAC CCTCGGCGGC
ATCCTGCGCG ACAAGGCGCT CGCCGACGGC GGCGTGAAGC AGCTGTTCAC CGACTACCAA
CTGCTCGGCA CGTTCAGCGC CGGGACGTAC GTCTTCCGCA ACGACTTCCT GGCGAAGAAC
CCTGACACGG TCCACGCCTT CACCTCCGGC GTCGGCAAGG CGATCGAGTG GGCCCGCACC
ACACCCCTGC CGGAGGTCGT CGACCGCTTC ACGAAGATCA TCAAAGCCCG CGGCCGCAAC
GAGGACACCT CGACCCTGAA GTACTTCAAG TCCTACGGGA TCGCCGGCAC CGGCGGCGTC
GTCGCCGCCA AGGAATTCGA CACCTGGATC ACCTGGCTCG AACAGCAGGG CCAGATCCCC
AAGGGCAAGG TCAAGGCCAC CGATGTCTAC ACGAACAAGT ACAACTCCTT CGCCAACGGC
GGCAGCTGA
 
Protein sequence
MSSFATVPYS RRGFLGLVGT AALAAGCGSG AAGSAKKTTK LRYQGSVGTV TPPELAADLG 
YLGPVTLDWV GNTTSGPQDI QSAATGQTDF GGAFNGAVAK LHSAGAPITA VISYYGVDQY
SYNGFYTLEG SPIASAPDLF GKKVGMNTLG AHYEAVLDIY LSRNGVSDSD AKKVEPLVVP
PVNTEQSLRA HQIDVATLGG ILRDKALADG GVKQLFTDYQ LLGTFSAGTY VFRNDFLAKN
PDTVHAFTSG VGKAIEWART TPLPEVVDRF TKIIKARGRN EDTSTLKYFK SYGIAGTGGV
VAAKEFDTWI TWLEQQGQIP KGKVKATDVY TNKYNSFANG GS
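
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only: it assumes the standard codon assignments (which translation table 11 shares with table 1 apart from its extra start codons, not relevant here because the gene starts with ATG), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first three and the last blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGAGCTCGT TCGCCACCGT CCCGTACAGC"  # start of the gene sequence
last_block = "GGCAGCTGA"                           # end of the gene sequence

print(translate(first_blocks))  # MSSFATVPYS
print(translate(last_block))    # GS*
```

The two outputs match the first ten residues and the final two residues of the protein sequence listed above, with `*` standing for the TGA stop codon. Passing the full gene sequence to `gc_fraction` would give a value comparable to the GC content field.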