Gene Caci_5370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5370 
Symbol 
ID8336724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6191694 
End bp6193583 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content69% 
IMG OID644958468 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003116070 
Protein GI256394506 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.358498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAACC CCAACCCACG ACGACGTGCT TTCGCCCTGG CGGCGGCCCT GGCCGGGGCG 
GTGGCGCTGA GCGCGCCGGC CGCGGCCGCA TCCGGGCCGG CGCAGGCGCG GGCGCGCAGC
TCTTTTAGCC AGACCTCTGA CGCTCAGGCG GCGCCGGCTT CGGCGGCTGC CAGCGGCAAG
ACCTTGACCG TGGCGACCAC CGGCAGCATC GACTCCCTGT CGCCGTTCCT GGCGCAGCGG
GCGCTGCCCA CCCAGATCCA CCGCCTGATC TACGACTTCC TGACGAACTA CGACGCCTCC
GACGACCACG CGATCGGCGC CCTGGCCACC TCCTGGACCA CCTCGACGGA CAAGCTGACC
TGGACCTTCA CCTTCCGCGA CGGAATGAAG TGGTCCGACG GCCAGCCGGT CACCGCCGCC
GACGCGGCCT TCACCTACAA CCTGATGATG ACCAACGACG ACGCGGCCAC CGCGAACGGC
AACTTCGTCA CCAACTTCGC CAAGGTCACC GCGACCGGCA ACCAGCTGGT CATCACCTTG
AAGCAGCCGC AGTCCACGAT GCTCGCGCTG GACATCCCGA TCGTGCCGCA GCACGTCTGG
GCCTCGCACG TCGCCGACAT CGCCACGTTC AACAACGACG CCCAGTTCCC GGTCGTGGGC
GACGGGCCGT TCATCCTCAC CGGCTACCAG AAGGACCAGT ACCTCACCCT GGACGCCAAC
CCGAACTACT GGCGCGGCAA GCCCGGCTTC GACCACCTGG TGTTCAAGTT CTTCAAGGAC
GCCGACGCCG AGGTGGAGGC GCTGAAGAAG GGCGAGGTCG ACTTCGTCAG CGGCCTGACC
CCGGCGCAGT ACGACGCGCT GAAGGGCCAG TCGGGCATCG CCACCAACAA CGCGCAGGGC
AAGCGGTTCT ACGCCCTGGC GATGAACCCC GGCGCGACCA CCACCACCGG GCAGGCGTTC
GGCGACGGCA GCCCGGCGCT GCAGAACCAG CAGTTCCGCC AGGCGCTGAT GTACGCGATC
GACACCAAGA CGCTGGTCGC CAAGACCCTC GGCGGCTACG GCACGGTCGG CAGCGGCTAC
ATCGCCCCGA TCTTCGCCGC CTACCACTGG GCTCCGGACC CGGCCACCGC CTACACCTAC
GACCCGGCCA AGGCGAACCA GATGCTGGAC GCCGCCGGGT TCAAGAAGGG CTCGGACGGC
ATGCGCACGC TGCCCGACGG CAAGCCGCTG AAGCTGCGCC TGATGGGCGA GACCAACCGG
GCCGACGACA CCCAGAACGT CGCCTACGTC GCCGACTGGC TCAAGGCCGT CGGGATCGCC
ACCACCACCA CGGTCGTGGA CCAGGGCAAG CTCGCCGACA CCGAGACCGC CGGCACGTTC
GACCTGGCCT TCGACAGCTG GGGGGAGAAC CCGGACCCGG ACGCCGTGCT GTCGATCCAG
AAGTGCGACG GCCGGCCCGC CGCGCAGGGC AAGAACTTCA ACGGCGACGA CTTCATCTGC
GACCAGGACT ACGACGCCCT GTACCAGAAG CAGATCACCG AGTACGACCC GGCCGCGCGC
GCCGCCGACG TCAAGCAGAT GGAGCAGAAG CTCTACACCG ACGCCTACAT CAACGTCCTG
TATTACGGGA ACGTGCTGGA GGCCTACCGC TCCGACGTCA TCGGCTCCAT GGACAAGCAG
CCGCAGCCCA ACGGCCTGTA CTGGGGTCAG GACGGCTACT GGTCCCTGTG GTCGGCCAAG
CCCGTGGCCG CCTCCTCCTC GTCGTCCTCG TCGAGCTCGA ACACCGGTCT GATAGTCGGC
ATCGTGATCG CGATCGTGGT GGTCGGCGGC GGCGGTGCCC TGCTCCTGAC CCGCCGGCGC
CGCGGCACCA CCGCCGACGA ACGCGAGTAG
 
Protein sequence
MPNPNPRRRA FALAAALAGA VALSAPAAAA SGPAQARARS SFSQTSDAQA APASAAASGK 
TLTVATTGSI DSLSPFLAQR ALPTQIHRLI YDFLTNYDAS DDHAIGALAT SWTTSTDKLT
WTFTFRDGMK WSDGQPVTAA DAAFTYNLMM TNDDAATANG NFVTNFAKVT ATGNQLVITL
KQPQSTMLAL DIPIVPQHVW ASHVADIATF NNDAQFPVVG DGPFILTGYQ KDQYLTLDAN
PNYWRGKPGF DHLVFKFFKD ADAEVEALKK GEVDFVSGLT PAQYDALKGQ SGIATNNAQG
KRFYALAMNP GATTTTGQAF GDGSPALQNQ QFRQALMYAI DTKTLVAKTL GGYGTVGSGY
IAPIFAAYHW APDPATAYTY DPAKANQMLD AAGFKKGSDG MRTLPDGKPL KLRLMGETNR
ADDTQNVAYV ADWLKAVGIA TTTTVVDQGK LADTETAGTF DLAFDSWGEN PDPDAVLSIQ
KCDGRPAAQG KNFNGDDFIC DQDYDALYQK QITEYDPAAR AADVKQMEQK LYTDAYINVL
YYGNVLEAYR SDVIGSMDKQ PQPNGLYWGQ DGYWSLWSAK PVAASSSSSS SSSNTGLIVG
IVIAIVVVGG GGALLLTRRR RGTTADERE