Gene Caci_1286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1286 
Symbol 
ID8332621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1459076 
End bp1460860 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content67% 
IMG OID644954433 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003112052 
Protein GI256390488 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGG AAACCGCTGT GATACCTAAG TTCTCGATCC GTGCGGGAGC CGCCGCGGCG 
GCGCTCGCCC TGGTCCTGTC CGCCTGCGGC TCGAGCAGCG GCGCGGGCGG GGGTTCGGCC
ATGGGTCCGG GCTTCGACCT GGCGAGCAAG CAGGTGCTGA ACGCCTCGAC CAAGACCGGC
GGCACCATCA ACCTGGTCTC CTCGCAGTCC TTCGACTCGA TCGACCCGGG GGTCACCTAC
TCCGCGCAGA CCTGGAACCT GTTCCGGATG TTCGCCCGGC CGATGATGGC CTACGAGCAC
ACGCCCGGCG GCAACCAGAT CGTCGGCGAC CTGGCCACCG GCCCGGGCGT GCAGTCCAAC
GGCGGCAAGA GCTGGACCTA CACCCTGCGC GACAACGCCA CCTTCGAGGA CGGCACGCCG
ATCACCTCCG GCGACGTGCG CTGGGCCATC GAGCGCTCCA ACTGGTCCTC GCTGGTCGGC
AACGGCCCGA CCTACTTCCA CAACATCCTC ACGCCGCCGA ACGACCCGCG GTTCACGGAC
CTGGACGTGT ACAAGTCCGG GGACAAGATG TTCGACAACA TCATCGTCAC CACGGACCCG
AAGAAGATCA CCTTCAACCT CCCGCAGGCC TTCGGCGAGT TCGACTACGT GATGACCATG
CTGCAGACCG CTCCGGTCGA GCGCACCGTG GACGAGAAGG ACTCCGGCGA CACCTACGGC
AAGCGGCCGG TCTCCACCGG CGCCTACAAG ATCGCCAGCT ACTCCCCGGG CAAGGAGCTC
AAGCTGGTCC GCAACGCCGC CTACAACCAG CAGTCCGACC CCAACCACAT GCACACCTCG
CTGGCCGACG CCGTCGACGT GCAGCTCGGC GTGGACTCCG GGGAGCGCGA CCAGGAACTG
CTGGACGGCC AGGCCGACGC CGACCTGGGC TCGGCGCTGA CGGTGGCCAA CCACGCCAAG
GTGCTCCAGG ACCCGACGCT GAAGTCGCAG ACCGACGACG CCCCGGACTA CTCGATCGCC
TACTCCTCGA TCAACACCGA GCTGATCCCG GACGTCCCGT GCCGGCAGGC GATCGAGTAC
GCGGTGGACA AGAACACGGT GCTGAACCAG CTCGGCGGGC AGTGGGGCGG CACGGTCGCC
ACGAACCTGC TCACCGCCGG CATCCCCGGC GCGGTCCAGT TCCCGACGTA CACCTACGAT
CCGGCCAAGG CCAAGACCCT GCTGGCGACG TGCAAGACGG CCGACCCCGC GCTGTTCGAC
AGCAGCGGGG CGCTGACCTT CAAGATCGCC GCGCAGACCA ACGCCCCGGA CCTGCAGAAC
GCCGCGACCG CGATCCAGGC CTCGCTGGCC GGCGTGGGCA TCAGCACCCA GGTGACGCTG
TTCCCGTTCG GCCAGTACAG CCAGTACTGC GGCAACCAGG CCTACGCCAA GGCGCACCGG
CTCGGCATGT GCCTGGCCAA CTGGGGACCG GACTGGCTCA CCGGCTACGG CATGCTCGAC
CAGCTGGTGA CCTCCAACGG CATCGCGGCC ACCGGCAGCC AGAACTACGC CTTCCTCAAC
GACACGACGG TCAACTCCCT GGAGAAGGAA GCGCTGTCCA GCTTCGAGCC GAGCACCCAG
CAGCAGGACT GGGTCAAGAT CGACCACCGC GTGATGGACC TGGCGGCGTA CGTGCCGCTG
ATGGACCGGC ACATCATGCG CTTCCGGTCC GCCAAGCTCT CCAACGTGAT GATCGACCAG
GCCGGCAGCG GCGGCTACGA CCTGTCGGTC CTGGGTCTGA AGTGA
 
Protein sequence
MKKETAVIPK FSIRAGAAAA ALALVLSACG SSSGAGGGSA MGPGFDLASK QVLNASTKTG 
GTINLVSSQS FDSIDPGVTY SAQTWNLFRM FARPMMAYEH TPGGNQIVGD LATGPGVQSN
GGKSWTYTLR DNATFEDGTP ITSGDVRWAI ERSNWSSLVG NGPTYFHNIL TPPNDPRFTD
LDVYKSGDKM FDNIIVTTDP KKITFNLPQA FGEFDYVMTM LQTAPVERTV DEKDSGDTYG
KRPVSTGAYK IASYSPGKEL KLVRNAAYNQ QSDPNHMHTS LADAVDVQLG VDSGERDQEL
LDGQADADLG SALTVANHAK VLQDPTLKSQ TDDAPDYSIA YSSINTELIP DVPCRQAIEY
AVDKNTVLNQ LGGQWGGTVA TNLLTAGIPG AVQFPTYTYD PAKAKTLLAT CKTADPALFD
SSGALTFKIA AQTNAPDLQN AATAIQASLA GVGISTQVTL FPFGQYSQYC GNQAYAKAHR
LGMCLANWGP DWLTGYGMLD QLVTSNGIAA TGSQNYAFLN DTTVNSLEKE ALSSFEPSTQ
QQDWVKIDHR VMDLAAYVPL MDRHIMRFRS AKLSNVMIDQ AGSGGYDLSV LGLK