Gene Caci_0834 details

Gene Information

Locus tag: Caci_0834
Symbol:
ID: 8332164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 969657
End bp: 971156
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 68%
IMG OID: 644953985
Product: amino acid/peptide transporter
Protein accession: YP_003111609
Protein GI: 256390045
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
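
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The Python sketch below does that under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 969657
end_bp = 971156

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1500 0 499
```

Under those assumptions the result agrees with the listed gene length of 1500 bp and protein length of 499 aa.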


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 4.22142e-07
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGACCC ACGACGTTCC GTCCCCCGCC CAGGGCCTGC CGCCGCAGGA TGAGGACCGA 
GCGTTCTTCG GTCACCCCAA GGGATTGCAG ACGCTGTTCG CGACCGAGTT CTGGGAGCGG
TACAGCTTCT ACGGCATGCG CGGGCTGCTG GTGCTGTTCC TCACCGACAC CGCGGCGCAC
CACGGCCTGG GGCTCTCGCA GGAGGCCGGC AACAGCTTCT ACGGCATCTA CAACAGCCTG
GTCTACCTGA TGGCAGTCCC CGGCGGCTGG ATCGCCGACC GGGTCTGGGG CGCCAGACGC
TCGGTGCTGT GGGGCGGCAT CATCATCGCG CTGGGCCACT ACGTAATGGC CATCCCCACC
GCGGCGACCT CGTTCCTGGG GCTGGGACTG ATCGTGCTGG GCACCGGGCT GCTCAAGCCG
AACATCTCGG CGCAGGTCGG CGGGCTCTAC CACCAGCACG ACAAGCGGCG GGACGCCGGA
TTCACGATCT TCTACATGGC GATCAACATG GGCGCGTTCC TGGCGCCGCT GACCGCCGGC
TGGGTCGGCC AGCACATCAA CTACCACCTG GGCTTCGGGA TCGCGGCGAT CGGCATGACG
TTCGCGGTCG TCTGGTACGT CGTCGAGGGC AAGCACCTGG GCACCGTCGG GCTGCGCCCG
CCGAAGCCGA TCACGCCGCC GGAGCTGCGG CGCTCGCTGC GGGCCGGCGC GGTGATCGTC
GCGATCGTGC TGGCGATCGT GCTGGGCTGG ATGGCGATCA CCGAGTGGTC GGTCTCGGCG
TTCGCCGACG GGCTGGCCGC GCCGATCATC GCCACGCCGT TCGTCTACTT CGGCTACATG
TTCAGCCGCG GCGGGCTGAG CGCCGGGGAG CGGCCCAAGC TGATGGCGTT CGTCGCCTTC
TTCATCGGCG CGACCGTGTT CTGGATGATC TACGACCAGT CCGGCAGCCA GCTGAACCTG
TTCGCCGCGG ACAAGACCGA CCTGTCGATC TTCGGCTGGG AGATGCCCTC GGTGTGGCTG
CAGTCGGCGA ACCCTTTCTA CATCATGGTG TTCGCGCCGG TCTTCGCCGG GATGTGGCAG
CGCCTGGGCG ACCGGGCGCC GCGGACCTCG GTGAAGTTCG CGCTCGGGCT GGTGGTGATC
GGCTGCTCGT TCTTCGTGAT GAGCATCGCC GGCAAGGACG CCACACCGAC GCACCGCGTC
TCGATCGTCT TCCTGGCGGT CACCTACCTG CTGCAGACGA TCGGCGAGCT GTGCCTGTCC
CCGGTCGGGC TGTCGGTGAC CACGCAGCTG GCGCCGGCCC GGTTCGCCGG GCAGATGCTG
GGGCTGTGGT TCCTGGCCAC CGCCACGGGC AACGCGCTCA ACGTGTACGT CACCAAGCTC
AGCACCGTGA TGAGCGACTT CACCTACTTC CTGACGCTGG GCGCCGTGGC GGCGGGCATC
GGCGTGCTGG TGTTCCTCGC CTCGCCGGTC ATCAACCGGC TGATGGGGGA TGTGCGGTAG
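
The GC content reported for this gene could be recomputed from the sequence rows above. The following is a minimal Python sketch, not the database's own method; `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only unambiguous A, C, G, and T bases separated by whitespace, as in the listing.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGACGACCC ACGACGTTCC GTCCCCCGCC CAGGGCCTGC CGCCGCAGGA TGAGGACCGA"
print(round(gc_percent(first_row), 1))
```

Passing all 25 rows joined together, rather than only the first row, would give the figure to compare against the GC content field.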
 
Protein sequence
MTTHDVPSPA QGLPPQDEDR AFFGHPKGLQ TLFATEFWER YSFYGMRGLL VLFLTDTAAH 
HGLGLSQEAG NSFYGIYNSL VYLMAVPGGW IADRVWGARR SVLWGGIIIA LGHYVMAIPT
AATSFLGLGL IVLGTGLLKP NISAQVGGLY HQHDKRRDAG FTIFYMAINM GAFLAPLTAG
WVGQHINYHL GFGIAAIGMT FAVVWYVVEG KHLGTVGLRP PKPITPPELR RSLRAGAVIV
AIVLAIVLGW MAITEWSVSA FADGLAAPII ATPFVYFGYM FSRGGLSAGE RPKLMAFVAF
FIGATVFWMI YDQSGSQLNL FAADKTDLSI FGWEMPSVWL QSANPFYIMV FAPVFAGMWQ
RLGDRAPRTS VKFALGLVVI GCSFFVMSIA GKDATPTHRV SIVFLAVTYL LQTIGELCLS
PVGLSVTTQL APARFAGQML GLWFLATATG NALNVYVTKL STVMSDFTYF LTLGAVAAGI
GVLVFLASPV INRLMGDVR
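
The protein sequence corresponds to the gene sequence translated codon by codon. The Python sketch below illustrates this on the first row of each listing. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in which codons may serve as start codons), and it does not handle alternative start codons or ambiguous bases. `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGACGACCC ACGACGTTCC GTCCCCCGCC CAGGGCCTGC CGCCGCAGGA TGAGGACCGA"
print(translate(first_row))  # MTTHDVPSPAQGLPPQDEDR
```

The output matches the first 20 residues of the protein sequence. The final three codons of the gene, GTG CGG TAG, translate to V, R, and a stop, which agrees with the protein ending in VR.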