Gene Caci_7779 details

Gene Information

Locus tag: Caci_7779
Symbol:
ID: 8339155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 9019113
End bp: 9020906
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table: 11
GC content: 67%
IMG OID: 644960863
Product: extracellular solute-binding protein family 5
Protein accession: YP_003118444
Protein GI: 256396880
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
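
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes one trailing stop codon; neither convention is stated in the record, although both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 9019113
end_bp = 9020906
gene_length_bp = 1794
protein_length_aa = 597


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under those assumptions both comparisons hold, so the listed coordinates, gene length, and protein length agree with each other.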


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.354248
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTCA GCAGGAGGAC ACTCGCGCTG CAGACCGGAG CGATCGCGGC GGCGGTGGCC 
TTGGCCGCCA CCGCCTGCGG CAGCAGCAAG AGCGGCGGAG GATCCACCAC CGGTTCCGGT
TCGGGCGCCG GCACGCCCAT CGCCGACCGG AACTCGGTCA ACGCCGCGAC CGTGAAGCAG
GGCGGGAAGA TCACCTGGAC CATCGAGAAG ACGGTCCAGG ACTGGAACCC GCTCACCTCT
CTGGGCAACA CGTTCGACTA CGCGCAGACC ACGAACGGCA TCTACCCGGA CGTCTACGTC
CCGCAGCCGG ACTACTCGCT GGTGCTGAAC ACCGACCTGA TGGCCGGGGA CCCGGTGGTC
ACCAACGCCA CCTCGACCGA GCCGCAGAAG ATCGTCTACA AGATCCAGCC GAACGCCAAG
TGGTCCGACG GGACCCCGGT CACCGCCGAC GACTTCATCT ACCTGTGGCA GGCGCAGAAC
GGCACCAACC CGAACGTCGA CGTGGCCAGC ACCACCGGCT ACAGCGACGT GGCCTCGGTG
ACCGGCAGCG ACAACGGCAA GACCGTGACC GTCGCCTTCA AGCAGGACAA GCCCTTCTCG
GACTGGAAGA GCCTGTTCAC CTCGATCCTG CCGGCGCACG TCGCCAAGCA GCACGGCGAC
GTCGCGGCCT CCTTCACCTG GCTGGACGCC AACCCCCCGA CGGTCTCCGC CGGCCCGTTC
GAGATCGCCC CCGGCGGCGT CTCGGCCGAC AAGAGCCTGA TCAAGACGAT CAAGAACCCG
CAGTACTACG GCAAGCCGGC CAACCTCGAC GAGGTCGACT TCCGCGCGAT CACCGACTCC
TCGCAGGAGC CGACCGCGCT GGCCAACGGC GAGGTGGACG GCATCTACCC GCAGCCGCAG
CTGGACCTGG TGAACCGGGT CAAGAGCATC GCCGGCGTGG ACTACCACAT CAACCAGGGC
CTGGTCTGGG AGCACATCGA CCTGAACCTG CGCAACAGCG CCTTCGGCGG CCCGGCCGAC
GCCGACCAGA CCCAGCCGGC CAAGGTCGCG CTGCGCCAGG CGATGTTCAC CGCCTTCGAC
CGGCTCGGCC TGCTGAACCG GACGATCAAG CAGTTCGACA GCGACGCGGC GGTGCTGAAC
AACCGCATGG TGGTGCCCGG CCAGCCCGGC TACCAGGACA ACGCCTCCGC GATGTACCCG
GAGTCCGGGG ACCTGAACAA GGCCAAGCAG CTGCTCACCA CGGCCGGCTA CAAGGGCGTG
GGCACCGCGC TGGTGGACCC CAGCGGCAAG GCCGTCCCGG CGTTCAGCAT GCGCTACACC
GTCGGCAACC AGCTGCGCCA GGACACCTGC AACCTGTTCG CGCAGGCCAT GAAGCAGCTG
GGGATCACGG TCAACGTCAG CTCCACCGAC GCCCTGGGCA AGACCCTGAC CCAGTCCGAC
GCGCAGCACA CGTACGACAT CATCGTCTTC GCCTGGGTGG ACACCCCGTT CCCCAACTCG
GCGAACCAGC CGCTGTACAC CACCACGACG CAGGGCAACC CGCAGAGCAA CTACGGGTAC
TACAGCAACG CGAACGTGGA CAAGTGGCTG GCCGACGCCA CGGTCAACCC CGACCAGACG
GCCCGGGAGA AGGACCTGAA CCAGGCCGAC GCGCAGATCA CCAAGGACGC GTACACGCTG
CCGCTGTACC AGAAGCCGAC GATGATCGCG TACAAGAACA CCCTGGGCAA CGTGCGGGAC
AACCCGACGC AGATCGGCCC GACGTACAAC ATCGCGCAGT GGGGCCAGAA GTAG
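
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, only the first line of the sequence is used to keep the example short, and the record does not say how the database derived its own figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGGCGTCA GCAGGAGGAC ACTCGCGCTG CAGACCGGAG CGATCGCGGC GGCGGTGGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1794 bp sequence through the same two functions gives a value that can be compared with the GC content field.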
 
Protein sequence
MGVSRRTLAL QTGAIAAAVA LAATACGSSK SGGGSTTGSG SGAGTPIADR NSVNAATVKQ 
GGKITWTIEK TVQDWNPLTS LGNTFDYAQT TNGIYPDVYV PQPDYSLVLN TDLMAGDPVV
TNATSTEPQK IVYKIQPNAK WSDGTPVTAD DFIYLWQAQN GTNPNVDVAS TTGYSDVASV
TGSDNGKTVT VAFKQDKPFS DWKSLFTSIL PAHVAKQHGD VAASFTWLDA NPPTVSAGPF
EIAPGGVSAD KSLIKTIKNP QYYGKPANLD EVDFRAITDS SQEPTALANG EVDGIYPQPQ
LDLVNRVKSI AGVDYHINQG LVWEHIDLNL RNSAFGGPAD ADQTQPAKVA LRQAMFTAFD
RLGLLNRTIK QFDSDAAVLN NRMVVPGQPG YQDNASAMYP ESGDLNKAKQ LLTTAGYKGV
GTALVDPSGK AVPAFSMRYT VGNQLRQDTC NLFAQAMKQL GITVNVSSTD ALGKTLTQSD
AQHTYDIIVF AWVDTPFPNS ANQPLYTTTT QGNPQSNYGY YSNANVDKWL ADATVNPDQT
AREKDLNQAD AQITKDAYTL PLYQKPTMIA YKNTLGNVRD NPTQIGPTYN IAQWGQK
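
The protein sequence corresponds to the gene sequence read codon by codon. The minimal sketch below builds a codon lookup from the standard amino-acid assignments, which translation table 11 shares with table 1 for internal codons, and translates until the first stop codon. It is an illustration, not the method the database used. It ignores the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with the
# amino-acid assignments shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGGCGTCAGCAGGAGGACACTCGCGCTG"))
print(translate("CAGAAGTAG"))
```

The two fragments translate to the first ten residues and the last two residues of the protein sequence listed above.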