Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0404 |
Symbol | |
ID | 3747782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 469051 |
End bp | 470739 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637772932 |
Product | peptide ABC transporter, periplasmic peptide-binding protein, putative |
Protein accession | YP_378720 |
Protein GI | 78188382 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.195292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGA AACCTCCATT TGCACTCATA GTTGCATTAC TGCTTGTTGT AAGCACGTTA TACGGTTGCC ATCAACAACG GGCTGAAGAG CAACACCAGC AAGTAGTTAC CGCCATTTCA GCCGATTTCG ATTACCTCAA TCCGCTGCTC ATTCAGTTTG CAATGTCGCG CGAAATGTGC AAGCTGCTCT ATCCATCGCT TGTACGCCCA GCTTACGATT CAGCCCAAGG CACCATTCGC TTCCTTCCTA ACGCTGCCAA GCAATGGAAA TTTTCGCCCG ATGGTAAAGG TGTTACGTTC CATCTTAACC GCAATGCTCG CTGGGAAGAT GGGGTGCCTG TAACATCGCA CGACTTTAAA TTCTCCTACG CTCTCTACAA AAATCCAGCA GTGGCAAGTA CGCGCCAGCA TTACTTAAAC GATTTACCAT TGCTACCCGA TGGCTCACCC GATGTTGAGC GTGCGGTGGA AACGCCGAAC GATTCAACGT TGGTGATTCA CTTTAGCAAT GCTATGGCAG AGGAAATTGT GCTTGATCAT CTGAATGATT TAATGCCGGT AGCTCGCCAT CGCTTTAAGG ATTTTCGCCC TGAGGAGATT CGCCAACGTG CGGCTGAATT ACCGCTGCTG AGTGCGGGTC CGTTCCGCGT AAAAGAGTGG AAACGGCAAG AGCGTTTGGT GCTGGAACCA AATCCAACCT CAGCGCTACC GCATCCTGCC ACCTTAAAAA GCATGGTCTT TATGGTGGTG CCCGAATACA CCACACGCTT GGCTATGCTG AAAGCAGGGC AAATTGATGC AATGCTTTCA GCGGGCGGCA TTAACCCAAA GGATGTGCCA GAACTACACC GAAGCGCGCC CAATGTGGTG ATTCGCCCTG TACAGCACCG CTATTTTGAT AGCATTGTGT GGCTCAATAT TGATGGCGAA TGCTATCGTA CCCGCAAGCA GATTGCGCCT CATCCGCTTT TTGGTGATAA GCGTGTTCGC CAAGCCCTTA CCCTTGCCAT TGATCGCCAA TCCATTATTG ATGGTTTTAT GGGACCCGAC CACGCCACCA TTGTGAACAC CTCGCTTTCG CCGGCATATC GCACCCTTGC CGATAGCTCG CTTGATGCGT ATGCCTACAA TCCACGCCGA GCTTCCATGC TCTTGCGTCA AGCAGGATGG CTACCGGGAG CAAACGGCAT TCTGCAAAAA AATGGCAAAC CCTTTAGCTT TGAGCTTGCA GCACCTACCG GCAATCCACG CCGCAACTAT GCCGCTACAA TCATTCAGCA AAACTTGCGG GCAATTGGCA TTGATTGTCG CCTCCGCTTT GATGAAGGAC TTATTTTTAA CCGCAACCAA AACGAGTTTC GCTACGATGC CGCCCTTTCG GGCATGGCAG CAGAAACCTT GCCTTTTCAG CTTATTATTT GGGGATCGAA TTTTGCCGAA CGCCCCTTTA ACTCAGCCGC TTTTCAAAAC AGGGAGCTTG ACGAGGTGAT TGCTCAACTC AGCAAACCAA ATTCACCTAC ACGAAAACGT GCCTTTTGGC AACGCTATCA GCAAATTTTG CACGAAGAAC AGCCACGCAC ATTCCTTTAT TACTATGATG AACTTGAAGG CTTCAATAAA CGCATTCGCA ACGCCGAGGT AAACATGCTC TCAACGCTCT ACAATCTCCA CGAATGGCAA ACGGAGTAA
|
Protein sequence | MEKKPPFALI VALLLVVSTL YGCHQQRAEE QHQQVVTAIS ADFDYLNPLL IQFAMSREMC KLLYPSLVRP AYDSAQGTIR FLPNAAKQWK FSPDGKGVTF HLNRNARWED GVPVTSHDFK FSYALYKNPA VASTRQHYLN DLPLLPDGSP DVERAVETPN DSTLVIHFSN AMAEEIVLDH LNDLMPVARH RFKDFRPEEI RQRAAELPLL SAGPFRVKEW KRQERLVLEP NPTSALPHPA TLKSMVFMVV PEYTTRLAML KAGQIDAMLS AGGINPKDVP ELHRSAPNVV IRPVQHRYFD SIVWLNIDGE CYRTRKQIAP HPLFGDKRVR QALTLAIDRQ SIIDGFMGPD HATIVNTSLS PAYRTLADSS LDAYAYNPRR ASMLLRQAGW LPGANGILQK NGKPFSFELA APTGNPRRNY AATIIQQNLR AIGIDCRLRF DEGLIFNRNQ NEFRYDAALS GMAAETLPFQ LIIWGSNFAE RPFNSAAFQN RELDEVIAQL SKPNSPTRKR AFWQRYQQIL HEEQPRTFLY YYDELEGFNK RIRNAEVNML STLYNLHEWQ TE
|
| |