Gene Cag_0404 details


Gene Information

Locus tag: Cag_0404
Symbol:
ID: 3747782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 469051
End bp: 470739
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 50%
IMG OID: 637772932
Product: peptide ABC transporter, periplasmic peptide-binding protein, putative
Protein accession: YP_378720
Protein GI: 78188382
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
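
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 469051
end_bp = 470739
gene_length_bp = 1689
protein_length_aa = 562

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is a stop codon and contributes no amino acid.
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(remainder == 0)                            # length is a multiple of 3
print(expected_protein_aa == protein_length_aa)  # agrees with Protein Length
```

Under these assumptions all three checks agree with the values listed in the record.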


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.195292
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGA AACCTCCATT TGCACTCATA GTTGCATTAC TGCTTGTTGT AAGCACGTTA 
TACGGTTGCC ATCAACAACG GGCTGAAGAG CAACACCAGC AAGTAGTTAC CGCCATTTCA
GCCGATTTCG ATTACCTCAA TCCGCTGCTC ATTCAGTTTG CAATGTCGCG CGAAATGTGC
AAGCTGCTCT ATCCATCGCT TGTACGCCCA GCTTACGATT CAGCCCAAGG CACCATTCGC
TTCCTTCCTA ACGCTGCCAA GCAATGGAAA TTTTCGCCCG ATGGTAAAGG TGTTACGTTC
CATCTTAACC GCAATGCTCG CTGGGAAGAT GGGGTGCCTG TAACATCGCA CGACTTTAAA
TTCTCCTACG CTCTCTACAA AAATCCAGCA GTGGCAAGTA CGCGCCAGCA TTACTTAAAC
GATTTACCAT TGCTACCCGA TGGCTCACCC GATGTTGAGC GTGCGGTGGA AACGCCGAAC
GATTCAACGT TGGTGATTCA CTTTAGCAAT GCTATGGCAG AGGAAATTGT GCTTGATCAT
CTGAATGATT TAATGCCGGT AGCTCGCCAT CGCTTTAAGG ATTTTCGCCC TGAGGAGATT
CGCCAACGTG CGGCTGAATT ACCGCTGCTG AGTGCGGGTC CGTTCCGCGT AAAAGAGTGG
AAACGGCAAG AGCGTTTGGT GCTGGAACCA AATCCAACCT CAGCGCTACC GCATCCTGCC
ACCTTAAAAA GCATGGTCTT TATGGTGGTG CCCGAATACA CCACACGCTT GGCTATGCTG
AAAGCAGGGC AAATTGATGC AATGCTTTCA GCGGGCGGCA TTAACCCAAA GGATGTGCCA
GAACTACACC GAAGCGCGCC CAATGTGGTG ATTCGCCCTG TACAGCACCG CTATTTTGAT
AGCATTGTGT GGCTCAATAT TGATGGCGAA TGCTATCGTA CCCGCAAGCA GATTGCGCCT
CATCCGCTTT TTGGTGATAA GCGTGTTCGC CAAGCCCTTA CCCTTGCCAT TGATCGCCAA
TCCATTATTG ATGGTTTTAT GGGACCCGAC CACGCCACCA TTGTGAACAC CTCGCTTTCG
CCGGCATATC GCACCCTTGC CGATAGCTCG CTTGATGCGT ATGCCTACAA TCCACGCCGA
GCTTCCATGC TCTTGCGTCA AGCAGGATGG CTACCGGGAG CAAACGGCAT TCTGCAAAAA
AATGGCAAAC CCTTTAGCTT TGAGCTTGCA GCACCTACCG GCAATCCACG CCGCAACTAT
GCCGCTACAA TCATTCAGCA AAACTTGCGG GCAATTGGCA TTGATTGTCG CCTCCGCTTT
GATGAAGGAC TTATTTTTAA CCGCAACCAA AACGAGTTTC GCTACGATGC CGCCCTTTCG
GGCATGGCAG CAGAAACCTT GCCTTTTCAG CTTATTATTT GGGGATCGAA TTTTGCCGAA
CGCCCCTTTA ACTCAGCCGC TTTTCAAAAC AGGGAGCTTG ACGAGGTGAT TGCTCAACTC
AGCAAACCAA ATTCACCTAC ACGAAAACGT GCCTTTTGGC AACGCTATCA GCAAATTTTG
CACGAAGAAC AGCCACGCAC ATTCCTTTAT TACTATGATG AACTTGAAGG CTTCAATAAA
CGCATTCGCA ACGCCGAGGT AAACATGCTC TCAACGCTCT ACAATCTCCA CGAATGGCAA
ACGGAGTAA
 
Protein sequence
MEKKPPFALI VALLLVVSTL YGCHQQRAEE QHQQVVTAIS ADFDYLNPLL IQFAMSREMC 
KLLYPSLVRP AYDSAQGTIR FLPNAAKQWK FSPDGKGVTF HLNRNARWED GVPVTSHDFK
FSYALYKNPA VASTRQHYLN DLPLLPDGSP DVERAVETPN DSTLVIHFSN AMAEEIVLDH
LNDLMPVARH RFKDFRPEEI RQRAAELPLL SAGPFRVKEW KRQERLVLEP NPTSALPHPA
TLKSMVFMVV PEYTTRLAML KAGQIDAMLS AGGINPKDVP ELHRSAPNVV IRPVQHRYFD
SIVWLNIDGE CYRTRKQIAP HPLFGDKRVR QALTLAIDRQ SIIDGFMGPD HATIVNTSLS
PAYRTLADSS LDAYAYNPRR ASMLLRQAGW LPGANGILQK NGKPFSFELA APTGNPRRNY
AATIIQQNLR AIGIDCRLRF DEGLIFNRNQ NEFRYDAALS GMAAETLPFQ LIIWGSNFAE
RPFNSAAFQN RELDEVIAQL SKPNSPTRKR AFWQRYQQIL HEEQPRTFLY YYDELEGFNK
RIRNAEVNML STLYNLHEWQ TE
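
Both sequences above are printed in blocks of 10 characters, 60 per line. Most tools expect a contiguous string, so the whitespace has to be removed first. The sketch below shows one way to do that, using only the first line of the protein sequence as sample input. The helper name `join_blocks` is hypothetical.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    # str.split() with no arguments splits on any run of whitespace,
    # including newlines, so multi-line input is handled as well.
    return "".join(text.split())


# First line of the protein sequence, copied as displayed.
first_line = (
    "MEKKPPFALI VALLLVVSTL YGCHQQRAEE QHQQVVTAIS ADFDYLNPLL IQFAMSREMC "
)

protein_prefix = join_blocks(first_line)
print(protein_prefix)
print(len(protein_prefix))  # 6 blocks of 10 residues
```

Applying the same helper to the full gene or protein listing should give strings whose lengths can be compared with the Gene Length and Protein Length fields.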