Gene Cag_0501 details

Gene Information

Locus tag: Cag_0501
Symbol:
ID: 3746370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 584389
End bp: 585417
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 43%
IMG OID: 637773035
Product: periplasmic phosphate binding protein
Protein accession: YP_378817
Protein GI: 78188479
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0226] ABC-type phosphate transport system, periplasmic component
TIGRFAM ID: [TIGR02136] phosphate binding protein
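
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the gene record.
start_bp = 584389
end_bp = 585417

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1029
print(protein_length)  # 342
```

Under these assumptions the computed values agree with the listed gene length of 1029 bp and protein length of 342 aa.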


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATTA CACAGTTTTG GAAACATGCC ACTATGGCAT TGGCTTTTGT TGGACTCGCT 
TCGGGTTCAT TAGAGGCGCG AGAACAAATT AGAATTGTGG GTTCAAGTAC CGTATTCCCA
TTTGCAAGCT ATGTTGCAGA GGAATTTGGT AAAACCACAG GCAACCAAAC TCCTGTTATT
GAATCAACGG GTTCAGGCGG AGGGCACAAA TTGTTTGGTG AAAGTGATGC TATTACCACG
CCCGACATTA CCAACTCTTC ACGCAGAATG AAGAAAGCGG AATTTGATCG CGCCAAACAA
AACGGCATAC AAGCTATCCA CGAGGTTGTA ATTGGTTATG ATGGTATTGT AATAGCAAAT
GCAAAAAAAG CTACCACCTT ACAGCTTACC CGTCGCGACC TGTTCTTTGC GCTTGCTGAA
GAAGTACCCA TGAAAGGTCA GCTTGTAAAA AACCCTTATA CCAAATGGAG CCAAATCCGT
AAAGGGTTGC CAAACCAAAA GATTCTTGTG TATGGTCCTC CAACAAGCTC AGGCACCAGA
GATGCGTTTG ATGAAATGGT AATGGAAGCA TCATCAAAGA GCATCACTGA ATATGGAGCA
CTTGCTGGTA AGTACAAGAA AATTCGTCAA GATGGTGTTT TTGTGCCTTC AGGTGAAAAC
GACAATTTAA TTGTACAGCG CATTGTAAAA GATAAAGCTG CGGTTGGCGT TTTTGGCTAT
AGCTTTTTAG AGGAGAATGC CGATCGCATT AAAGGTGCAA CCATTGATGG TGTTGCGCCA
GTGCCAGCCA ACATTACCTC AGGTAAATAT CCTGTATCGC GCGATCTTTA TTTCTACGTA
AAAGGCTCAC ATATTGCCCA AGTAAAGGGT TTGAAAGAGT ATGTTGACCT TTTTGTTGGC
GAAAAAATGA TTGGCGACTA TGGATATTTG AAAAAAATCG GTTTGATTCC GCTACCTAAA
AAAGAGCGTG AAGCAATCCG TGCAAATTGG AATGCTCGTA AGATGTTAAC GGGAACAAGC
CTCGATTAA
 
Protein sequence
MRITQFWKHA TMALAFVGLA SGSLEAREQI RIVGSSTVFP FASYVAEEFG KTTGNQTPVI 
ESTGSGGGHK LFGESDAITT PDITNSSRRM KKAEFDRAKQ NGIQAIHEVV IGYDGIVIAN
AKKATTLQLT RRDLFFALAE EVPMKGQLVK NPYTKWSQIR KGLPNQKILV YGPPTSSGTR
DAFDEMVMEA SSKSITEYGA LAGKYKKIRQ DGVFVPSGEN DNLIVQRIVK DKAAVGVFGY
SFLEENADRI KGATIDGVAP VPANITSGKY PVSRDLYFYV KGSHIAQVKG LKEYVDLFVG
EKMIGDYGYL KKIGLIPLPK KEREAIRANW NARKMLTGTS LD
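
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, assuming plain Python with no external libraries; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool. Only the first line of the gene sequence is used as a sample, so the printed GC fraction applies to that sample rather than to the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing spaces and line breaks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as a short sample.
sample = "ATGCGTATTA CACAGTTTTG GAAACATGCC ACTATGGCAT TGGCTTTTGT TGGACTCGCT"
seq = clean_sequence(sample)

print(len(seq))                    # number of bases in the sample line
print(round(gc_fraction(seq), 2))  # GC fraction of the sample line only
```

Pasting the full gene sequence into `sample` would allow the result to be compared with the 1029 bp length and 43% GC content listed in the record.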