Gene Cag_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1021 
Symbol 
ID3746749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1369800 
End bp1371317 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content32% 
IMG OID637773550 
Producthypothetical protein 
Protein accessionYP_379326 
Protein GI78188988 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000100407 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGC TACCCGTAGG CATACAAACA TTCAATAAAA TCATTGAAGG CGATTATCTG 
TACATTGATA AAACAGATAT AGCAAAAAAC ATTATTGAAA AATATCAATA TGTTTTTCTA
TCACGTCCAC GGCGCTTTGG TAAAAGCCTA TTTCTTGATA CACTCAAAAA TATTTTTGAA
GGCAAGCAAG AGCTTTTTAA AGACTTATTT ATTTACAACC AATGGAATTG GAATGTAACC
TATCCCGTTA TTAAAATAAG TTTTAGTGGC GGCATACGCG ATAAAGAAAG CCTTCGTAGA
AATCTTGTTT ATGTCTTAAA AGACAATCAA AAGCAGCTCA ATATTACTTG TGAAGAAAAA
GATGATCCGA ATCTATGCTT TGCTGAATTG ATCCAACAAG CATCTGAAAA GTATCAGCAA
AAAGTTGTTA TTCTCATTGA CGAGTACGAC AAACCTATTC TTGATAATAT TGAAAATATT
GCGGAAGCTA TTATCGTCCG CGATGGAATG CGAGACTTTT ATACCAAAAT AAAAGAGAGT
GATGAATATT TACGATTTGT TTTTCTTACA GGAGTAAGCA AATTTTCAAA AGTATCACTC
TTCAGTGGTT TAAATAATCT TGAAGATATT AGCCTCAATC CCGATTTTGG TAATGTTTGT
GGCTACACAC AAAACGATGT TGATACCATT TTTGCACCTT ATTTTGAAGG CGTTGATATG
GAAGAGGTGA AACGCTGGTA TAATGGCTAT AATTTTCTTG GCGATAAAGT CTATAATCCT
TATGACATTT TGCTCTTTAT TAAGAACAAG TACGTTTTTG ACAGTTATTG GTTTGAAACA
GGTACTCCAC GTTTTTTAAT TGAGTTAATC AAAAAAAATA ATTATTTCAT TCCAGATTTT
TTAACACTCA AAGTTAAAAA AAGTATAGTA AATAGTTTTA ATCTTGAAAA TCTTAATTTG
GAAACAATTT TATTTCAAGC AGGCTATTTG ACTATTAAGC GATTGATTTC TACGAATAAA
GGCATTAGTT ATGAGTTAAG ATTTCCAAAC AAAGAGGTGC AAATTAGCTT TAATGACTAT
CTTTTACAGG AGTTAACTAC CATTTCGGAA AATGAGCTAA TTTGCGATGA TCTATTTGAT
CTTTTCAATA ATGGAGATAT TGCCAATTTA GAACCCGTTA TCAAACGACT TTTTGCAAGT
ATTGCTTATA ATAATTTCAC CAACAACTAT ATTGAGAGTT ATGAAGGTTT TTATGCAAGC
GTGCTTTATG CTTATTTTGC AAGTCTTGGG TTTGATATGA TTGCTGAAGA TATCACCAAT
AAAGGTAGAA TTGATTTGAC ACTTAAAACA CTCGATAAAA CCTACATCTT TGAGTTCAAA
GTAATTGCAG AAGAGCCGCT TGAGCAGATC AAGAAAATGA GATATTATGA GAAATATGAC
GGCGAACGTT ATCTCATTGG CATTGTTTTT GATCCGAAGG CGAGAAACGT TAGTCGGTTT
GAGTGGGAGA GGGTTTGA
 
Protein sequence
MKQLPVGIQT FNKIIEGDYL YIDKTDIAKN IIEKYQYVFL SRPRRFGKSL FLDTLKNIFE 
GKQELFKDLF IYNQWNWNVT YPVIKISFSG GIRDKESLRR NLVYVLKDNQ KQLNITCEEK
DDPNLCFAEL IQQASEKYQQ KVVILIDEYD KPILDNIENI AEAIIVRDGM RDFYTKIKES
DEYLRFVFLT GVSKFSKVSL FSGLNNLEDI SLNPDFGNVC GYTQNDVDTI FAPYFEGVDM
EEVKRWYNGY NFLGDKVYNP YDILLFIKNK YVFDSYWFET GTPRFLIELI KKNNYFIPDF
LTLKVKKSIV NSFNLENLNL ETILFQAGYL TIKRLISTNK GISYELRFPN KEVQISFNDY
LLQELTTISE NELICDDLFD LFNNGDIANL EPVIKRLFAS IAYNNFTNNY IESYEGFYAS
VLYAYFASLG FDMIAEDITN KGRIDLTLKT LDKTYIFEFK VIAEEPLEQI KKMRYYEKYD
GERYLIGIVF DPKARNVSRF EWERV