Gene Cag_1024 details

Gene Information

Locus tag: Cag_1024
Symbol:
ID: 3746752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1378520
End bp: 1380037
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 33%
IMG OID: 637773553
Product: hypothetical protein
Protein accession: YP_379329
Protein GI: 78188991
COG category:
COG ID:
TIGRFAM ID:
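
The length fields in this record can be cross-checked against the coordinates. The short sketch below assumes the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1378520
end_bp = 1380037

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1518 505
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.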


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCGC TACCCGTAGG CATACAAACA TTCAGTAAAA TAATCGAGGA CAATTATCTG 
TACATTGATA AAACAGATAT AGCAAAAAGC ATAATAGAAA AATATCAATA TGTTTTTCTA
TCACGTCCAC GACGATTTGG AAAGAGTTTG TTTCTTGATA CGCTCAAAAA TATTTTTGAA
GGCAAACAAG AGCTTTTCAA AGACTTGCTT ATTTACAACC AATGGAATTG GGCTGTAACT
TATCCCGTTA TTAAAATAAG TTTTAGTGGT GGTATTCACT CCAAAGCTGA TCTCGAAGAA
GATTTAATAC AAATACTGAA GGCGAATGAA AAACGGCTTG ATCTAAAGTG CGAAAATCGC
TCAAAAGCAA AATACTTTTT TGCTGAGTTG ATTCAACAAG CTTCTGAAAA GTATCAACAA
AGCGTTGTTA TTCTCATTGA CGAGTACGAT AAACCAATTC TCGATAATAT TGAAAATATC
CCTGAAGCAC TCATTATCCG TGATGGAATG CGAGACTTTT ATACCAAAAT AAAAGAGAGT
GATGAATATT TACGTTTTGT TTTTCTTACA GGAGTAAGCA AATTTTCAAA AGTATCGCTT
TTTAGTGGTT TAAATAATCT TGAAGATATT AGCCTGAATT CCGATTTTGG TAACATTTGT
GGTTATACAC AAAACGATGT TGATACCACT TTTGCACCCT ATTTTGAAGG CGTGGATATG
GAAGAGGTGA AACGCTGGTA TAATGGATAT AATTTTCTTG GTGATAAAGT CTATAACCCA
TTTGATATAC TCCTTTTTAT TAAAAACCAA AAAATGTTCA GAAACTATTG GTTTGAAACA
GGCACACCAC GATTTCTGAT TGAGCTTATC AAAAAAAACA ACTATTTTGT TCCCAACTTA
AATAAACTGC GAATTAATGA ATCCTTAGCG AATAGTTTTA ATCTTGAAAA TCTTAATTTA
GAAACAATTT TATTTCAAGC AGGCTATTTG ACTATTAAGC GATTGATTTC TACTAACAAA
GGTGTTAGCT ATGAGTTGGG ATTTCCTAAC AAAGAGGTGC AAATTAGCTT TAACGATTAT
CTTTTGCAAG AATTAACTAC TGTTTCGGAA AATGAGCTAA TTTGCGATGA TCTTTTTGAA
CTTTTCAATA ATGGAGATAT TGCCAATTTA GAACCCGTTA TCAAACGACT TTTTGTAAGT
ATTGCTTATA ATAATTTCAC CAACAACTAT ATTGAGAGTT ATGAGGGCTT TTATGCAAGT
GTGCTCTATG CTTATTTTGC AAGTCTTGGG TTTGATATGA TTGCTGAAGA TATCACCAAT
AAAGGCAGGA TTGATTTAAT CCTTAAAACC TTCGATAAAA CCTACATCTT TGAATTCAAA
GTAATTGCAG AGGAGCCGCT TGAGCAAATC AAAAAGATGA AATATTACGA GAAGTATGAT
GGTGAACGTT ATCTCATTGG TATTGTTTTT GATCCGAAGG CAAGAAACGT CAGTCAATTT
GCGTGGGAGA GGGTTTGA
 
Protein sequence
MKPLPVGIQT FSKIIEDNYL YIDKTDIAKS IIEKYQYVFL SRPRRFGKSL FLDTLKNIFE 
GKQELFKDLL IYNQWNWAVT YPVIKISFSG GIHSKADLEE DLIQILKANE KRLDLKCENR
SKAKYFFAEL IQQASEKYQQ SVVILIDEYD KPILDNIENI PEALIIRDGM RDFYTKIKES
DEYLRFVFLT GVSKFSKVSL FSGLNNLEDI SLNSDFGNIC GYTQNDVDTT FAPYFEGVDM
EEVKRWYNGY NFLGDKVYNP FDILLFIKNQ KMFRNYWFET GTPRFLIELI KKNNYFVPNL
NKLRINESLA NSFNLENLNL ETILFQAGYL TIKRLISTNK GVSYELGFPN KEVQISFNDY
LLQELTTVSE NELICDDLFE LFNNGDIANL EPVIKRLFVS IAYNNFTNNY IESYEGFYAS
VLYAYFASLG FDMIAEDITN KGRIDLILKT FDKTYIFEFK VIAEEPLEQI KKMKYYEKYD
GERYLIGIVF DPKARNVSQF AWERV
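
As a rough consistency check between the two sequences, the sketch below translates the first and last lines of the gene sequence and compares them with the ends of the protein sequence. It uses only the Python standard library. The `translate` helper and the constant names are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids as the standard code (it differs in which start codons are permitted).

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering.
# Assumption: amino acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 18 bases of the gene sequence in this record.
first = translate("ATGAAACCGC TACCCGTAGG CATACAAACA")
last = translate("GCGTGGGAGA GGGTTTGA")

print(first)  # MKPLPVGIQT
print(last)   # AWERV
```

Both fragments match the corresponding ends of the protein sequence listed here. The final line can be translated on its own because every preceding line holds 60 bases, a multiple of three, so it begins in frame.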