Gene Cag_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0861 
Symbol 
ID3747571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1193843 
End bp1195441 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content49% 
IMG OID637773390 
Producthypothetical protein 
Protein accessionYP_379169 
Protein GI78188831 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0645] Predicted kinase
[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00795594 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAACC TTGCTGAAGC CCTTTGCCAC CCCGAAGCTT ACCCTCATGC ACCGCAAAGC 
GTGGAGATGG TGCAAACCCA CTGCTCATGG GTTTTTTTAG CGGGAGCGTG GGCATATAAA
GTAAAAAAGC CGCTTGACCT TGGTTTTCTT GACTTTTCAA CGCTTGAGTT ACGCCGCCAC
TTTTGCTACG AAGAGCTACG CCTTAACCAG CGCCTCTGCT CCACTCTGTA TCTTTCAGTT
GTGCCAATTG TTGCTGTTCG GCAGCAGATC AAGGTTATTG ATAAGGAGAA TAATACGGAT
GAACATTGGA ACGAAGAGGA AAACAATGAG CATGGCACCA TAATTGACTA TGCGGTAAAA
ATGGTACGCT TTGACCGCAC GCAAGAGCTT GATCGCTTAT TGGCGCACCA CAAACTTGAT
GTTAAGCAGA TGGAGCAACT TGCCCGTACC ATTGCTGCAT TCCATAATTC ATTACCAGCA
GCTCCAATGG ATAGCGCATT AGGGCATCCC GACACCATTA TTAAGCCCAT GTTGCACAAT
TTCACCTTGC TTGAGGATAT TGTGGTGGAG AGCGAGGAGC AGCAAGAGTT AGCCACCTTA
CATCAAGCCA CCCTGAGCGA CCATCAGCGC CTTTACCAAC GGTTGCTTCA GCGCAAAGCG
GATGGCTTTA TTCGCCAATG CCATGGCGAT TTACATACAG GCAATATGGT GATGTGGCAA
GGGCGCATTA CGCTATTCGA TTGCATAGAA TTTAACCCAA CGCTCAACAC CATTGATTGC
ATCAGCGATC TCGCCTTTCT TTTTATGGAT TTACGCCATA GCGGCGAAAC GGCTTTAGCA
TGGCGACTTT TAAACGGCTA CTTGATGGAA ACAGGCGATT ACCACGCCTT AGCGCTTCTC
CCCTTTTATG AACGTTACCG CGCAATGGTA AGAGCCAAAG TAACTGCTAT TCATGCCTCG
CAAAGCAAAG ATGCGCCTGA AGTGAGCAGC TTAATGGCAG AGCACCGTAG CTACGTTGCG
CACGCCACAA ACTGTACCAA GCACAATCAG CCAATGCTCC TGATAGTGTG CGGTTTGTCA
GGAAGCGGCA AAAGCACCCT TGCCGCTTCA ATTGCCTCAG AACTGCCAGC AATTCACCTC
CGCTCCGATG TTGAGCGCAA ACGCCTTGCA GGGCTTCGCC CGCTTGAACG TAGCCCAAAG
AGCGACCTTT ACAGCCACTC CATGACTAAC AACACTTATG CACACTTATT GGGATTAGCA
CGATTTTGCT TGTTGGAAGG CTACTGCGTT GTGGTGGATG CCACCTTTTT GCGCCAAAGC
AATCGAGCAC TTTTTACAAC ACTTGCCAAT GAATGTAACG TACCATATCG CCTACTCCAC
TGCACTGCCC CAAAGCAGGT GCTGATGGAG CGTGTACAAT TACGCAACCT TGAAGGCAAC
GATGCCTCCG ATGCCGATGC GGAAGTGGTT GCCATGCAGC TTGAGCAGCA GGAAGCGCTG
ACGGATGACG AAAAAAAAAT TACAATAACG ATTGATACGA CCCATCCCAT TAACGCAACC
GCTCTTACGG GAATGTATCA ACTAAAGAGA GAACATTAA
 
Protein sequence
MINLAEALCH PEAYPHAPQS VEMVQTHCSW VFLAGAWAYK VKKPLDLGFL DFSTLELRRH 
FCYEELRLNQ RLCSTLYLSV VPIVAVRQQI KVIDKENNTD EHWNEEENNE HGTIIDYAVK
MVRFDRTQEL DRLLAHHKLD VKQMEQLART IAAFHNSLPA APMDSALGHP DTIIKPMLHN
FTLLEDIVVE SEEQQELATL HQATLSDHQR LYQRLLQRKA DGFIRQCHGD LHTGNMVMWQ
GRITLFDCIE FNPTLNTIDC ISDLAFLFMD LRHSGETALA WRLLNGYLME TGDYHALALL
PFYERYRAMV RAKVTAIHAS QSKDAPEVSS LMAEHRSYVA HATNCTKHNQ PMLLIVCGLS
GSGKSTLAAS IASELPAIHL RSDVERKRLA GLRPLERSPK SDLYSHSMTN NTYAHLLGLA
RFCLLEGYCV VVDATFLRQS NRALFTTLAN ECNVPYRLLH CTAPKQVLME RVQLRNLEGN
DASDADAEVV AMQLEQQEAL TDDEKKITIT IDTTHPINAT ALTGMYQLKR EH