Gene Cag_1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1224 
Symbol 
ID3748258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1624902 
End bp1627139 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content41% 
IMG OID637773758 
Producthypothetical protein 
Protein accessionYP_379529 
Protein GI78189191 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.605916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACTC CCGACCCACA AGTGAAACCA AGCACTTTAC CCAATGACCC GCTCACGCTG 
CTTGAGGTAG CCAACCACTC TTCGGAGCGT TTAGCGGTGC AACACACAGC GTTTATTGCG
GCATGTGTGT ATGTGCTGAT TATTGTGTTT GGTACGACTG ACCTTGATTT GCTGATTGGT
AAAGGTGTAC GCTTGCCTTT TGTAGATGTT GAAGTGCCGA TTGTTGGTTT TTTTGCTTTT
GTGCCATTTA TCCTTGTGCT GGTGCATTTT AACCTGCTTT TACAGCTTCA ACTGCTTTCG
CGTAAATTGT TTGCTTTTGA TGCAACGGTA CCTCAGGATG ATGGCATTGG CGGGTTGCGC
GATCGCTTGC ACATTTTTGC CTTTACCTAT TACCTTGCTG GTAATCCAAG CCGTTTGGTA
AAGCCATTTC TTGCGATAAT GGTTTCTATT ACCTTGGTGT TGCTGCCATT GTTTGTGCTC
TTTGCTATGC AACTACAGTT TTTAGCATAT CAAGATGAAG TGATTACGTG GATGCAACGT
TTTGCGCTAT GGTTGGATAT TGCTTTAATC AATATTTTTT TGCCAACCAT GTTGCATCCA
AAGGATGATT GGAAAAGCTA TTGGCGTAAT GTGATTGCTT GCTATGTTCC TCACCGAAGA
GTATGGCTTT CGTTTCTGCT ATTATATGTT GGAACTAACA TTTGTTTATT TGCTTCTAAA
AAAGAAATCC TTTTGATAGG GATAGCCCTT CTTGTTCTCT CGTTGCTGTT ACTACCAATA
CTTCGCGGAT GGAAAGCAAC GCACAAGGTT CAAAAAATAA TTATACCAAT ACTCATTATT
GTTACTTTTG CAATTATAGC GCTTTTGTTT TTAGTGGAAG TTAGAGATTG GATAGAAATT
ACAATAACAT CATTTATAAG TACAGAAACT ATTCGTGAAA AGGTATTTCC GTTAAGCTTC
ATTCTTTATG CTCTCATTAT CGTATTAACA GTTTTATGGC AACAGAGCGC ACCACGTGGT
AGCTTTGCTT TAGTAGTAAC ATTGTTTCTT GGAACACTAT TTCCATTAGC GTTTATGGTT
GATGGAGAAC ATCTTGAAAA AATTATTGCA AAAGGTGAAA ATGCAACGTT TTTGTCTAAT
GTACTTCAGG ATAAACGTCG CCTTAACTTG AGCGAACAAC ATCTGTTTGC AAAAGCACTA
AAGCCAGAAA TTATTACGTT AATTAGTGAT GGCAAATGGA AAGAGGCATT ACCTCAAATT
GAACCTATTA ACTTACAAGG GCGTCATTTA CGCCATGCAG AATTAAACCA AGCAATGTTA
CTTGGCGCTG ATTTACGATT CGCTGACTTG CAAGGCGCTT ACTTATCCGA CGCTGACCTG
CAAGGCGCTT ACTTATCCGA CGCTGACCTG CAAGGAGCTC ACTTACGACA AGCTGAGCTG
CAGGGTGCTC ACTTACGACA AGCTAACCTG CAGGGTGCTT ACTTACGACA AGCTGACCTG
CAAGACGCTA ACTTATCATA CACTAACCTG CAAGGCGCTG ACTTCATCGG CGCTGACCTG
CAAGGCGCTG ATTTACGATT CGCTCACTTG CAAGGCGCTA ACTTATTCGG CGCTCACCTG
CAAGGCGCTT ACTTATTCGT CGCTCACTTG CAAGGCGCTT ATTTATCTGG TGCTCACTTG
CAAGGCGCTG ACTTATCTGC TGCTCACTTG CAAGGCGCTG ACTTATTTGG CGCTAATCTT
TATGCTGCAG ATATTCGTAG AGCAAACACC ACGCTTGTTG ATGCACAAAA CATACGCTTA
GAACCTCTTA GCGAAAAAGA AGCAACTGAA TTACGTACAA CACTTAAGCC ATTAATAAAA
GATAATGAGG ACTATAACGA GGTTGCTGAA CGTATAAAAA AAGCAACAGC GCCACATGGT
GAAATTCCAT ATTTTGAATC TATTCTTGCA GAAAAGAATA CGCCACTTCG CTATAAAAAA
TGCTATAATG CAGAAAACTC TGCTGAACGT CGTGCATTTA CAAAACAACT GCATCCATAT
CTTGTATCAC TTGCATCTCA ATCTCCTGAA ATAGCACGAG GTATTATTCA GCAAATTCCA
ATTAGTGAGC CAAACACATC TTCACGCAAA GGATTAGCCG CAGAACTTGC AAAGCATCTG
AACGATCCAA AGTGCAAAGG GTTATATGAA TTACGTGATG ATGAAAAAGA AGAGCTACGA
AATTGGAAGG AGGAATAA
 
Protein sequence
MTTPDPQVKP STLPNDPLTL LEVANHSSER LAVQHTAFIA ACVYVLIIVF GTTDLDLLIG 
KGVRLPFVDV EVPIVGFFAF VPFILVLVHF NLLLQLQLLS RKLFAFDATV PQDDGIGGLR
DRLHIFAFTY YLAGNPSRLV KPFLAIMVSI TLVLLPLFVL FAMQLQFLAY QDEVITWMQR
FALWLDIALI NIFLPTMLHP KDDWKSYWRN VIACYVPHRR VWLSFLLLYV GTNICLFASK
KEILLIGIAL LVLSLLLLPI LRGWKATHKV QKIIIPILII VTFAIIALLF LVEVRDWIEI
TITSFISTET IREKVFPLSF ILYALIIVLT VLWQQSAPRG SFALVVTLFL GTLFPLAFMV
DGEHLEKIIA KGENATFLSN VLQDKRRLNL SEQHLFAKAL KPEIITLISD GKWKEALPQI
EPINLQGRHL RHAELNQAML LGADLRFADL QGAYLSDADL QGAYLSDADL QGAHLRQAEL
QGAHLRQANL QGAYLRQADL QDANLSYTNL QGADFIGADL QGADLRFAHL QGANLFGAHL
QGAYLFVAHL QGAYLSGAHL QGADLSAAHL QGADLFGANL YAADIRRANT TLVDAQNIRL
EPLSEKEATE LRTTLKPLIK DNEDYNEVAE RIKKATAPHG EIPYFESILA EKNTPLRYKK
CYNAENSAER RAFTKQLHPY LVSLASQSPE IARGIIQQIP ISEPNTSSRK GLAAELAKHL
NDPKCKGLYE LRDDEKEELR NWKEE