Gene Cag_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1046 
Symbol 
ID3747774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1414240 
End bp1415802 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content33% 
IMG OID637773575 
Producthypothetical protein 
Protein accessionYP_379351 
Protein GI78189013 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000103678 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGC TACCCGTAGG CATACAAACA TTTAGTAAAA TAATCGAGGA CGATTATCTG 
TACATTGATA AAACAGATAT AGCAAAAAGC ATAATAGAAA AATATCAATA TGTTTTTCTA
TCACGTCCAC GACGATTTGG TAAAAGCTTA TTTCTTGATA CCCTTAAAAA TATTTTTCTT
GGCAATAAAG AGCTATTCCA AAACTTACAT ATTTATAACC AATGGAATTG GAATATAACC
TATCCTGTTA TTAAAATAAG TTTTAGTGGC GGAATACGAA ATAACGAAAG TCTTCGCAAA
AATCTTTTTT ATATCCTAAA AGATAATCAA AAACGGCTCA ATATTACTTG TGAAGAAAAC
GATGAGCCAA ATCTATGCTT TGCTGAATTG ATTCAACAAG CATTTGAAAA ATACCAACAA
AAGGTTGTTA TTCTGATTGA CGAGTATGAT AAACCTATTC TTGATAATAT TGAAAACATT
CCTGAAGCAC TCGTTATTCG TGATGGAATG CGTGATTTTT ATACCAAAAT AAAAGAGAAC
GATGAATATT TGCGTTTTGT ATTTCTAACA GGAGTAAGCA AGTTTTCAAA AGTATCGCTC
TTTAGTGGTT TAAATAATCT TGAGGATATT AGCCTGAACC CTAATTTTGG CAACATTTGT
GGCTATACAC AGCATGATGT TGATACTGTT TTTGCACCAT ATCTTGAAGG TGTGGCTATG
GAGAAAGTAA AGCGCTGGTA TAATGGATAT AATTTCTTGG GCGATAACGT TTATAATCCA
TTTGATATTT TACTTTTTAT AAAAAACCAA AAGACGTTTA AGAATTATTG GTTTGAAACG
GGCACACCAA CCTTTTTAAT GAAGCTTTTT GCTAAGGAGC GCTATTTTTT ACCCAATTTA
GAGCACCTTG AAGTGGGTGA TGAAATTCTT GATTCATTTG ATATTGAAAA AATTCAACTT
GCAACTCTTT TATTTCAAAC GGGATATTTA ACCATAGAGA AACGGTTTGA AACGTTTGAG
CGATTACGTT ACCAACTTAA AATCCCTAAT CAAGAGGTTC GTTTAGCGTT AAGTGATCAT
TTTATTAATG TTTATACCGA GCAGCCGAAT GAGTTAAAAT ATGCCCAGCA AAATCGTTTT
TATACCTATT TAACGCAGGT TGATATGCTT GGTTTCCAAC AAACGTTGCA AGCATTATTT
GCCGGCATAC CGTGGAATAA TTTTATCAAT AACTCGTTGC CTGAGTTTGA AGGCTATTAT
GCAAGTGTAC TGTATGCTTT TTTTATTAGT CTTAATGCTA CAGTTATTCC TGAAGATACC
ACCAATCAAG GGCAGGTTGA TTTAACAATA ATGGTTGAAA ACAAAGTTTA CATTATTGAA
ATTAAACGTG ATACGGTAAA AAGCTATGAA ATAAGCCAAC AAAACATAGC TCTGCAACAA
ATTCAGAGAA AAGGTTACGC CACAAAATAT AAAGGGCAAG GGAAAACAAT TATACAAATT
GGCATGATTT TTAACATCTA TCAGCGCAAT CTTGTACAAA TGGATTGGGA GGTTGTGGGG
TGA
 
Protein sequence
MKPLPVGIQT FSKIIEDDYL YIDKTDIAKS IIEKYQYVFL SRPRRFGKSL FLDTLKNIFL 
GNKELFQNLH IYNQWNWNIT YPVIKISFSG GIRNNESLRK NLFYILKDNQ KRLNITCEEN
DEPNLCFAEL IQQAFEKYQQ KVVILIDEYD KPILDNIENI PEALVIRDGM RDFYTKIKEN
DEYLRFVFLT GVSKFSKVSL FSGLNNLEDI SLNPNFGNIC GYTQHDVDTV FAPYLEGVAM
EKVKRWYNGY NFLGDNVYNP FDILLFIKNQ KTFKNYWFET GTPTFLMKLF AKERYFLPNL
EHLEVGDEIL DSFDIEKIQL ATLLFQTGYL TIEKRFETFE RLRYQLKIPN QEVRLALSDH
FINVYTEQPN ELKYAQQNRF YTYLTQVDML GFQQTLQALF AGIPWNNFIN NSLPEFEGYY
ASVLYAFFIS LNATVIPEDT TNQGQVDLTI MVENKVYIIE IKRDTVKSYE ISQQNIALQQ
IQRKGYATKY KGQGKTIIQI GMIFNIYQRN LVQMDWEVVG