Gene Cag_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1052 
Symbol 
ID3747033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1423658 
End bp1424986 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content52% 
IMG OID637773581 
Producthypothetical protein 
Protein accessionYP_379357 
Protein GI78189019 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.843089 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAC GTTGGGTAGT GGTGATTGGT GGGGGAGCGG CTGGTATGGC GGCTGCCGTT 
TCAGCCGCAG AGCAAGCCCG TTATCTTGGT GTGGATTGCC ATATTACGGT GATTGAAAAA
ACGCATCAAG TGGGTTCTAA AATCCGCATT TCAGGTGGTG GAAAATGCAA TGTTACGCAT
GTTGGTACAT CGGCTGAGTT GCTTGAAAAA GGGTTTCTCC GTGCAGCGGA GCAGCGCTTT
TTACGTTCCG CACTTTACGC ATTTTCCAAT AACGAACTCC GTGCACTCTT GCAGCAGCAG
GGGGTGGCTA CTACTGAGCG AGAGGATGGT AAGGTGTTTC CCGTAGCAGG TGAGGCAAGC
GTGGTGGCTG AGGCATTCCG TACCTTATTG CAGCGCTTGA AAATCAATTG CGAGCTGCAT
GCGCCAGTGC AAGCCATAAA AGTGCATGGG CAGCAATTCC ACCTTATTAC TCTTCACGGC
GACATTGTTG CCGATGCGGT TATTGTGGCA ACGGGTGGGG TTTCGTATCG GCATACGGGC
ACTACGGGCG ATGGTTTGCG CTTAGCGCGT GCATTGGGGC ACACGGTTGT AGAGCCTTCA
GCGGCACTTT CCTCCATTAT GGTGCAGCCG CATTCGTTAG TTGCGCTTGC GGGTGCGGCG
CTACGTGGCG TTGCAGCCGT AGCGCGTGCA GGCAAGTTGC GGGCTGAGCG GCAGGGTGAT
ATTCTCTTTA CCCATCGAGG CTTTAGTGGT CCAGCAATGC TATCGCTTTC GCGCGATGTT
GCCAATATGC AGCGCTCCCA ACGTGAGGCG GTGCACCTTG CGGCTGACCT CTATCCCCAG
CAATTGCATG ATGAACTTGA GGCGTTGTTG CTCCAACATA GTAAAAAGCA AGGTGGGCAG
TTGGTGCGCA AATTCTTGCA AGTGTCGCCC ATTGGCATGT TGTTGCTCAA GAGCGAAACC
ATGCCATATG GCACCATTCC CAATGCCATG GTGCCACTGC TAATGCGTCA GGCGGCGCTT
GATGACGAGG TAACCTTTGC TACGTTAAGC CGTGAGCATC GCCATCAATT GGTGGTTACC
TTAAAGCAGT TTCAGCTTGG TACGGTTCAT AACGTGTCGT TGGATGCAGG GGAAGTTTCG
GCTGGTGGAG TAGCGCTTAG TGAAGTGAAT CCTAAAAGCA TGGAGTCGCG CCTTGTGCCA
AATCTTTATT TTTGCGGGGA AGTGTTGGAT TATGTGGGGG AAATTGGAGG CTATAATTTA
CAAGCTGCTT TTTCAACGGG ATGGATGGCT GGAAAAAGTG CTGTGAACAA GCTTTTAACA
GCTCTTTAA
 
Protein sequence
MKERWVVVIG GGAAGMAAAV SAAEQARYLG VDCHITVIEK THQVGSKIRI SGGGKCNVTH 
VGTSAELLEK GFLRAAEQRF LRSALYAFSN NELRALLQQQ GVATTEREDG KVFPVAGEAS
VVAEAFRTLL QRLKINCELH APVQAIKVHG QQFHLITLHG DIVADAVIVA TGGVSYRHTG
TTGDGLRLAR ALGHTVVEPS AALSSIMVQP HSLVALAGAA LRGVAAVARA GKLRAERQGD
ILFTHRGFSG PAMLSLSRDV ANMQRSQREA VHLAADLYPQ QLHDELEALL LQHSKKQGGQ
LVRKFLQVSP IGMLLLKSET MPYGTIPNAM VPLLMRQAAL DDEVTFATLS REHRHQLVVT
LKQFQLGTVH NVSLDAGEVS AGGVALSEVN PKSMESRLVP NLYFCGEVLD YVGEIGGYNL
QAAFSTGWMA GKSAVNKLLT AL