Gene Cag_1941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1941 
Symbol 
ID3746701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2472805 
End bp2473806 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content50% 
IMG OID637774476 
ProductElongator protein 3/MiaB/NifB 
Protein accessionYP_380232 
Protein GI78189894 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATGAGC CGTATGCCAC AAGCCAACCG CTTTTTGATA CGTTTCAACG CCAAATAACG 
TATGCGCGCT TAGCGGTTAC CTCAGCTTGT AACTTGCGCT GCGGCTACTG TTTAAGCGAA
GCGCACGAAC CCGCTACACT GCACCAACCA CTTCTTTCAA CGGCTGAACT TTGCACCATT
ATTGAGTTGC TTGCCAAGCA TGGCATTCAA AAGCTACGCT TCACGGGTGG AGAACCCTTA
CTGCGTAGCG ATATTGTAGC GCTTATTGCT ATGGCACGGC AGCACTCATC CATTCGCACC
ATTGGCTTAA CGACGAACGG CTTGTTGCTT CTTCCCCTTC TCCCTCGTTT ACTTGACGCT
GGGCTTGACT CGGTAAATCT TAGCCTCGAT ACACTCAATC GCCATCGTTA CTTTCAAATT
ACTCGGCGCG ACCTTTTTCC GCAAGCTGAA GCGGCGTTGC ATGCGCTACT GGCTACACCC
TCGCTTTCAG TAAAATTGAA CGTGGTTATG CTGCGTGGCA TTAATAGCGA TGAACTTACT
GGCTTTGTAG AGCTTACCAA AGAGCATAAC ATCACTGTGC GCTTTTTAGA GCTGCAACCC
TTTGACGACC ATCAAATTTG GAAGACAGGG CGCTTTTTGC GAGCTGATCG GCTTGAAGAG
ATGTTGCTGC ACGCTTATCC CGCTTTGCAG CGCGTGCAAG GCGAAGCAAC CCAGCACTTT
AGCTATTGTT TGTCCAACTA CAAAGGCGCA CTTGCAATTA TTCCCGCTTA CACAAGAGCA
TTTTGCGAGC AATGCAACCG CTTACGTATT ACCTCAAGCG GCAAGCTGAT AAGTTGCTTG
TATGAAAAGG ATGGATTAGA ACTTTTACCC TTGTTGCGAA ATGGTGCAAA ACCCGAAGAG
TTTGCGGCGT TGTTGCAGCA AGCCGTGCTT CGTAAACCAG CCAACGGGCA TCAGCGCCAC
ACAGGTGCTG TGCGTACCAG TATGTCGGAG ATTGGGGGGT AA
 
Protein sequence
MDEPYATSQP LFDTFQRQIT YARLAVTSAC NLRCGYCLSE AHEPATLHQP LLSTAELCTI 
IELLAKHGIQ KLRFTGGEPL LRSDIVALIA MARQHSSIRT IGLTTNGLLL LPLLPRLLDA
GLDSVNLSLD TLNRHRYFQI TRRDLFPQAE AALHALLATP SLSVKLNVVM LRGINSDELT
GFVELTKEHN ITVRFLELQP FDDHQIWKTG RFLRADRLEE MLLHAYPALQ RVQGEATQHF
SYCLSNYKGA LAIIPAYTRA FCEQCNRLRI TSSGKLISCL YEKDGLELLP LLRNGAKPEE
FAALLQQAVL RKPANGHQRH TGAVRTSMSE IGG