Gene Cag_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1064 
Symbol 
ID3746718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1446707 
End bp1447933 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content51% 
IMG OID637773595 
Productmolybdopterin binding domain-containing protein 
Protein accessionYP_379369 
Protein GI78189031 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACCG TTGAAGCTGC ACGCACACTT GTTCAACAAT CCATTCAACC GCTTGGCACC 
GAGCAGTTAC CCATTGCCGA AGCGTTCGGA CGCATTACCG CTGAAGCAAT TCATGCGCCT
TTTGCATTGC CACGTTTTAC CAATGCCGCT ATGGATGGCT TTGCTGTGCG TTGGGACGAT
ATTGCACAAG CATCCGACGC CACGCCAATA ACGCTTACCG TGCAGGAGAT GATTGCTGCT
GGCAGTGAGC CCACTGTGGC AATTTCGCAA GGTTGCTGCT CAGCAATTAT GACGGGTGCT
CCAATGCCGC AAGGGGCTGA TACAGTAGTT CCCTTTGAGC AAACAAGCGG ATTTGGCAGC
AACAGCGTTA CCATTTTTAA AGCCCCAAAG CGTCAAGCCA ATGTGCGCTA TGCAGGCGAA
GAGGTGGCGG CGAATGAATT GCTGGTGGAG AATGGAGTAG CACTAAATCC TGCTGCACTA
TCGGTGCTTG CAAGCTTTGG GGTTGCTCAA TTGAAGGTTC GTCGTCAACC ACGAATTGCC
ATTATTACCG TAGGAGACGA AGTGCAACTA CCGGGGAAAC CCTTAATAGG CGCTCAAATT
TACAACTGCA ACCGCTTTAT GCTTGATGCC GCCTGCCGTT CACTTGGCAT AATTCCAACC
TTTATTCACC ACGCCCCCGA CAACCGCGAA GTATTACGCC ATTCGCTTGG CATGGCGCTC
ACCATGTGCG ATATGCTCCT TACGGCTGGA GGCATTTCAA CAGGAGAATT TGATTTTGTA
CAGAGCGAAT TAACAGCGCT TGGAATCAAC AAACATTTTT GGAGCATTGC CCAAAAGCCG
GGTAAACCGC TCTACTTTGG CACCTCACAC GAAGGCAAAG CCGTATTTGC GTTGCCGGGC
AATCCCATTT CAGCCATTGT TTGCTTTGCC GCTTACGTGG TTGACGCACT TGCCCTGATG
CAAGGCAAAA CCCTCAGCAC ATCACGCTTT ACCGCAACCC TTGCCGAACC ATTCCCCACC
GATAAAAAAC GCTACCGCTT TTTACCCGGT ATGGTGTGGG TGGATCGTGG GCAACTCTTT
TGCAAAGCCG CAAGTAAGAT AGAATCGCAC ATGATTACTT CACTTTCGGG AGCAAACTGT
TTACTTGAAG CCGAAGCCGC TCAATATGAC CGTCCTGCTG GCGAGCTTAT TACTTGCACC
ATGTTGCCGT GGGGGAAGGT TTGTTAA
 
Protein sequence
MITVEAARTL VQQSIQPLGT EQLPIAEAFG RITAEAIHAP FALPRFTNAA MDGFAVRWDD 
IAQASDATPI TLTVQEMIAA GSEPTVAISQ GCCSAIMTGA PMPQGADTVV PFEQTSGFGS
NSVTIFKAPK RQANVRYAGE EVAANELLVE NGVALNPAAL SVLASFGVAQ LKVRRQPRIA
IITVGDEVQL PGKPLIGAQI YNCNRFMLDA ACRSLGIIPT FIHHAPDNRE VLRHSLGMAL
TMCDMLLTAG GISTGEFDFV QSELTALGIN KHFWSIAQKP GKPLYFGTSH EGKAVFALPG
NPISAIVCFA AYVVDALALM QGKTLSTSRF TATLAEPFPT DKKRYRFLPG MVWVDRGQLF
CKAASKIESH MITSLSGANC LLEAEAAQYD RPAGELITCT MLPWGKVC