Gene Cag_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1019 
Symbol 
ID3746747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1365493 
End bp1367778 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content48% 
IMG OID637773548 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_379324 
Protein GI78188986 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.198854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATA CCGACATTGA AGTTACCCTT ACCCATGCCG AAGAGCATGG CTTAAGCGCT 
GAAGAGTTCA GCCAAATATG CACCATTCTT GGGCGTACCC CCACCATTAC GGAACTTGGT
ATTTTTTCGG TGATGTGGTC GGAGCATTGC AGCTATAAAA ACTCTATTGC AGTTTTAAAA
ACCCTGCCAC GCGAAGGCGG CGCACTGCTT ACATCGGCTG GTGAGGAAAA TGCGGGATTG
GTTGACATTG GCGATAACCT TGCCGTTGCT TTTAAGATTG AGTCACATAA CCACCCTTCA
GCCGTTGAGC CATACCAAGG GGCAGCAACG GGCGTTGGCG GCATTCATCG CGATATTTTT
ACCATGGGCG CTCGTCCTGT GGCTTCACTG AATTCACTAC GTTTCGGCTC ACCAAAAGAT
CCTCGAGTAC GCTATTTGGT GGATGGTGTG GTGCGTGGCA TTGGCGATTA TGGTAACTCG
TTTGGCGTAC CAACGGTTGC GGGCGATATC TATTTTGAGG AGGGTTACAC AGGCAATCCT
CTTGTAAATG CTATGTCGGT AGGAATTGTG GAGCACCATA AAACCGTGAG TGCCACGGCT
TACGGCACAG GCAATCCCGT GCTTATTGTA GGCTCATCAA CGGGACGGGA CGGCATACAC
GGCGCTACCT TTGCGTCAGA AGATTTAAGC GAAGCCTCAG AGGAGAAGCG CCCAAGTGTG
CAGGTGGGCG ATCCGTTTGC CGAAAAACTC TTGCTTGAAG CCACACTTGA AGCCATTGAA
ACAGGTTATG TGGTTGGCTT GCAAGATATG GGTGCAGCGG GAATTACCAG CTCCACCTCC
GAAATGAGCG CTCGTGGAAT TGAAAAATAT GGAGTTGGGG GAATTGAAAT TGACCTCGAT
TTAGTACCAA TTCGTGAAGC GGGAATGAGT GCGTATGAAA TTATGCTCTC CGAGTCGCAA
GAGCGCATGT TGATTGTAGC GGCTAAAGGA TTTGAGGATA AAATTATTGA GGTCTATCAA
AAGTGGGATG TTCAAGCCGT CGTGATTGGT GAAGTAACTG ACGACAACCA TGTGCGCGTT
AAACATCAAG GTCAGGTAGT GGCAAACATT CCTGCTATTT CGTTGGTGCT GGGCGGTGGT
GCTCCCGTTT ACAAGCGTGA AGCAAAGGAA AAAAAGCCCG AAACGCCTTT AGCAAACATG
GTGGCGGATA GCACGCTTAA TTTTAATGAA TTAGGACTTG CTTTGCTCTC GCGCCCAAAC
ATTGCCAGCA AGCAATGGGT TTATCGCCAA TACGACTCAA TGGTGCAAAC CAATACCCTT
ACCCCAACGG GTCAAACCGA TGCGGCGGTG ATTCGCATAA AAGGCACCAA CAAAGGGGTT
GCTATGAAAA CCGACTGCAA TGCTCGTTAT GTTTATCTTA ATCCGCTTGC AGGCGGCAAA
ATTGCCGTTG CCGAATGCGC TCGCAACATT GCTTGTACCG GTGCTCGTCC GCTTGCCATT
ACCAACTGCT TGAACTTTGG TAATCCCCTG AAACCAGAAG TTTATTTCCA ATTTAAGGAA
TCAGTACGTG GCATGGGCGA AGCGTGCCGC ACCTTTAACA CACCTGTAAC TGGTGGAAAT
GTAAGCTTTT ACAACGAAAC CTTTATTGCT GGACAACGCA CAGCCATTTA CCCAACCCCC
ATGATTGGCA TGATTGGCTT GCTCGATAAT ATTGAAAATC TTGTTGGCTC CACCTTTACC
GCATCGGGTG ATCGCATTTT GTTGCTGGGC AATCCTCAGC TCACGCTTGA TGGCTCCGAA
TATCTTGTGA TGCAATATGG CACGCCAGGG CAGGATGCCC CAGCCGTTGA TCTTGAGCAT
GAAGCCCAGT TACAACGCTT GCTTGTTGCG CTTGCCGAGC AAAAGCTACT CCATTCAGCG
CACGATGTTT CCGATGGTGG TTTGTTGGTT GCGCTTGCAG AAAAAGCGAT GATGAATCAA
GAGATGCCAT TGAGCTTTAG AGTACATCTA TCAAATAACG ATAAGAGCGA AACAGCAATT
CAGCAGCAGC TCTTTTCGGA AGCACAAGGA CGCGTTGTAC TGAGTGCTGC CCCCGAAGCC
GTTGCGGCAA TTATGGCATT AGCAAACGAC TATAACCTGC CGATTCAGGA TATTGGAGAA
GTGGTAAATC AGCAAACCAT TTCACTTTCC ATTAATGAGC AAGAGGTAGT GAATTTACCG
CTGAGCAATG TAGCCCATGC TTACTATCAC GCATTGGAAC ATGCACTGCA TTTAGATGAA
TTGTAG
 
Protein sequence
MKHTDIEVTL THAEEHGLSA EEFSQICTIL GRTPTITELG IFSVMWSEHC SYKNSIAVLK 
TLPREGGALL TSAGEENAGL VDIGDNLAVA FKIESHNHPS AVEPYQGAAT GVGGIHRDIF
TMGARPVASL NSLRFGSPKD PRVRYLVDGV VRGIGDYGNS FGVPTVAGDI YFEEGYTGNP
LVNAMSVGIV EHHKTVSATA YGTGNPVLIV GSSTGRDGIH GATFASEDLS EASEEKRPSV
QVGDPFAEKL LLEATLEAIE TGYVVGLQDM GAAGITSSTS EMSARGIEKY GVGGIEIDLD
LVPIREAGMS AYEIMLSESQ ERMLIVAAKG FEDKIIEVYQ KWDVQAVVIG EVTDDNHVRV
KHQGQVVANI PAISLVLGGG APVYKREAKE KKPETPLANM VADSTLNFNE LGLALLSRPN
IASKQWVYRQ YDSMVQTNTL TPTGQTDAAV IRIKGTNKGV AMKTDCNARY VYLNPLAGGK
IAVAECARNI ACTGARPLAI TNCLNFGNPL KPEVYFQFKE SVRGMGEACR TFNTPVTGGN
VSFYNETFIA GQRTAIYPTP MIGMIGLLDN IENLVGSTFT ASGDRILLLG NPQLTLDGSE
YLVMQYGTPG QDAPAVDLEH EAQLQRLLVA LAEQKLLHSA HDVSDGGLLV ALAEKAMMNQ
EMPLSFRVHL SNNDKSETAI QQQLFSEAQG RVVLSAAPEA VAAIMALAND YNLPIQDIGE
VVNQQTISLS INEQEVVNLP LSNVAHAYYH ALEHALHLDE L