Gene Cag_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1787 
Symbol 
ID3747207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2306406 
End bp2308271 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content46% 
IMG OID637774325 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_380081 
Protein GI78189743 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACCA TTACGCCCGA CCAAACGGTT CTTACCCTTG CCAAAAAATT GTCTGCCGAG 
CAGCTTTTTG CCGCTAAGCA AAATGGCTTT TCTGACCTGC AGCTTGCCAC TATTTTTAAA
ACATCCGATA CCGTTATTCG TGAGCTTCGT CGCCATTATG GCATTGCCTC CGTGTTTAAA
ACGGTTGATA CTTGTGCGGC GGAGTTTGAT GCAAAAACCC CTTATCACTA TTCAACCTAT
GAAGAGGAAA ATGAGTCGGT TTGCTCTGAT AGGAAAAAGG TGATTATTCT GGGTGGTGGA
CCTAACCGCA TTGGGCAAGG TATTGAGTTT GACTACTGCT GTGTACAAGC GGTGTTTGCT
TTGCGAGAAG CGGGTTACGA AACCATTATG GTTAATTGCA ATCCCGAAAC GGTTTCAACC
GATTACGATA TTGCCGATAA ACTTTACTTT GAGCCGCTTA CCTTTGAGGA TACCATTCGC
ATTATTGAGC ATGAAAAGCC ACTTGGTGTT ATTGTAAGCT TTGGCGGACA AACGCCACTC
AAGCTCTCTA CCCGTTTGCA TGAAGCGGGT GTAAAAATTC TTGGCACCTC ATCTAAAGGC
ATTGACTTAG CTGAGGATCG CAAAAAGTTT GGAGCCTTGC TTGTTGAGCT TGGTATTCCC
CATCCAGCTT ACGGCACGGC TATTAGTTTG GAAGAGGCAA AAGCCATTAC CCAACGTATT
GGCTATCCCG CCTTAGTTCG CCCCAGCTAT GTGCTTGGCG GACGTGCTAT GAAAATTGTC
TATAACGACG ATTCGCTGAA GGAGTACATT GATCAAGCGC TCTTTATTTC CGAAAAATAT
CCGCTCTTAA TTGATCGCTT TCTTGAAACT GCTGTGGAGT TTGATATTGA TGCCCTTGCC
GATAGCACCG ATTGTGTGAT TAGCGGCATT ATGCAGCATG TAGAAGCAGC GGGCATTCAC
AGTGGCGACT CCACCTCCAT TCTCCCTTAC CATAACATTA GCAAGCAGGC AATTGCTGCC
ATGAAGGAGT ACACCCGAAT GCTTGCTAAA AGCTTGAATG TTATTGGGTT AATGAATGTG
CAGTACGCTG TGCAAAACGA CACGGTGTAT GTTATTGAGG TGAACCCTCG TGCCAGCCGC
ACCGTGCCAT TTGTGGGTAA AGCCACCGCT ATTCCGGTGG TAAAAATTGC TACCCGCGTT
ATGCTTGGCG AAAAGCTCTG CGATTTGCGC AACGAGTACA ATTTAAAGGA TTGCGATGAA
CTTGGCATGA AGCACATGGC AATTAAAGAG CCTGTTTTCC CCTTCTCGAA GTTTGTAAAA
TCGGGCGTTT ATCTTGGTCC CGAAATGCGC TCTACGGGTG AAGCTATGAG TTTAGCGAAC
GACTTCCCCG AAGCTTTTGC AAAAGCCTAT CAAGCCGCAA ATATGCAGCT TCCGCTTTCG
GGCGCAGTGT TTATTAGTGT GAACGATCAA GATAAAAACC ATCGTATGCT TGCTATTGCT
CGCTCGTTGT ACGATATGGA TTTTGATTTA GTGGCAACGG CTGGTACATG GCAGTTCCTT
ACCGATAATG GTATTGAGTG CAAAAAAGTA TATAAAGTAG GTGAAGAGGG GCGTCCCAAT
ATTTTTGACA GCATCAAACA CGGCAAAGTT GATTTTGTGA TTAATACGCC ACGCGGCGAA
AAAGCACTGC ACGATGAAGA GGCAATTGGT GCGGCATCAG TGTTAAGCAA CGTGCCATTT
GTAACCACCA TTGAGGCGGC TGAAGCCTCC GTGCAAGCAA TTGGCTGCAT TCGCCATCAA
GAGTTTGGGG TAAAGAGCTT GCAAGAGTAT GCAGCGTATC GCGACACAGC TACCGCCACC
TGTTAA
 
Protein sequence
MSTITPDQTV LTLAKKLSAE QLFAAKQNGF SDLQLATIFK TSDTVIRELR RHYGIASVFK 
TVDTCAAEFD AKTPYHYSTY EEENESVCSD RKKVIILGGG PNRIGQGIEF DYCCVQAVFA
LREAGYETIM VNCNPETVST DYDIADKLYF EPLTFEDTIR IIEHEKPLGV IVSFGGQTPL
KLSTRLHEAG VKILGTSSKG IDLAEDRKKF GALLVELGIP HPAYGTAISL EEAKAITQRI
GYPALVRPSY VLGGRAMKIV YNDDSLKEYI DQALFISEKY PLLIDRFLET AVEFDIDALA
DSTDCVISGI MQHVEAAGIH SGDSTSILPY HNISKQAIAA MKEYTRMLAK SLNVIGLMNV
QYAVQNDTVY VIEVNPRASR TVPFVGKATA IPVVKIATRV MLGEKLCDLR NEYNLKDCDE
LGMKHMAIKE PVFPFSKFVK SGVYLGPEMR STGEAMSLAN DFPEAFAKAY QAANMQLPLS
GAVFISVNDQ DKNHRMLAIA RSLYDMDFDL VATAGTWQFL TDNGIECKKV YKVGEEGRPN
IFDSIKHGKV DFVINTPRGE KALHDEEAIG AASVLSNVPF VTTIEAAEAS VQAIGCIRHQ
EFGVKSLQEY AAYRDTATAT C