Gene Clim_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1967 
Symbol 
ID6355471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2182981 
End bp2184837 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content52% 
IMG OID642669565 
ProductCarbamoyl-phosphate synthetase large chain domain protein 
Protein accessionYP_001943978 
Protein GI189347449 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000143423 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTC AAGTTTCCAG CCTTTCACAA GAGCTTTCCG GCCTTGCGAG CAAACTCCCT 
AAAAAAACGC TGATAAAGGC AAAAGAGCAT GGATTTTCCG ATTGTCAGCT TGCCAATATT
TTTAAAACCA CGGAAACCGT CATACGAACA CTGAGAAAAC AGTACGGTGT GGAATCGGTA
TTCAAAACCG TCGATACCTG CGCTGCCGAA TTCGACGCGA AAACCCCGTA CCATTACTCG
ACGTACGATG AAGAGAACGA GTCTGTGCGT TCCGACAGGA AAAAAGTCAT TATCCTCGGA
GGCGGCCCGA ACCGTATCGG TCAGGGCATA GAATTCGATT ATTGCTGCGT ACAGGCGGTT
TTCGCTCTCC GCGAGGCCGG CTATGAGACC ATCATGGTCA ACTGCAACCC CGAAACGGTT
TCGACCGACT ACGACATCGC CGACAAGCTC TATTTCGAGC CATTGACGTT TGAGGACACG
ATCCGTATCA TCGAGCATGA ACAGCCGCTC GGTGTGATCG TCAGCTTCGG AGGTCAGACC
CCCCTGAAGC TCTCGACAAA ACTGGACGAG GCCGGCGTTA CCATTCTCGG AACATCCTCG
AAGGGCATCG ATCTTGCGGA GGACCGCAAG AAATTCGGCG CTCTGCTCGA AAAACTCGAC
ATTCTCCATC CGGATTACGG CACCGCCATC TGTTTTGATG AAGCGCTCGC CATTACCGAA
AGAATCGGGT ATCCGGTTCT GGTTCGACCA AGCTATGTGC TTGGCGGAAG AGCCATGAAA
ATCATCTATA ACAAAGACTC TCTCAAGGAG TACGTCGATC AGGCGCTTTT CATTTCTGAA
AAATATCCGC TGCTTATCGA CCGATTCCTT GAAACTGCCG TTGAGTTCGA CATCGATGCC
ATTGCCGATA CTACCGACTG CGTTATCAGC GGCATCATGC AGCATGTGGA GGCGGCAGGC
ATTCACAGCG GCGATTCAAC CTCGATCCTT CCCTATCGCA ATATCAGCCA GGAAGTGATC
AATACCATGA AAGCCTATAC CAGGACGCTT GCCGAACATC TGAAGGTTGT CGGCCTCATG
AACGTTCAGT ATGCCGTCCA GAACGAAAGC GTTTACGTGA TCGAAGTGAA TCCGAGAGCG
AGCCGTACGG TGCCGTTCGT TGGCAAGGCC ACTGCGGTTC CGGTTGTAAA AATCGCAACG
CGGGTGATGC TTGGCGAGAA ACTCAGCGAC CTTCGCAAAG AGTACGATCT GAAGGATTGC
GACGAACTCG GCATGAAGCA TATGGCCATA AAGGAGCCGG TATTTCCATT CTCGAAGTTC
GTTAAATCAG GCGTTTACCT CGGCCCGGAA ATGCGCTCCA CCGGCGAAGC CATGAGCCTT
GCAGAACAGT TTCCGGAGGC TTTCGCCAAA GCGTATCAGG CTGCGAACAT GGAACTTCCG
CTTTCAGGGT CGGTCTTTAT CAGCGTAAAC GATCAGGACA AAAGCCAGCG CATTATCGCG
ATTGCCAAAG AGCTTTACCG CATGGATTTC GATCTTGTCG CCACGGCCGG AACCCACCGT
TTCCTTATCG AAAACGGAAT AGAGTGCAAA AAAGTCTTCA AGGTAGGCGA AGAGGGGCGT
CCGAACATTT TCGACATCAT CAAACACGGC AAGATCGATT TTGTCATCAA CACACCCAGG
GGGGAAAAGG CGCTGCATGA CGAGGAGGCT ATCGGCGCGG CATCGGTACT GAGCAACGTG
CCGTTCGTCA CCACCATCGA GGCCGCCGAA GCATCGGTTC AGGCTATCGA CTGCATCCGG
CGCCAGGAAT TCGGTGTCAA GAGTCTGCAG GAGTATTCGG CATATCGAAA CAAGTGA
 
Protein sequence
MTTQVSSLSQ ELSGLASKLP KKTLIKAKEH GFSDCQLANI FKTTETVIRT LRKQYGVESV 
FKTVDTCAAE FDAKTPYHYS TYDEENESVR SDRKKVIILG GGPNRIGQGI EFDYCCVQAV
FALREAGYET IMVNCNPETV STDYDIADKL YFEPLTFEDT IRIIEHEQPL GVIVSFGGQT
PLKLSTKLDE AGVTILGTSS KGIDLAEDRK KFGALLEKLD ILHPDYGTAI CFDEALAITE
RIGYPVLVRP SYVLGGRAMK IIYNKDSLKE YVDQALFISE KYPLLIDRFL ETAVEFDIDA
IADTTDCVIS GIMQHVEAAG IHSGDSTSIL PYRNISQEVI NTMKAYTRTL AEHLKVVGLM
NVQYAVQNES VYVIEVNPRA SRTVPFVGKA TAVPVVKIAT RVMLGEKLSD LRKEYDLKDC
DELGMKHMAI KEPVFPFSKF VKSGVYLGPE MRSTGEAMSL AEQFPEAFAK AYQAANMELP
LSGSVFISVN DQDKSQRIIA IAKELYRMDF DLVATAGTHR FLIENGIECK KVFKVGEEGR
PNIFDIIKHG KIDFVINTPR GEKALHDEEA IGAASVLSNV PFVTTIEAAE ASVQAIDCIR
RQEFGVKSLQ EYSAYRNK