Gene Cphamn1_2349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2349 
Symbol 
ID6376044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2518460 
End bp2519560 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content50% 
IMG OID642684833 
Productcarbamoyl phosphate synthase small subunit 
Protein accessionYP_001960731 
Protein GI189501261 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCCA TACCCGCAAA ACTGGTTCTG GAAAACGGCT CCGTCTATAA AGGCGAAGCA 
TTTGGACATA TTGGCGAAGC CGCCGGAGAA GTGGTTTTCA ACACGTCGCT CACAGGATAT
CAGGAAATTC TTACCGATCC GTCTTACGCA GGACAGATGG TGCTGATGAC CTATCCTCTT
ATCGGAAATT ACGGTGTCAA CGAGACCGAT GAGGAATCGG GAAAAATCTG GGCTTCGGCG
ATCATAGTCC GTGAAGCCTC ACATATATAC AGCAATTTCG CGGCGACTGA CAGTCTCGAC
AACTACCTGA AAGAGTCGGA AGTCCTGGGT CTCGCAGGCA TCGACACCAG AAAACTGGTT
CGTGAAATCA GGGAAAAAGG CGCGATGAGA GGAGTTATAT CAGCAATTGA CGCCGATGAG
AAAAGTCTGC AGGAAAAAGC GATTGCCGTA CCTGAAATGA CCGGTCTCGA CCTTGTTCAA
AAAGTCAGTA CGCCACAGAG CTATACAGCA GATTGCCCGG ACGCACAATA CCATGTCGTT
GCCATGGATT TCGGCATCAA GAGAAACATT CTCAGAATGC TGCAGGATGC AGGATGCAGA
GTCACCGTTC TGAACGCCGG CGCGACAGCA GATGATATCC GGGATCTGAA TCCCGACGGT
CTTTTTCTTT CAAACGGGCC GGGAGATCCC TTTGCCGTAA CCTACGCAAT CGATACGATC
AGAACCCTTA TCCGGGAGAA CGGCGATTCA GCTCCTTTGC CGATATTCGG AATCTGTCTT
GGCCACCAGC TGCTTTCCCT GGCTTACGGA GCAAACACCT ACAAATTGAA GTTCGGACAT
CACGGCAGCA ATCATCCTGT TAAAAATCTT TCAACCGGAT CAATCGAGAT AACATCCCAG
AACCACGGAT TTGCCGTCGA GATGAGCTCA CTTCCGGAAG AACTTGAACT TACTCACCTC
AACCTTTACG ACAACACTGT CGAAGGTGTG CGGCATCGTG AGTTGCCCTG TTTTTCCGTC
CAATACCACC CCGAAGCGGC TCCGGGACCT CATGACTCAA ACTATCTTTT CAGTCTTTTC
ACCGATATGA TGGCCGGATA G
 
Protein sequence
MQPIPAKLVL ENGSVYKGEA FGHIGEAAGE VVFNTSLTGY QEILTDPSYA GQMVLMTYPL 
IGNYGVNETD EESGKIWASA IIVREASHIY SNFAATDSLD NYLKESEVLG LAGIDTRKLV
REIREKGAMR GVISAIDADE KSLQEKAIAV PEMTGLDLVQ KVSTPQSYTA DCPDAQYHVV
AMDFGIKRNI LRMLQDAGCR VTVLNAGATA DDIRDLNPDG LFLSNGPGDP FAVTYAIDTI
RTLIRENGDS APLPIFGICL GHQLLSLAYG ANTYKLKFGH HGSNHPVKNL STGSIEITSQ
NHGFAVEMSS LPEELELTHL NLYDNTVEGV RHRELPCFSV QYHPEAAPGP HDSNYLFSLF
TDMMAG