Gene Cagg_1133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1133 
Symbol 
ID7267799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1399547 
End bp1401520 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content62% 
IMG OID643565976 
ProductCarbamoyl-phosphate synthase L chain ATP-binding 
Protein accessionYP_002462479 
Protein GI219848046 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0279279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAACC GCATCTTGAT CGCAAATCGC GGCGAGATTG CCGTGCGGAT CATTCGCGCT 
TGCCGCGAAT TGGGCATTAG CCCAGTGGTT GTCTATTCCG AAGCCGACGC TAAGGCGCTG
CACGTGCGCA TGGCCGATGC TGCGGTGTTG ATCGGACCAC CACCCGCCGC CCAAAGCTAC
CTCCGCGCCG AAGCAATCGT TGAGGCCGCA GTGAAACTCA AGGCAGAGGC CATCCACCCC
GGTTATGGCT TTTTGAGTGA AAGTGTGAAG TTGGCCGAAC AGTGTGCCGC AGCAGGGATC
ACCTTTATCG GGCCACCCCC GCACGCCATT GCCGCGATGG GGGGAAAGGC TGAAGCACGC
GCGTTGGCCC AAGCTGCCGG TGTGCCGGTT GTGCCCGGCT ACGACGGCGA AGATCAGAGC
GACGAACGCC TGATTGCCGA GGCACGACGG ATCGGACCGC CGTTGTTGAT CAAAGCAAGT
GCCGGTGGTG GTGGCAAAGG AATGCGCTCG GTTGGCGATC TCGCCGAGAT ACCGGCGGCT
ATCGAAGGCG CCCGCCGTGA GGCACGGGCG GCTTTCGGCG ACGACCGCCT GATCATCGAG
CGGCTCGTCC TGCGACCACG CCACGTCGAG ATTCAAGTCT TGGCCGACCG CTACGGCAAT
GTCGTGCATC TCGGTGAACG CGATTGTTCA ATTCAGCGCC GTCATCAGAA GATCGTTGAA
GAGGCACCTT CCCCGGCTCT GACCCCAGCA CTCCGCGCTG CGATGGGAAA TGCTGCCGTT
GCCATCGCTC GCGCTGCCGG TTATGTCAAT GCCGGTACGG TTGAGTTTAT TCTGGCGCCA
AACGGCGAGT TCTACTTTTT GGAAATGAAT ACCCGGCTGC AAGTGGAGCA TCCCGTAACC
GAGCTTGTCT GCGGCTACGA TCTCGTCCAC TTGCAGATTG CGATTGCCGC CGGCGAGCCG
CTCCCCTTCC GCCAAGAAGA GATTACCGTG CGCGGCCACG CGATTGAGGT ACGGTTGTAC
GCCGAAGACC CACGCACCTA TTTGCCGGCA GTCGGTAAAG TGGCGCTGTT TGTCGCTCCT
CAAGGACCGG GCGTGCGCGT TGATGCCGGT CTGACCGGCG GCGATGAGGT GATGGTGCAT
TACGATCCGT TGCTGGCCAA GATTATTGTG TTCGGCGCCG ACCGGCCCCA AGCCGTAGCA
CGCTTACGCC GTGCGTTACG CGAGATGGCT GTACTCGGTC CGACCACCAA TCTACCACTC
TTACAGGCAA TTGCCGAGCA TCCCGCGTTC GCCGATGGCG CGACTCATAC CGGCTTCCTC
ACCGAGCACG AGGGGATTGT TCCACCGGCA GGCCCTCCAC CACGCGAAGT ACTCGTCGCC
GCGGCCATCC TCACTGTCAC CGCCGAGCCA CCGGCCCGCG ATCCGCTGGC CGCCGTCTGG
CGACTTGGCG GTGATACCAT TCCGCTGACA TTCACTGCGC ACGGCGAGCA CCGGCTGCGA
GTGACGCCGC AAACCTTTGG GTGGCACGTA GCCGGCGATC ATTGGCATGT CCATGCGACG
TTGGTGCGGC GTGGCGATTA CGAATTGGCT CTTGATATTG ATGGTCAGCG GAGACAATTC
TTCTTTGCAC GAGCCAACGA CGGTTGGCTG ATCGGTTGGC GTGGCGAAGC GTATCATGTG
CAACGACCGG CCCCGCTTAC CGCCGACACG GTTATTCGCA CCGCCGACCA GAATGCTGCG
CGTTTCAATG CTCCGATGCC GGGTACCATC GTGCGGCTAC ACGTTGCCGT CGGTGAACAG
GTGCGCGAGG GGCAACCCCT CCTCGTTCTC GAAGCGATGA AGATGGAACA TACCATTGTC
GCGCCATACG CCGGCATTGT CCGCCGCTTG CCGTATCAGA CCGGCGCAAG TGTTGCCGCC
GGTGCTCACC TCGTCGATCT CGAACCGCTG CCGGCCAATG AGCAGCCGGA TTGA
 
Protein sequence
MFNRILIANR GEIAVRIIRA CRELGISPVV VYSEADAKAL HVRMADAAVL IGPPPAAQSY 
LRAEAIVEAA VKLKAEAIHP GYGFLSESVK LAEQCAAAGI TFIGPPPHAI AAMGGKAEAR
ALAQAAGVPV VPGYDGEDQS DERLIAEARR IGPPLLIKAS AGGGGKGMRS VGDLAEIPAA
IEGARREARA AFGDDRLIIE RLVLRPRHVE IQVLADRYGN VVHLGERDCS IQRRHQKIVE
EAPSPALTPA LRAAMGNAAV AIARAAGYVN AGTVEFILAP NGEFYFLEMN TRLQVEHPVT
ELVCGYDLVH LQIAIAAGEP LPFRQEEITV RGHAIEVRLY AEDPRTYLPA VGKVALFVAP
QGPGVRVDAG LTGGDEVMVH YDPLLAKIIV FGADRPQAVA RLRRALREMA VLGPTTNLPL
LQAIAEHPAF ADGATHTGFL TEHEGIVPPA GPPPREVLVA AAILTVTAEP PARDPLAAVW
RLGGDTIPLT FTAHGEHRLR VTPQTFGWHV AGDHWHVHAT LVRRGDYELA LDIDGQRRQF
FFARANDGWL IGWRGEAYHV QRPAPLTADT VIRTADQNAA RFNAPMPGTI VRLHVAVGEQ
VREGQPLLVL EAMKMEHTIV APYAGIVRRL PYQTGASVAA GAHLVDLEPL PANEQPD