Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_1592 |
Symbol | |
ID | 3746118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1782529 |
End bp | 1784388 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637769625 |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_375489 |
Protein GI | 78187446 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.430094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAC ACGCTTCCGC CCTCAGCGCC GAGGTTGCCG CACTGGCGGC GAAACTCCCT CAGGACCGTC TCCGCAAAGC CAAGGAACAC GGTTTTTCTG ATACCCAGAT CGCCAACATA TTCAGCACTG AGGAAGCCGT CGTGCGGCAG CTCCGCAAGC AGTACGGCCT CGACTCGGTC TTTAAAACCG TCGACACCTG TGCTGCGGAA TTCGACGCAA AAACCCCCTA CCACTACTCC ACCTACGACG AAGAAAACGA ATCGGTCCGT TCCGAGAATA AAAAAGTGAT CATCCTCGGC GGCGGACCGA ACCGCATCGG CCAGGGAATC GAGTTCGACT ACTGCTGCGT GCAGGCGGTC TTCGCCCTCA GGGAAGCCGG CTACGAGACC ATCATGGTCA ACTGCAACCC CGAAACCGTC TCGACCGACT ACGACATCGC CGACAAGCTT TACTTCGAAC CGCTGACCTT CGAAGACACC ATCCGCATCA TCGAGCATGA GCAGCCGCTC GGCGTCATCG TCAGCTTCGG CGGACAGACC CCCCTGAAGC TCTCCGGCAG ACTGCACGAA GCAGGCGTAA CCATCCTCGG CACCTCTCCG GAAGGCATTG ACCTTGCCGA AGACCGGAAG AAGTTCGGCG CCCTGCTTGA CCGCCTCAAC ATCCCGCACC CCGAATACGG CACGGCTGTC TGCCTCGAGG AGGCCCAGGC CATCACCCGC CGGATCGGCT ATCCGGTCCT CGTCCGTCCA AGTTACGTGC TCGGCGGCCG GGCCATGAAA ATCATCTACA GCGACGACTC CCTGAAGGAG TACGTCGATC AGGCACTCTT CATCACTGAA AAGTACCCGC TCCTCGTCGA CCGCTTCCTG GAAACCGCCG TCGAGTTCGA CATCGACGCC ATTGCCGACG AGAGCGACTG CGTCATCAGC GGCATCATGC AGCATGTCGA GGCTGCAGGC ATCCACAGCG GCGACTCCAC CTCCATCCTC CCCTACCACA ACATCAGCCC GGAGGTCATC GCAAAGATGA AGGAGTACAC CCGCATCATG GCCCGTAACA TCAAGGTGGT CGGACTCATG AATGTGCAGT ACGCAGTGCA GAACAACAGC GTCTATGTGA TTGAAGTCAA CCCTCGGGCC AGCCGCACGG TGCCGTTCGT CGGCAAGGCG ACGGCCATCC CCGTCGTGAA GATCGCCACC CGCGTCATGC TCGGCGAAAA GCTCTCGGAC CTGCGCCGCG AATTCGATCT GAAAGACTGC GACGAACTCG GCATGAAACA CATGGCCATC AAGGAACCGG TCTTCCCCTT CTCGAAATTC GTCAAGTCCG GCGTCTACCT CGGCCCCGAA ATGCGCTCCA CCGGAGAGGC CATGAGCCTT GCTGAAGAGT TCCCCGAAGC GTTTGCCAAA GCCTACCAGG CGGCAAACAT GCACCTGCCG CTCTCCGGCT CGGTGTTCAT AAGCGTCAAC GACCAGGACA AGAACCACCG CATCCTCGAC ATCGCCCGTT CGCTGTACAG AATGGACTTC GACCTTGTCG CAACTGAAGG CACTTACCGG TTCCTGAAGG ACAACGGCAT TGAATGCAAA ATGGTGTTCA AGGTAGGTGA AGAGGGACGG CCGAACATTT TCGACATCAT CAAGCACGGC AAGATCAACT TCGTCATCAA CACACCGAGG GGCGAGCAGG CACTCCACGA CGAAGAGGCC ATCGGTGCCG CCTCGGTGCT CAGCAACGTG CCGTTCGTCA CCACCATCGA AGCCGCTGAA GCCTCCGTAC AGGCGATTGA CTGCATCCGC CACCAGGAGT TTGGAGTGAA AAGTCTCCAG GAATACGCAT CCTATCGCAA CGCCAGATAA
|
Protein sequence | MSTHASALSA EVAALAAKLP QDRLRKAKEH GFSDTQIANI FSTEEAVVRQ LRKQYGLDSV FKTVDTCAAE FDAKTPYHYS TYDEENESVR SENKKVIILG GGPNRIGQGI EFDYCCVQAV FALREAGYET IMVNCNPETV STDYDIADKL YFEPLTFEDT IRIIEHEQPL GVIVSFGGQT PLKLSGRLHE AGVTILGTSP EGIDLAEDRK KFGALLDRLN IPHPEYGTAV CLEEAQAITR RIGYPVLVRP SYVLGGRAMK IIYSDDSLKE YVDQALFITE KYPLLVDRFL ETAVEFDIDA IADESDCVIS GIMQHVEAAG IHSGDSTSIL PYHNISPEVI AKMKEYTRIM ARNIKVVGLM NVQYAVQNNS VYVIEVNPRA SRTVPFVGKA TAIPVVKIAT RVMLGEKLSD LRREFDLKDC DELGMKHMAI KEPVFPFSKF VKSGVYLGPE MRSTGEAMSL AEEFPEAFAK AYQAANMHLP LSGSVFISVN DQDKNHRILD IARSLYRMDF DLVATEGTYR FLKDNGIECK MVFKVGEEGR PNIFDIIKHG KINFVINTPR GEQALHDEEA IGAASVLSNV PFVTTIEAAE ASVQAIDCIR HQEFGVKSLQ EYASYRNAR
|
| |