Gene Information |
Locus tag | Tneu_0135 |
Symbol | carB |
ID | 6164717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 121924 |
End bp | 125001 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641667301 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001793538 |
Protein GI | 171184619 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
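
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon. Both assumptions are consistent with the numbers in the table, but the record does not state them. The function names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 121924
END_BP = 125001
GENE_LENGTH_BP = 3078
PROTEIN_LENGTH_AA = 1025


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Both checks pass for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 125001 - 121924 + 1 = 3078 bp, and 3078 / 3 = 1026 codons, one of which is the stop codon, leaving 1025 residues.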
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0153019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGACG TTAGAAAGAT CTTGGTGGTA GGGTCGGGCG CCATCAAGGT GGCGGAGGCG GCTGAGTTCG ACTACTCGGG CTCGCAGGCC TTAAAGGCGT TTAGGGAGGA GGGGATAGAG ACGGTGCTCG TCAACCCCAA CATAGCCACT ATACAGACGT CGAAGCTTCT GGCGGATAAG GTCTACTTCG TGCCTATACA GAGGCAGTTC CTGGCGGAGG TCATAGAGAG GGAGAGGCCT GACGCAATAG CGTGCGGCTT CGGCGGACAG ACGGCGCTGT CGGCCTGCGT AGATCTACAC GACTCCGGCG TCTTGGATAA ATACGGCGTT AAGGTCGTGG GGACGCCGGT GCGGGGGATA AAGAGGGCTC TCTCGAGGGA TCTGTTTCAA AAGGCCATGA GGGAGGTCGG CATACCGATC CCGCCCAGTA GCCCCGCGAG GTCGCCCGAG GAGGCGCTGA AGGTCGCGCG GGAGATAGGC TACCCGGTGG TCGTGAGGGT GAGCTTCAAC CTAGGCGGCG CGGGCGCCTT CGTCGCCAGG AGCGAGGAGG ACCTAAGGGC CAGGGTGTAC AAGGCCTTCG CCCAGTCCGC AATTGGGGAA GTCCTTGTGG AGAAGTACCT GGAGGGGTGG AAGGAGGTGG AGTTCGAGGT GGTGCGCGAC GCCTACGACA ACGTCGCAGC TGTGGTCTGT ATGGAGAACA TAGACCCAAT GGGCGTACAC ACGGGGGACT CCATAGTGGT GGCCCCCTGC CTCACCTTGA CAGATGAGGA GTACCAGACT GCCAGGAACA TCTCCATCGG CGTGGCGCGC GCCATCGAGC TAGTGGGCGA GGGCAACGTC CAGGTGGCGG TCAACTACGC CGGGCCTGAG CAGTACGCCA TAGAGACAAA CCCACGTATG TCCCGCTCCA GCGCCCTCGC CTCCAAGGCC TCGGGCTACC CCCTGGCCTT CATCGCGGCT AAGTTGGCCT TGGGCTACCG CCTAGACGAG GTTTTGAACC AGGTGACGAG GCGGACAGTG GCGGCGTTTG AGCCCGCGCT TGACTACATA GTGGTTAAAC ACCCGAGGTG GGAGAACGAC AGATTCGGCG TATCCGAGGG CCTGGGGCCG GAGATGATGT CTATCGGCGA GGCCATGGCG GTGGGGAGGA CGCTGGAGGA GGCTTGGCAG AAGGCGGTTA GGATGATCGA CATAGGCGAG CCCGGCCTAG TTGGCGGGCC CATGTTTAGG GAACTTACGC TTGAGGAGGC GAGGCGGTGT CTAGAGGGGT ACAGGCCCTA CTGGCCGATA TGCGCTGCCA AGGCCATGTA CCTAGGCCTC TCTATAGACG AGGTGTACAG CTATGTGAAG GTGGATAGGT TCTTCCTAAG GGCTATACAG CGCGTCGTAG AGGCCTACAA GGCGCTTGAG CAAGGCCGGT ACGACCTGGA GGAGCTGAAG GTCTTGGGCT TCTCAGACGG CCAGATAGCA AGAGCGCTGG GGGTCGAAGA GGAGGAGGTG AGGCGGGCGA GGAGGCGGCC GGTGGTGAAG AGGATAGATA CGCTAGCCGG CGAGTGGCCG GCCGAGACGA ACTACCTCTA CCTCACCTAC GGCGGCGTGT ACGACGACGA CGTGCCGCGG GTGGACTACC TAGTGGTGGG CGCCGGAGTC TTTAGAATAG GCGTGAGCGT GGAGTTCGAC TGGTCCACGG TGAACCTGGC GCAGGAGCTG AGAAACAGGG GGTTTAAGGT GGCTATCCTG AACTACAACC CCGAGACGGT GTCGACGGAC TGGGACATAG TGGATAAGCT CTACTTCGAC 
GAGATCTCCA GCGAGAGGAT ACTGGACATA GTGGAGAAGG AGGGCGGCGG CGTCGCGGTT GTCCTCTACG CGGGAGGCCA GATAGGCCAG AGGCTTTATA AGCGTCTTGA GGCGGCGGGG GTGAAGATCG GGGGGACCCG CGCCGCCTCT ATAGACGCGG CGGAGGACAG GAGCAAGTTT TCGGAGCTTC TGGAAAAGCT CGGCATAAAA CAGCCGCCGT GGTTCGCCGC TAGATCCCCA GAGGAGGCGG CTAAGCTCGC CGAGGCGCTG GGCTACCCCG TGTTGGTGAG GCCCAGCTAC GTCCTAGGCG GCACCTATAT GGCCGTGGCT TACGACAGGG AGGAGCTCCT GAGCTTCCTC ACAAAGGCGG CTAGGGTGAG CGGGGAGTAC CCGGCGGTTG TGTCCAAGTT CATGCCGCGC GGCGTTGAGG CGGAGGTAGA CGCCGTGTCT GACGGAGTTC GGCTCGTCGC CACACCAATC GAGCACGTGG AGCCGCCTGG CGTTCATTCC GGCGACTCCA CCATGGTGCT CCCGCCGAGG AGGCTGGAGG AGGGGGCCGT CAAGAAGATG GTTGAGGCTA CTCAGAGGAT CGCCGCCGAG CTCGGGGTCA AGGGCCCTCT CAACGTCCAG TTCATAGTCT ACGATGACGT GTACGTAATA GAGGCGAACC TCAGGGTAAG CCGCTCCATG CCCTTCGTGA GCAAGGCCAC GGGGGTGAAC TACATGTCTC TGACGGCCGA CGTGTTGGTG AACGGCCGCC TAGCCGTAGA CGAGGAGGTC GTGGTGCTTA AGCCGACGAA GTGGTGGGTG AAGTCTCCGC AGTTCTCTTG GTCTAGGCTG AGGGGGTCGT ACCCGCGGCT GGGGCCTGTG ATGTACAGCA CTGGGGAGGT GGCCTCAAAC GGGGCCACAT ACGAGGAGGC TCTGCTGAAG AGCTGGCTGT CCGCAGCGCC GAATAGGATA CCGGAGAGAT CCGCACTGAT ATATACATAT GATAAACACG GCGCGGAGGC CCTTGGGCAA GCGGCTTCTC TGCTGGCCGG CAGGCTGGAG GTACACACCC CCGAGTCGCT GGGGGAGAAG GCCGTGGAAA TGTTAAAGTG GAAGAAGATA GACATAGTTA TGACGTCTGG CGTAACGCCG GAGAGGGATT TCCACATCAG GAGAACCGCG GCCGACACCA ACACGCCGTT GGTGCTTGAC GCGTCGCTGG CGCTGGAGTT AGCCAAGGCG TTTACGTGGT ACTACAAAAA CGGGAAGCTC GAGGTAGCGC CGTGGTAG
|
Protein sequence | MPDVRKILVV GSGAIKVAEA AEFDYSGSQA LKAFREEGIE TVLVNPNIAT IQTSKLLADK VYFVPIQRQF LAEVIERERP DAIACGFGGQ TALSACVDLH DSGVLDKYGV KVVGTPVRGI KRALSRDLFQ KAMREVGIPI PPSSPARSPE EALKVAREIG YPVVVRVSFN LGGAGAFVAR SEEDLRARVY KAFAQSAIGE VLVEKYLEGW KEVEFEVVRD AYDNVAAVVC MENIDPMGVH TGDSIVVAPC LTLTDEEYQT ARNISIGVAR AIELVGEGNV QVAVNYAGPE QYAIETNPRM SRSSALASKA SGYPLAFIAA KLALGYRLDE VLNQVTRRTV AAFEPALDYI VVKHPRWEND RFGVSEGLGP EMMSIGEAMA VGRTLEEAWQ KAVRMIDIGE PGLVGGPMFR ELTLEEARRC LEGYRPYWPI CAAKAMYLGL SIDEVYSYVK VDRFFLRAIQ RVVEAYKALE QGRYDLEELK VLGFSDGQIA RALGVEEEEV RRARRRPVVK RIDTLAGEWP AETNYLYLTY GGVYDDDVPR VDYLVVGAGV FRIGVSVEFD WSTVNLAQEL RNRGFKVAIL NYNPETVSTD WDIVDKLYFD EISSERILDI VEKEGGGVAV VLYAGGQIGQ RLYKRLEAAG VKIGGTRAAS IDAAEDRSKF SELLEKLGIK QPPWFAARSP EEAAKLAEAL GYPVLVRPSY VLGGTYMAVA YDREELLSFL TKAARVSGEY PAVVSKFMPR GVEAEVDAVS DGVRLVATPI EHVEPPGVHS GDSTMVLPPR RLEEGAVKKM VEATQRIAAE LGVKGPLNVQ FIVYDDVYVI EANLRVSRSM PFVSKATGVN YMSLTADVLV NGRLAVDEEV VVLKPTKWWV KSPQFSWSRL RGSYPRLGPV MYSTGEVASN GATYEEALLK SWLSAAPNRI PERSALIYTY DKHGAEALGQ AASLLAGRLE VHTPESLGEK AVEMLKWKKI DIVMTSGVTP ERDFHIRRTA ADTNTPLVLD ASLALELAKA FTWYYKNGKL EVAPW
|
| |
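
The gene and protein sequences above can be spot-checked against each other by translating the ends of the gene sequence. This is a minimal sketch rather than the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The names `CODON_TABLE`, `translate`, `GENE_PREFIX`, and `GENE_SUFFIX` are hypothetical; the two fragments are the first 30 and last 18 bases of the gene sequence with the spacing removed.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG order.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 18 bases of the gene sequence in this record.
GENE_PREFIX = "ATGCCGGACG TTAGAAAGAT CTTGGTGGTA"
GENE_SUFFIX = "GAGGTAGCGC CGTGGTAG"

print(translate(GENE_PREFIX))  # MPDVRKILVV
print(translate(GENE_SUFFIX))  # EVAPW*
```

The translated prefix matches the first ten residues of the protein sequence, and the translated suffix matches its last five residues followed by the TAG stop codon.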