Gene Tneu_0135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0135 
SymbolcarB 
ID6164717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp121924 
End bp125001 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content62% 
IMG OID641667301 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001793538 
Protein GI171184619 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0153019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACG TTAGAAAGAT CTTGGTGGTA GGGTCGGGCG CCATCAAGGT GGCGGAGGCG 
GCTGAGTTCG ACTACTCGGG CTCGCAGGCC TTAAAGGCGT TTAGGGAGGA GGGGATAGAG
ACGGTGCTCG TCAACCCCAA CATAGCCACT ATACAGACGT CGAAGCTTCT GGCGGATAAG
GTCTACTTCG TGCCTATACA GAGGCAGTTC CTGGCGGAGG TCATAGAGAG GGAGAGGCCT
GACGCAATAG CGTGCGGCTT CGGCGGACAG ACGGCGCTGT CGGCCTGCGT AGATCTACAC
GACTCCGGCG TCTTGGATAA ATACGGCGTT AAGGTCGTGG GGACGCCGGT GCGGGGGATA
AAGAGGGCTC TCTCGAGGGA TCTGTTTCAA AAGGCCATGA GGGAGGTCGG CATACCGATC
CCGCCCAGTA GCCCCGCGAG GTCGCCCGAG GAGGCGCTGA AGGTCGCGCG GGAGATAGGC
TACCCGGTGG TCGTGAGGGT GAGCTTCAAC CTAGGCGGCG CGGGCGCCTT CGTCGCCAGG
AGCGAGGAGG ACCTAAGGGC CAGGGTGTAC AAGGCCTTCG CCCAGTCCGC AATTGGGGAA
GTCCTTGTGG AGAAGTACCT GGAGGGGTGG AAGGAGGTGG AGTTCGAGGT GGTGCGCGAC
GCCTACGACA ACGTCGCAGC TGTGGTCTGT ATGGAGAACA TAGACCCAAT GGGCGTACAC
ACGGGGGACT CCATAGTGGT GGCCCCCTGC CTCACCTTGA CAGATGAGGA GTACCAGACT
GCCAGGAACA TCTCCATCGG CGTGGCGCGC GCCATCGAGC TAGTGGGCGA GGGCAACGTC
CAGGTGGCGG TCAACTACGC CGGGCCTGAG CAGTACGCCA TAGAGACAAA CCCACGTATG
TCCCGCTCCA GCGCCCTCGC CTCCAAGGCC TCGGGCTACC CCCTGGCCTT CATCGCGGCT
AAGTTGGCCT TGGGCTACCG CCTAGACGAG GTTTTGAACC AGGTGACGAG GCGGACAGTG
GCGGCGTTTG AGCCCGCGCT TGACTACATA GTGGTTAAAC ACCCGAGGTG GGAGAACGAC
AGATTCGGCG TATCCGAGGG CCTGGGGCCG GAGATGATGT CTATCGGCGA GGCCATGGCG
GTGGGGAGGA CGCTGGAGGA GGCTTGGCAG AAGGCGGTTA GGATGATCGA CATAGGCGAG
CCCGGCCTAG TTGGCGGGCC CATGTTTAGG GAACTTACGC TTGAGGAGGC GAGGCGGTGT
CTAGAGGGGT ACAGGCCCTA CTGGCCGATA TGCGCTGCCA AGGCCATGTA CCTAGGCCTC
TCTATAGACG AGGTGTACAG CTATGTGAAG GTGGATAGGT TCTTCCTAAG GGCTATACAG
CGCGTCGTAG AGGCCTACAA GGCGCTTGAG CAAGGCCGGT ACGACCTGGA GGAGCTGAAG
GTCTTGGGCT TCTCAGACGG CCAGATAGCA AGAGCGCTGG GGGTCGAAGA GGAGGAGGTG
AGGCGGGCGA GGAGGCGGCC GGTGGTGAAG AGGATAGATA CGCTAGCCGG CGAGTGGCCG
GCCGAGACGA ACTACCTCTA CCTCACCTAC GGCGGCGTGT ACGACGACGA CGTGCCGCGG
GTGGACTACC TAGTGGTGGG CGCCGGAGTC TTTAGAATAG GCGTGAGCGT GGAGTTCGAC
TGGTCCACGG TGAACCTGGC GCAGGAGCTG AGAAACAGGG GGTTTAAGGT GGCTATCCTG
AACTACAACC CCGAGACGGT GTCGACGGAC TGGGACATAG TGGATAAGCT CTACTTCGAC
GAGATCTCCA GCGAGAGGAT ACTGGACATA GTGGAGAAGG AGGGCGGCGG CGTCGCGGTT
GTCCTCTACG CGGGAGGCCA GATAGGCCAG AGGCTTTATA AGCGTCTTGA GGCGGCGGGG
GTGAAGATCG GGGGGACCCG CGCCGCCTCT ATAGACGCGG CGGAGGACAG GAGCAAGTTT
TCGGAGCTTC TGGAAAAGCT CGGCATAAAA CAGCCGCCGT GGTTCGCCGC TAGATCCCCA
GAGGAGGCGG CTAAGCTCGC CGAGGCGCTG GGCTACCCCG TGTTGGTGAG GCCCAGCTAC
GTCCTAGGCG GCACCTATAT GGCCGTGGCT TACGACAGGG AGGAGCTCCT GAGCTTCCTC
ACAAAGGCGG CTAGGGTGAG CGGGGAGTAC CCGGCGGTTG TGTCCAAGTT CATGCCGCGC
GGCGTTGAGG CGGAGGTAGA CGCCGTGTCT GACGGAGTTC GGCTCGTCGC CACACCAATC
GAGCACGTGG AGCCGCCTGG CGTTCATTCC GGCGACTCCA CCATGGTGCT CCCGCCGAGG
AGGCTGGAGG AGGGGGCCGT CAAGAAGATG GTTGAGGCTA CTCAGAGGAT CGCCGCCGAG
CTCGGGGTCA AGGGCCCTCT CAACGTCCAG TTCATAGTCT ACGATGACGT GTACGTAATA
GAGGCGAACC TCAGGGTAAG CCGCTCCATG CCCTTCGTGA GCAAGGCCAC GGGGGTGAAC
TACATGTCTC TGACGGCCGA CGTGTTGGTG AACGGCCGCC TAGCCGTAGA CGAGGAGGTC
GTGGTGCTTA AGCCGACGAA GTGGTGGGTG AAGTCTCCGC AGTTCTCTTG GTCTAGGCTG
AGGGGGTCGT ACCCGCGGCT GGGGCCTGTG ATGTACAGCA CTGGGGAGGT GGCCTCAAAC
GGGGCCACAT ACGAGGAGGC TCTGCTGAAG AGCTGGCTGT CCGCAGCGCC GAATAGGATA
CCGGAGAGAT CCGCACTGAT ATATACATAT GATAAACACG GCGCGGAGGC CCTTGGGCAA
GCGGCTTCTC TGCTGGCCGG CAGGCTGGAG GTACACACCC CCGAGTCGCT GGGGGAGAAG
GCCGTGGAAA TGTTAAAGTG GAAGAAGATA GACATAGTTA TGACGTCTGG CGTAACGCCG
GAGAGGGATT TCCACATCAG GAGAACCGCG GCCGACACCA ACACGCCGTT GGTGCTTGAC
GCGTCGCTGG CGCTGGAGTT AGCCAAGGCG TTTACGTGGT ACTACAAAAA CGGGAAGCTC
GAGGTAGCGC CGTGGTAG
 
Protein sequence
MPDVRKILVV GSGAIKVAEA AEFDYSGSQA LKAFREEGIE TVLVNPNIAT IQTSKLLADK 
VYFVPIQRQF LAEVIERERP DAIACGFGGQ TALSACVDLH DSGVLDKYGV KVVGTPVRGI
KRALSRDLFQ KAMREVGIPI PPSSPARSPE EALKVAREIG YPVVVRVSFN LGGAGAFVAR
SEEDLRARVY KAFAQSAIGE VLVEKYLEGW KEVEFEVVRD AYDNVAAVVC MENIDPMGVH
TGDSIVVAPC LTLTDEEYQT ARNISIGVAR AIELVGEGNV QVAVNYAGPE QYAIETNPRM
SRSSALASKA SGYPLAFIAA KLALGYRLDE VLNQVTRRTV AAFEPALDYI VVKHPRWEND
RFGVSEGLGP EMMSIGEAMA VGRTLEEAWQ KAVRMIDIGE PGLVGGPMFR ELTLEEARRC
LEGYRPYWPI CAAKAMYLGL SIDEVYSYVK VDRFFLRAIQ RVVEAYKALE QGRYDLEELK
VLGFSDGQIA RALGVEEEEV RRARRRPVVK RIDTLAGEWP AETNYLYLTY GGVYDDDVPR
VDYLVVGAGV FRIGVSVEFD WSTVNLAQEL RNRGFKVAIL NYNPETVSTD WDIVDKLYFD
EISSERILDI VEKEGGGVAV VLYAGGQIGQ RLYKRLEAAG VKIGGTRAAS IDAAEDRSKF
SELLEKLGIK QPPWFAARSP EEAAKLAEAL GYPVLVRPSY VLGGTYMAVA YDREELLSFL
TKAARVSGEY PAVVSKFMPR GVEAEVDAVS DGVRLVATPI EHVEPPGVHS GDSTMVLPPR
RLEEGAVKKM VEATQRIAAE LGVKGPLNVQ FIVYDDVYVI EANLRVSRSM PFVSKATGVN
YMSLTADVLV NGRLAVDEEV VVLKPTKWWV KSPQFSWSRL RGSYPRLGPV MYSTGEVASN
GATYEEALLK SWLSAAPNRI PERSALIYTY DKHGAEALGQ AASLLAGRLE VHTPESLGEK
AVEMLKWKKI DIVMTSGVTP ERDFHIRRTA ADTNTPLVLD ASLALELAKA FTWYYKNGKL
EVAPW