Gene Acid345_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2197 
Symbol 
ID4071449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2617724 
End bp2621032 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content60% 
IMG OID637984213 
Productcarbamoyl-phosphate synthase large subunit 
Protein accessionYP_591272 
Protein GI94969224 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGTC GTAACGACAT CTCAAAGATC CTCATCATTG GCTCCGGCCC AATTGTCATC 
GGCCAATCCG CAGAATTCGA CTACTCGGGC GCGCAAGCCT GCAAAGCGCT CAAAGCCGAA
GGGTACGAAG TCGTCCTCGC CAATTCGAAT CCGGCGACGA TCATGACCGA TCCCGAAATG
GCCGACCGGA CTTATATCGA GCCGCTCACG CCGGAATTTC TGGAAGAGAT CATCCGCATT
GAAGCGGCAA TGATGCCCGC CGGAGCAGGG AAGTTCGCGC TGCTACCAAC CGTCGGCGGA
CAGACCGCAC TAAACTTGGC CGTAGATCTG GCCGATAGTG GTGTCCTCGA CAAGTACAGC
GTCATCCTCA TAGGCGCACA GTTGGGCGCA ATTAAGAAGG CCGAAGACCG TTTGTTGTTC
AAAGATGCCA TGGCCAAGAT CGGCCTCGAT GTGCCACGGT CAGCGCTCAT CAACAACTTA
AAGGACGGCC TCGAGTTCAG CGGCAAGATC GGATTCCCGC TGGTGCTTCG GCCTTCGTTC
ACGCTCGGCG GCAGCGGCGG CGGCATCGCC TATAACCGCG AAGAGTTGAT GGACCTGCTG
GGGAAGGGAC TCGACCTCTC GCCGGTACAT GAAGTGTTAC TTGAAGAGTC GGTACTCGGC
TGGAAAGAGT ACGAACTCGA GTTGATGCGC GACCTCGCCG ACAACGTCAT CGTCATTTGT
TCGATCGAGA ACTTCGATCC CATGGGCGTG CACACCGGCG ATTCGATCAC CGTCGCACCC
GCGCAAACGC TGAGCGATCG CGAATACCAG ATCATGCGCG ATGCGGCGAT CAAGGTCATT
CGCGAGATTG GCGTCGAAAC CGGTGGCTCG AACATCCAGT TCGCGACCAA TCCTGAAAAC
GGTCGCATGA TCGTCATCGA GATGAACCCG CGCGTGTCGC GGTCGTCAGC CCTGGCGTCG
AAAGCTACCG GCTTCCCAAT CGCCAAGATC GCGGCGCGCC TCGCAGTTGG TTACACGCTC
GATGAGATCA CCAACGACAT CACCCGCAAA ACGCCAGCCT GCTTCGAGCC GACGCTCGAT
TACGTCGTCG TCAAAATTCC CAAGTGGCAG TTCGAGAAGT TCCCAGGCGC CGATGCATCC
CTCGGTCCGC AGATGAAGTC TGTCGGCGAA GCGATGGCGA TTGGCCGAAC TTTCAAGGAA
GCCTTGATGA AGGGCATCCG CTCGCTCGAA ACAGGGAAGA GCATCGCGTC GGAGAAGGTC
GAAGAACGGA TCATCACCAA GCGCCTGGTC ACGGCACATC CCGAACGCCT GCAATACGTC
CGCCACGCGC TGCTGCATGG ATGGTCGGTG GAAAAAGTCC ATTCTCTGAC GAAGATTGAC
CCGTGGTTCC TGTATCAGCT GAAGGAAATC GCGCAGGCCC ACGCGCATAC CGAGCAGCAC
ACCATGGACG AGGTCGGCCC CGACGAGTTG CGCGACCTGA AACGCATGGG CTTCAGCGAC
GAACGTCTCG CACATTTGTG GAAAGCGAAA AATGGCAAGG GCGCCTCGCG ACTGGTTTAC
GAGAAACGGC ACGCCAGCGG GATTCGCCCG GTATACAAGC GCGTGGACAC ATGCGCTGCC
GAGTTTGAAA GTTTCACACC ATATCTCTAT TCGACGTACG AGGAAGAAGA CGAAGCCGCG
CCGACCGACA AGAAGAAGGT CATCATCCTG GGCAGCGGAC CGAACCGCAT CGGGCAGGGA
ATTGAGTTCG ATTACTGCTG CTGCCACGCT GCCTTTGCGT TGCGCGACGA CGGTTACGAG
ACGATCATGG TCAACTGCAA TCCGGAGACC GTCTCCACCG ACTACGACAC CAGCGATCGC
CTCTACTTCG AACCCCTCAC GTACGAAGAT GTGATGGAGA TCTACGAGCA CGAAGCGTCA
GGCGGCGCGC CCATCGGCGT AATCGTGCAA TTCGGCGGAC AAACACCGCT GAATCTCGCG
CTGCCGCTGA AGGCCTCAGG CGTTCCCGTC ATCGGAACCT CGCCGGAGTC CATCGACCTC
GCCGAGGATC GCAAGCGCTT CAACAAGCTG CTCGAAGAAC TCGATATCCC GCAGCCTCCC
GGATCGACCG CGACGTCACT CGAAGAAGCC GTAGCGAACG CCAACAAGAT CGGCTACCCG
GTCCTCGTGC GTCCTTCCTA CGTGCTAGGC GGACGCGCCA TGATGATCTG CTACGAGCAG
GAAGAGGTCG AGCGCTACAT GCGCCAGGCC GTCGAGTACT CGCAGGAGCG CCCGGTGCTG
ATCGACCACT TCCTCGAGGA AGCGACCGAA GTCGACGTCG ACTGCCTGTC TGACGGCGAA
GACTGCGTCA TCGGCGGCAT CATGCAGCAC ATCGAAGAGG CCGGTATTCA CTCTGGCGAT
TCGTCGTGCG TGCTGCCTTC GGTTGACCTA TCGGAGCAGG TGCTGAAGAC GATCCGCGAA
TACACGTTCA AACTGGCGCG TGCGCTCAAG GTCATCGGCC TGATGAACGT TCAGTACGCC
ATCCAACGTG AAAAGGTCTA TGTCATCGAA GTAAATCCGC GTGCCTCACG CACCGTGCCC
TACGTTTCGA AAGCCACCGG CGTGCCGATG GCGAAGATCG CGGCACGACT GATGACCGGG
CGCAAGCTGC GCGAATTCCT GCCGCAGAAT ATCGAGCAGG GGGCAGACCT CGCGACAGGG
AATTGCTACT ACGTGAAATC GCCGGTGTTC CCGTGGGGCA AGTTCCCCGG CGTCGACACC
GTTCTTGGCC CGGAGATGAA ATCCACCGGC GAAGTGATGG GTGTAGCCGA CAATTTCGGC
GAGGCCTTCG CCAAGGCACA ACTCGCCGCC GGACAAAAGC TGCCGACCAA GGGTACGGTC
TTCATCAGCC TGAACAAGCG CGACAAGCAA CACGCAGCAG CGCTGGCAAA GAAATTCGTG
GATCTGGGCT TCAAGATCGT CGCTACCCAC GGAACCGCCG ACGAGATGGA AGACGGCGGC
ATCGAGGTCG AGCGCGTCTT CAAGGTGAAA GAAGGCCGTC CCAACGTAGT GGACCTGATC
AAGGGCGACC GCATTCAAAT GATCATCAAC ACGCCGCAGG GCGCCGAGCC ATGGTTCGAC
GAGAAGGCGA TCCGACGGGC CGCGATCACC GCGCGCATCC CGACCATCAC CACGCTCTCG
GCCGCACGCG CGGCGGTCGA AGGCATTGCG GCCCTTCAGC GTGGCAAGAC GACGGTCTAC
GCGCTTCAGG AATTGCACCG AGAGCGGCGT CAGGGCATGC CCGGCCAGCA AGTAAATGGA
CTCCGCTGA
 
Protein sequence
MPRRNDISKI LIIGSGPIVI GQSAEFDYSG AQACKALKAE GYEVVLANSN PATIMTDPEM 
ADRTYIEPLT PEFLEEIIRI EAAMMPAGAG KFALLPTVGG QTALNLAVDL ADSGVLDKYS
VILIGAQLGA IKKAEDRLLF KDAMAKIGLD VPRSALINNL KDGLEFSGKI GFPLVLRPSF
TLGGSGGGIA YNREELMDLL GKGLDLSPVH EVLLEESVLG WKEYELELMR DLADNVIVIC
SIENFDPMGV HTGDSITVAP AQTLSDREYQ IMRDAAIKVI REIGVETGGS NIQFATNPEN
GRMIVIEMNP RVSRSSALAS KATGFPIAKI AARLAVGYTL DEITNDITRK TPACFEPTLD
YVVVKIPKWQ FEKFPGADAS LGPQMKSVGE AMAIGRTFKE ALMKGIRSLE TGKSIASEKV
EERIITKRLV TAHPERLQYV RHALLHGWSV EKVHSLTKID PWFLYQLKEI AQAHAHTEQH
TMDEVGPDEL RDLKRMGFSD ERLAHLWKAK NGKGASRLVY EKRHASGIRP VYKRVDTCAA
EFESFTPYLY STYEEEDEAA PTDKKKVIIL GSGPNRIGQG IEFDYCCCHA AFALRDDGYE
TIMVNCNPET VSTDYDTSDR LYFEPLTYED VMEIYEHEAS GGAPIGVIVQ FGGQTPLNLA
LPLKASGVPV IGTSPESIDL AEDRKRFNKL LEELDIPQPP GSTATSLEEA VANANKIGYP
VLVRPSYVLG GRAMMICYEQ EEVERYMRQA VEYSQERPVL IDHFLEEATE VDVDCLSDGE
DCVIGGIMQH IEEAGIHSGD SSCVLPSVDL SEQVLKTIRE YTFKLARALK VIGLMNVQYA
IQREKVYVIE VNPRASRTVP YVSKATGVPM AKIAARLMTG RKLREFLPQN IEQGADLATG
NCYYVKSPVF PWGKFPGVDT VLGPEMKSTG EVMGVADNFG EAFAKAQLAA GQKLPTKGTV
FISLNKRDKQ HAAALAKKFV DLGFKIVATH GTADEMEDGG IEVERVFKVK EGRPNVVDLI
KGDRIQMIIN TPQGAEPWFD EKAIRRAAIT ARIPTITTLS AARAAVEGIA ALQRGKTTVY
ALQELHRERR QGMPGQQVNG LR