Gene Daud_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1302 
Symbol 
ID6026290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1377747 
End bp1380995 
Gene Length3249 bp 
Protein Length1082 aa 
Translation table11 
GC content65% 
IMG OID641594119 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001717445 
Protein GI169831463 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAAAG ACAAAGCGCT AAAGAAAGTA ATGGTCATCG GTTCCGGTCC GATCATCATC 
GGCCAGGCGG CCGAGTTCGA CTATGCCGGG ACCCAGGCCT GCCGCGCCCT CCGTGAGGAG
GGCCTCGAGG TGGTCCTCAT CAACTCCAAC CCGGCCACCA TCATGACCGA CGCCAACATG
GCTGACCGGA TTTATATTGA GCCGCTCACC CCGGAATTCG TGGCCAAGGT CATCAGCCAG
GAAAGGCCGG ACGCGTTGCT GCCCACCCTG GGCGGGCAGA CCGGTCTGAA CCTGGCCAAG
CAGGTGGCGG ACGCGGGCAT CCTGGACCAG TACGGGGTCC GACTTTTGGG CACGCCGCTG
GAATCCATCA AGCGCGCCGA GGACCGGGAA CACTTCAAGA ACATGTGCCT GGAGATCGGC
GAGCCGGTAC CCGAGAGCAG CATCATTTCC GACGTGGACG CCGCCGTGGC CTTCGCCCGG
AAGATCGGCT ACCCGGTGGT GGTCCGCCCG GCTTACACCT TGGGCGGCAC CGGGGGCGGG
GTCGCCTTTT CCGAGGATGA ACTGCGGGAG ATCGCCGTCC GCGGGCTGAC CATGAGCATC
ATTCACCAGG TGCTGGTGGA AAAATGCGTG CTCGGCTGGA AGGAGATCGA GTACGAGGTG
ATGCGCGACG CCGCCGGCAA CTGCATCACC ATCTGCAGCA TGGAGAACAT CGACCCGATG
GGCATCCACA CCGGGGACAG CATCGTGGTC GCCCCGACCC AGACCTTAAG CGACCGGGAG
CACCAGATGC TGCGTAGCGC CTCACTGAAG ATCATCCGGG CCCTCGGGGT GGAAGGCGGC
TGCAACGTCC AGTTCGCCCT GGACCCGGAA AGCTACGATT ACTACGTGAT TGAGGTGAAC
CCGCGCCTGT CGCGTTCCTC AGCCCTGGCG TCCAAGGCCA CCGGCTATCC CATCGCCAAG
GTGGCGACCA AGGTGGCCGT CGGCCTGACC CTGGACGAGA TCAAGAACGC GGTCACCGGC
AAGACCTACG CCTGCTTCGA GCCGGCGCTG GACTACGTGG TGGTGAAGTA CCCACGCTGG
CCGTTCGACA AGTTTTCCCT GGCCAACCGG AACCTCGGCA CCCAGATGAA GTCGACTGGC
GAGGTGATGG CCATCGGCCG TACCTTCGAG GAGGCGCTAC TGAAGGCGGT GCGCTCACTG
GAAACGGGTG TCACGGGCAT GAACCTGCCC GAACTCCGGG AGTGGGACAA CGACCGACTG
CGCGCCCGGA TGGCCAGACC GGACGACCTG CGCCTGTTTC TGGTGGCCGA GGCTCTGCGC
CGGGGCTTCC CCGTGAACGA GATTTTCGAG CTCACCCGGA TCGACCGCTT CTTCCTGGAC
AAGATTAAGA ACATCACCGA GGCCGAGGAG TTGGTGCGCG CGGCCCGACC CACTGACGTG
GGTACCCCGA CGGGCGTGGG TGCCCCGGCC GGGCTGACCC CCGAACTCCT GCGGCGCGTG
AAGCGGATCG GCCTCTCCGA CACCACCATC GCGGAGCTGG TGGGCACCAC CTCCCGGGAG
GTCCACCGCC TGCGCCGGGA GCTGGGGGTG GAACCGGTCT ACAAGATGGT GGATACCTGC
GCCGGAGAGT TCGAGGCCAC CACGCCCTAC TACTACTCCA CGTACGAGGA CGAGGACGAG
GCCGAGCCCC AAGCGGTACG CAAGGTGGTC GTTTTAGGCT CGGGCCCGAT TCGGATCGGG
CAGGGGATCG AGTTCGACTA CTGCTCGGTG CACTCGGTAT GGGCCTTGAA GGAGCAGGGC
GTCAAGGCGA TCATCATCAA CAACAACCCG GAGACGGTCA GCACTGACTT CGACACCGCC
GACCGGCTGT ACTTTGAGCC ACTGGTCCCG GAGGACGTGA TGAACATCCT GCACAAAGAA
AAACCCGACG GGGTGATCGT GCAGTTCGGC GGGCAGACGG CGATCAACCT GGCCCGTCCG
GTGGAAAAGG CCGGCTTCAA CATCTTGGGC ACCTCGGTGG CCGACATCGA CCGCGCCGAG
GACCGGGAGC GCTTCGACCA GCTCGTGGCT GAACTCGGCA TCCCGCGGCC GCCCGGGGGG
ACCGGCTTTT CGGTCGAGGA AGCACAGCGG ATCGCGGAGC AGGTGGGCTT CCCGGTCCTG
GTCCGGCCGT CCTACGTCCT GGGCGGAAGG GCGATGGAGA TCGTGTATAA CTCCCAGGAG
CTTTTAGAGT ACATGGCAGA CGCGGTCCGG GTGACTCCGA AGCATCCGGT GCTGGTCGAC
AAGTACCTTT TGGGCAAGGA GTTGGAGGTC GACGCCGTCT GCGACGGCGA AACCGTTCTA
GTTCCGGGCA TCATGGAGCA CGTGGAACGG GCCGGGATCC ACTCCGGGGA CAGCATCGCG
GTGTTCCCGC CGCAGACCCT GACCCCGGAG ATCAAGGAGC AGCTTTTCGA GTACACACAG
CAAATCGCCC GGGCGCTCAA GATCCGGGGC CTGGTGAACA TCCAGTTCGT CCTGCACGAA
GGGCGGGTGT TCGTCCTGGA GGTGAACCCG CGCTCCAGCC GCACCGTGCC CTACCTGAGC
AAGGTCACCG GCATCCCGAT GGTGAACCTG GCGACCAGGA TCTGCCTGGG GGCCACGCTG
CCGGAATTGG GGTACCGGGG CGGGCTGTAC GAGGAGACGC GGAACATCGC GGTGAAGGCG
CCGGTCTTCT CCTTCGCGAA ACTCTTGGAC GTGGACGTGT GCCTGGGTCC CGAAATGAAG
TCCACCGGGG AAGTGATGGG CGTGTCGAAG GACTATGCCC TGGCGCTCTA CAAGGCCTGC
CTCTCGGCCG GCTACACGCT GCCCTCGAGC GGCAAGGCGG TGGTGACCAT CGCCGACCGG
GACAAGGACG AGGCCCTGCC GCTGGTCCGG AGCTTGGTGA ACCTGGGGTT TGAGATCGTG
GCCACCGAGG GCACGGCGGC GTTCCTGCGC AGCCGGGCGA TCACCGTGGA GGTGGCCCGC
AAAGTGCACG AGGGCTCCCC GAACATCGTG GACCTAATCC GCGAAAACCG GATCCACCTA
GTGGTGAACA CCCTGACCAA GGGCAAGCTG ACCACCCGCG ACGGTTTCCG GATCCGGCGG
GCCGCGGTGG AGATGGGTGT GCCCTGCCTG ACGTCGCTGG ATACGGCCCG GGTGGTCATC
GAAGTGATGC GGGCGCGCCA GCGGGGCGAA ACGATGCCGC TGATCCCGCT CCAGGAGTAC
GTGTCCTGA
 
Protein sequence
MPKDKALKKV MVIGSGPIII GQAAEFDYAG TQACRALREE GLEVVLINSN PATIMTDANM 
ADRIYIEPLT PEFVAKVISQ ERPDALLPTL GGQTGLNLAK QVADAGILDQ YGVRLLGTPL
ESIKRAEDRE HFKNMCLEIG EPVPESSIIS DVDAAVAFAR KIGYPVVVRP AYTLGGTGGG
VAFSEDELRE IAVRGLTMSI IHQVLVEKCV LGWKEIEYEV MRDAAGNCIT ICSMENIDPM
GIHTGDSIVV APTQTLSDRE HQMLRSASLK IIRALGVEGG CNVQFALDPE SYDYYVIEVN
PRLSRSSALA SKATGYPIAK VATKVAVGLT LDEIKNAVTG KTYACFEPAL DYVVVKYPRW
PFDKFSLANR NLGTQMKSTG EVMAIGRTFE EALLKAVRSL ETGVTGMNLP ELREWDNDRL
RARMARPDDL RLFLVAEALR RGFPVNEIFE LTRIDRFFLD KIKNITEAEE LVRAARPTDV
GTPTGVGAPA GLTPELLRRV KRIGLSDTTI AELVGTTSRE VHRLRRELGV EPVYKMVDTC
AGEFEATTPY YYSTYEDEDE AEPQAVRKVV VLGSGPIRIG QGIEFDYCSV HSVWALKEQG
VKAIIINNNP ETVSTDFDTA DRLYFEPLVP EDVMNILHKE KPDGVIVQFG GQTAINLARP
VEKAGFNILG TSVADIDRAE DRERFDQLVA ELGIPRPPGG TGFSVEEAQR IAEQVGFPVL
VRPSYVLGGR AMEIVYNSQE LLEYMADAVR VTPKHPVLVD KYLLGKELEV DAVCDGETVL
VPGIMEHVER AGIHSGDSIA VFPPQTLTPE IKEQLFEYTQ QIARALKIRG LVNIQFVLHE
GRVFVLEVNP RSSRTVPYLS KVTGIPMVNL ATRICLGATL PELGYRGGLY EETRNIAVKA
PVFSFAKLLD VDVCLGPEMK STGEVMGVSK DYALALYKAC LSAGYTLPSS GKAVVTIADR
DKDEALPLVR SLVNLGFEIV ATEGTAAFLR SRAITVEVAR KVHEGSPNIV DLIRENRIHL
VVNTLTKGKL TTRDGFRIRR AAVEMGVPCL TSLDTARVVI EVMRARQRGE TMPLIPLQEY
VS