Gene Ndas_3943 details


Gene Information

Locus tag: Ndas_3943
Symbol:
ID: 9247814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4714548
End bp: 4716317
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Carbamoyl-phosphate synthase L chain ATP-binding protein
Protein accession: YP_003681846
Protein GI: 297562872
COG category:
COG ID:
TIGRFAM ID:
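
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4714548
END_BP = 4716317


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1770 589"
```

Both results agree with the Gene Length (1770 bp) and Protein Length (589 aa) fields listed above.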


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.705932
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTAAAG TACTGATCGC CAACCGCGGC GAGATCGCCG TGCGCATCGC CCGCGCCTGC 
CGCGACGCCG GACTCGCCAG CGTCGCCGTC TACGCCGAGC CCGACCTGGA CGCCCTGCAC
GTCAAGGTCG CCGACGAGGC CCACTCCCTG GGCGGGAGCA CCCCGGCCGA CTCCTACCTG
GACATCGCCA AACTCCTGGC GGTGGCCGAG GCCTCCGGTG CGGACGCCGT CCACCCCGGC
TACGGGTTCC TGGCCGAGAA CGCCGACTTC GCCCAGGCCG TCATCGACGC CGGGCTGACC
TGGATCGGCC CGCCCCCCTC GGCGATCACC GCCCTGGGCG ACAAGGTCCA GGCCCGCCAC
ATCGCCCAGA AGGTCGGCGC CCCGCTGGTG GCGGGCACCG CCGACCCCGT GGAGTCGGCC
GAGGAGGTCG TGGCCTTCGC CGAGGAGCAC GGCCTGCCCA TCGCGATCAA GGCCGCCTTC
GGCGGCGGCG GGCGCGGCCT CAAGGTCGCC CACACCCTGG ACGAGGTCGC CGACGCCTAC
GAGTCCGCGG TGCGCGAGGC GGTCACCGCC TTCGGCCGGG GCGAGTGCTT CGTGGAGCGC
TACCTCGACC GGCCCCGGCA CGTGGAGACC CAGTGCCTGG CCGACACCCA CGGCAACGTC
GTGGTGGTCT CCACCCGCGA CTGCTCGCTC CAGCGGCGCC ACCAGAAGCT GGTGGAGGAG
GCGCCCGCGC CGTTCCTGTC CGCGGAGCAG ATGGAGCGGC TGCACAGCTC CTCCAAGGCC
ATCCTCGCCG AGGCCGGGTA CACCGGCGCG GGCACCTGCG AGTTCCTGGT GGGCGTGGAC
GGCACCATCT CCTTCCTGGA GGTCAACACC CGTCTCCAGG TCGAACACCC CGTGACCGAG
GAGGTCACCG GGATCGACCT GGTGCGGGAG ATGTTCCGCA TCGCCGACGG CGAGGAGCTG
GGCTACGGCG ACCCCGAGGT GCGGGGGCAC TCCTTCGAGT TCCGCATCAA CGCCGAGGAC
GCCGGGCGGG GCTTCATGCC CGCGCCGGGC ACCATCACCG AGCTGAACCT GCCCGGCGGC
CCGGGCGTGC GCGTGGACAC CGGCTGCGAG GCCGGTTTCA CGGTGCCCCA GGCCTTCGAC
TCGATGGTCG CCAAGCTCGT GGTGACGGGC CGCACCCGCC AGGAGGCGCT CCAGCGCTCG
CGGCGGGCCC TGGCCGAGTT CACGGTCGGC GGGATGCCCA CGGTGATCCC GTTCCACCAG
GCCGTGGTGA GCGACCCGGC CTTCGCACCG GCCGACCCCG AGCAGCCGTT CGGGGTGTAC
ACCCGGTGGA TCGAGACCGA GTTCGAGAAC ACCATCGAGC CGTGGTCGGG CACTCCGGGC
GAGCCCGTCG AGGCCGAGCG CGAGAGGGTC ACCGTCGAGG TGGGCGGCAA GCGTGTGGAG
GTGATCCTGC CCGCCGGGCT GGGCGCCTCG GCCGCCGCCC CCGCCGGTGG CGGTCAGAGC
CGCAGGAAGC GGACCGCGCG CAAGGGCGGC GGGAACACCG TGGCGGCGGG CGGCGACTCG
CTGGTCTCGC CGATGCAGGG CACCGTGGTC AAGCTGGTGG CCGAGGAGGG CCAGCAGGTC
GCCGAGGGCG ACACCGTGGT CGTCATCGAG GCGATGAAGA TGGAGCAGCC GCTCAACGCG
CACAAGGCCG GGACGGTGAC GGGCCTGAGG ATCGCCGCGG GCGAGACCGT GGGCAACGGC
GCGGTGGTCT GCGAGATCAA GGACGCCTAG
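
The GC content reported under Gene Information can be recomputed from the gene sequence above. This is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown here; `gc_content` is a hypothetical helper name, and whitespace is stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and newlines used to lay out the
    10-base blocks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


if __name__ == "__main__":
    # First two blocks of the gene sequence only; paste the full
    # sequence to compare against the GC content field.
    print(gc_content("GTGCGTAAAG TACTGATCGC"))  # prints 0.5
```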
 
Protein sequence
MRKVLIANRG EIAVRIARAC RDAGLASVAV YAEPDLDALH VKVADEAHSL GGSTPADSYL 
DIAKLLAVAE ASGADAVHPG YGFLAENADF AQAVIDAGLT WIGPPPSAIT ALGDKVQARH
IAQKVGAPLV AGTADPVESA EEVVAFAEEH GLPIAIKAAF GGGGRGLKVA HTLDEVADAY
ESAVREAVTA FGRGECFVER YLDRPRHVET QCLADTHGNV VVVSTRDCSL QRRHQKLVEE
APAPFLSAEQ MERLHSSSKA ILAEAGYTGA GTCEFLVGVD GTISFLEVNT RLQVEHPVTE
EVTGIDLVRE MFRIADGEEL GYGDPEVRGH SFEFRINAED AGRGFMPAPG TITELNLPGG
PGVRVDTGCE AGFTVPQAFD SMVAKLVVTG RTRQEALQRS RRALAEFTVG GMPTVIPFHQ
AVVSDPAFAP ADPEQPFGVY TRWIETEFEN TIEPWSGTPG EPVEAERERV TVEVGGKRVE
VILPAGLGAS AAAPAGGGQS RRKRTARKGG GNTVAAGGDS LVSPMQGTVV KLVAEEGQQV
AEGDTVVVIE AMKMEQPLNA HKAGTVTGLR IAAGETVGNG AVVCEIKDA
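
The protein sequence follows from the gene sequence under translation table 11, which uses the same codon-to-amino-acid assignments as the standard code but accepts alternative start codons such as GTG. The sketch below illustrates this under two simplifying assumptions: the input is a complete CDS in frame, and its first codon is a valid initiator, so it is always read as M. `translate_cds` and the table constants are illustrative names, not an existing library API.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator (e.g. ATG or GTG)
    and is therefore translated as methionine.
    """
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cleaned), 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if pos == 0:
            amino_acid = "M"  # initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence.
    print(translate_cds("GTGCGTAAAG TACTGATCGC CAACCGCGGC"))  # prints MRKVLIANRG
```

The output for the first 30 bases matches the first block of the protein sequence above (MRKVLIANRG).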