Gene Ndas_3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3102 
Symbol 
ID9246958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3713970 
End bp3717272 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content71% 
IMG OID 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_003681017 
Protein GI297562043 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCC GTAGTGACCT TTCATCCGTC CTCGTGATCG GCTCCGGCCC GATCGTGATC 
GGGCAGGCGG CCGAGTTCGA CTACTCGGGC ACCCAGGCCT GCCGGGTTCT GCGGGCCGAG
GGCCTGCGGG TCATCCTGGT CAACTCCAAC CCGGCCACGA TCATGACCGA CCCGGAGATC
GCCGACGCCA CCTACGTGGA GCCGATCACC ACCGAGATGA TCGAGAAGAT CATCGCCAAG
GAGCGCCCCG ACGCGCTCCT GCCCACCCTG GGCGGGCAGA CCGCCCTCAA CGCCGCCGTC
GCCCTGGACG AGGCGGGCAT CCTCGCCAAG TACGGCGTCG AGCTCATCGG CGCCAACATC
GAGGCCATCC AGTCCGGTGA GGACCGCGAG AGCTTCAAGG GCATCGTCGA GCGCATCGGC
GGCGAGTCCG CCCGCTCGCG CATCTGCCAC ACCCTGGAGG AGTGCCTGGC CGCGGCCGAG
GAGCTCTCCT ACCCCGTCGT CGTGCGCCCC TCCTTCACCA TGGGCGGCGC GGGCTCGGGC
TTCGCCCACG ACGAGTCCGA GCTGCGCCGC ATCGCCGGGC AGGGCCTGGC GCTCTCCCCG
ACCACCGAGG TGCTCCTGGA GGAGTCCATC CTCGGCTGGA AGGAGTACGA GCTGGAGCTG
ATGCGCGACA CCAACGACAA CGTCGTGGTC GTGTGCTCCA TCGAGAACCT CGACCCCATG
GGCGTGCACA CCGGCGACTC CATCACCGTC GCCCCCGCCA TGACCCTCAC CGACCGCGAG
TACCAGAAGC TGCGCGACAT CGGCATCGCC GTCATCCGCG AGGTCGGCGT GGACACCGGC
GGCTGCAACA TCCAGTTCGC CGTCCACCCC ACCACCGGCC GGGTCATCGT CATCGAGATG
AACCCGCGCG TGTCGCGCTC CTCGGCGCTG GCCTCCAAGG CCACCGGCTT CCCGATCGCC
AAGATCGCCG CCAAGCTCGC CGTCGGCTAC ACCCTGGACG AGATCCCCAA CGACATCACC
CGGGAGACCC CGGCCAGCTT CGAGCCCACG CTCGACTACG TCGTCGTCAA GGTGCCCCGC
TTCGCCTTCG AGAAGTTCCC CGGCGCCGAC CCCGGCCTGA CCACCACCAT GAAGTCGGTG
GGCGAGGCCA TGGCCATCGG CCGCTCCTTC CCCGAGGCCC TCCAGAAGGC CATGCGCTCC
CTGGAGAAGA AGGGCGTCGG CCTGACCTGG GAGGGCGAGC CCGGCGACAA GGACGAGCTG
CTGCGCCTGG CCGCCACCCC CACCGAGCAC CGCCTGCGCC AGGTCCAGCA GGCCCTGCGC
GCCGGGGCCA CCGTCGAGGA GGTCCACGAG GCCACCCGCA TCGACCCCTG GTTCGTGGAC
CAGATGCTGC GCCTGGAGGA GGCGGCCCGC GTCCTGGCCG ACGCCCCCAA GCTCGACGCC
GACCTGCTGC GCCAGGTCAA GGGGCTGGGC TTCTCCGACC TCCAGATCGG CCAGATCACC
GGCCGCAGCG AGGACGTGGT CCGCGAGCTG CGCCACGCCC TGGGCGTCCA CCCCGTCTAC
CTGACCGTGG ACACCTGCGC CGCCGAGTTC GAGGCCAGCA CGCCCTACCT GTACTCCAGC
TACGACGAGG AGACCGAGGT CCCCACGGGC GACCGCCCCA AGATCATCAT CCTGGGCTCG
GGACCCAACC GGATCGGCCA GGGCGTGGAG TTCGACTACT CCTGCGTCCA CGCCTCCTTC
GCCCTCTCCG ACGCCGGGTA CGAGACCGTG ATGGTCAACT GCAACCCCGA GACCGTCTCC
ACCGACTACG ACACCAGCGA CCGGCTCTAC TTCGAGCCGC TCACCCTGGA GGACGTGCTG
GAGGTCGTGC GCGCCGAGCA GGCCGCCGGA CCGGTCGTCG GCGTCATCGT CCAGCTGGGC
GGCCAGACCC CGCTCGGCCT GGCCCGACGC CTCAAGGACG CCGGGGTGCC CATCATCGGC
ACCAGCCCCG AGGCCATCGA CCTGGCCGAG GACCGCGGCG AGTTCGGCAA GGTCCTGGCC
GACGCCGGGC TGCCCGCGCC CAAGTACGGC ACCGCCTACT CCTTCGCCGA GGCCAAGGCC
GCCGCGGACG AGATCGGCTA CCCCGTCATG GTCCGCCCCT CCTACGTGCT GGGCGGCCGC
GGCATGGAGA TCGTCTACAG CGAGGCGATG CTCGCCGACT ACATCGAGCG CAACGCCGAG
GTCAGCCCCG AGCACCCGGT GCTCATCGAC CGCTTCCTCG ACGACGCCAT CGAGATCGAC
GTCGACGCCC TCTACGACGG CCGCGACCTG TACCTGGGCG GGGTCATGGA GCACATCGAG
GAGGCCGGGA TCCACTCCGG CGACTCGGCG TGCACCCTGC CCTCCATCAC CCTGGGCCGC
GAGGACATCG AGCGCATCCG CTACTCCACC GAGGCCATCG CCCGCGGCAC GGGCGTGCGC
GGCCTGATCA ACGTCCAGTA CGCGCTCGCC TCCGGTGTGC TCAACGTCCT GGAGGCCAAC
CCGCGCGCCT CGCGCACCGT GCCGTTCGTG TCCAAGGCCA CCGCGGTGCC GCTGGCCAAG
GCCGCCGCCC GCGTCATGGC CGGGGCCACC ATCGCCGAAC TGCGCGAGGA GGGCATGCTG
CCCGCCCGCG GCGACGGCGG CGACCTGCCC GTGGACGCCC CGGTGTCGGT GAAGGAGGCC
GTACTGCCCT TCAACCGCTT CATCGACAAG GAGGGCGAGG GGGTCGACAC CATCCTCGGC
CCCGAGATGC GCTCCACCGG CGAGGTCATG GGCCTGGACA CCGAGTTCGG CGCCGCCTAC
GCCAAGTCGC AGCTGGCCCT CAACGACTCC CTGCCCGAGC GGGGCCGGGT GTTCGTCTCG
GTGGCCAACC GCGACAAGCG CTCGATGATC TTCCCGGTCA AGCGCCTGGC CGACCTCGGC
TTCGAGATCC TGGCCACCGA GGGCACCGCC GTGGTCCTGC GCCGCAACGG CGTCCACGCC
ACCGTGGTGC GCAAGCACAG CGAGGGCAGC GGCCCGGCGG GCGAGCCCAC CATCGTGCAG
CTCATCCACT CCGGGGGCGT CGACCTCATC GTCAACACGC CCTTCGGCAG CGCCGGGCAG
GCCGGACCGC GTCTGGACGG CTACGAGATC CGCACCGCGG CCGTGGTGCG CGGCGTGCCC
AGCGTGACCA CCGTGCAGGG CCTGGCCGCC GCGGTCCAGG CCATCGAGGC GCGGGTGCGC
GGGGACCTCG GCGTGCGCTC CCTCCAGGAG CACGCGAGCA CCCTCACCGC CAGCCGGGCC
TAG
 
Protein sequence
MPRRSDLSSV LVIGSGPIVI GQAAEFDYSG TQACRVLRAE GLRVILVNSN PATIMTDPEI 
ADATYVEPIT TEMIEKIIAK ERPDALLPTL GGQTALNAAV ALDEAGILAK YGVELIGANI
EAIQSGEDRE SFKGIVERIG GESARSRICH TLEECLAAAE ELSYPVVVRP SFTMGGAGSG
FAHDESELRR IAGQGLALSP TTEVLLEESI LGWKEYELEL MRDTNDNVVV VCSIENLDPM
GVHTGDSITV APAMTLTDRE YQKLRDIGIA VIREVGVDTG GCNIQFAVHP TTGRVIVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDEIPNDIT RETPASFEPT LDYVVVKVPR
FAFEKFPGAD PGLTTTMKSV GEAMAIGRSF PEALQKAMRS LEKKGVGLTW EGEPGDKDEL
LRLAATPTEH RLRQVQQALR AGATVEEVHE ATRIDPWFVD QMLRLEEAAR VLADAPKLDA
DLLRQVKGLG FSDLQIGQIT GRSEDVVREL RHALGVHPVY LTVDTCAAEF EASTPYLYSS
YDEETEVPTG DRPKIIILGS GPNRIGQGVE FDYSCVHASF ALSDAGYETV MVNCNPETVS
TDYDTSDRLY FEPLTLEDVL EVVRAEQAAG PVVGVIVQLG GQTPLGLARR LKDAGVPIIG
TSPEAIDLAE DRGEFGKVLA DAGLPAPKYG TAYSFAEAKA AADEIGYPVM VRPSYVLGGR
GMEIVYSEAM LADYIERNAE VSPEHPVLID RFLDDAIEID VDALYDGRDL YLGGVMEHIE
EAGIHSGDSA CTLPSITLGR EDIERIRYST EAIARGTGVR GLINVQYALA SGVLNVLEAN
PRASRTVPFV SKATAVPLAK AAARVMAGAT IAELREEGML PARGDGGDLP VDAPVSVKEA
VLPFNRFIDK EGEGVDTILG PEMRSTGEVM GLDTEFGAAY AKSQLALNDS LPERGRVFVS
VANRDKRSMI FPVKRLADLG FEILATEGTA VVLRRNGVHA TVVRKHSEGS GPAGEPTIVQ
LIHSGGVDLI VNTPFGSAGQ AGPRLDGYEI RTAAVVRGVP SVTTVQGLAA AVQAIEARVR
GDLGVRSLQE HASTLTASRA