Gene Ndas_3229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3229 
Symbol 
ID9247086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3859257 
End bp3861356 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content78% 
IMG OID 
ProductCarbamoyl-phosphate synthase L chain ATP-binding protein 
Protein accessionYP_003681141 
Protein GI297562167 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0790093 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCACA CGCCCTCCGC CGTCCCCTCC GGTGGTTCCG CCGAGGCGGG CGCCACGGCC 
CCCGGCCGAC CCGCCCCGCC GCGCCCCGTC ACCCGCCTCC TGGTCGCCAA CCGGGGCGAG
ATCGCCCGCC GGATCCTGCG CACCTGCCGC TCCCTGGGCG TGGCCACCGT CGCCGTGCAC
GCCCCCGGCG AGGAGGACGC CGCGCACGTG GCCGAGGCCG ACACCGCCGT GCGCCTGCCC
CTGCCCCAGG GACGCGGCCC CGTGGACGCC TACCTCGACC CCGACGCCGT GGTGGCCGCC
GCCCGCGCCT CCGGCGCCGA CGCCGTCCAC CCCGGCTACG GCTTCCTGTC CGAGAACCCC
GCCCTGGCCC GCGCCGTCAC CGGCGCCGGG CTGACCTGGA TCGGCCCCCC GGCCGAGGCC
GTCGAGAGCA TGGGCGCCAA GACCCGGGCC AAACGCATCG CCCGGGACGC CGGGGTGCCC
GTCGCCGACG CCCTGGACCC CGAGCGGGTG CGCCCCGAAC ACCTGCCCGT GCTGGTCAAG
GCCGTCTCCG GCGGCGGCGG ACGCGGCATG CGCGTGGTCC GCGACCTGGC CGACCTGCCC
GCCGAGGCCG CCGCCGCCCG CGCCGAGGCG GCCTCGGCCT TCGGCGACCC GGCGGTGTTC
TGCGAGCCCT ACGTCCCCCA CGGCCGCCAC GTCGAGGTGC AGGTCCTGGC CGACACCCAC
GGCACCGTGT GGGCCCTGGG CGAACGCGAC TGCTCCCTCC AGCGCCGCCA CCAGAAGGTC
GTCGAGGAGA CGCCCGCCCC CGGCCTGCCC GACGACCTGC GCGAACGCCT GCACGAGGCC
GCCCGCCGCC TGGCCCGCGC CATCGGCTAC ACCGGCGCCG GAACGGCCGA GTTCCTCGTC
CCCGTGGGGG AGGAGGGCTT CGGCGAACCG GTCTTCCTGG AGATGAACAC CCGGCTCCAG
GTCGAACACC CCGTCACCGA GTGCGTCACC GGGCTGGACC TGGTCGAGTG GCAGATCCGC
GTCGCCGAGG GCGAGGCCCT GCCCGGGGGC GGCCCGCCCG CGCCGCGCGG CCACGCCGTC
CAGGCCCGCC TGTACGCCGA GGACCCCCGC GACGGCTGGC GGCCCCGCAC CGGCGTCCTG
CACGCCTTCG ACGTCCCCGC CGACACCCGC TTCGCCGCGC CGCCCGCCTT CGGGGTGCGC
CTGGACAGCG GCGTCGAACC CGGCGACACC GTCGGCGCGG ACTTCGACCC CCTGCTCGCC
AAGGTCGTCG CCCACGGGCG CGACCGCCGC GACGCGCTGC GCCGCTTGGC CGCGGCCCTG
GCCGCCGCGC GCGTGCACGG GGTGGGCACC AACCGCGACC TGCTCGTGCG GGCGCTGCGC
CACCCCGCCT TCGCCGGGGC CGAGACCCGC CCCGAGGGGC TGCACACCGG CTACCTGGAC
GGCGACCGCC TCAGCGCACT CGTCCAGCCC CTGGCCGACC CTGCCACCGA ACGCCTCGCC
GCGCTGGCCG CGTCCCTGTC CGCCGCCGAG GCCGCCCGCG CCGACGCCGC CACCCCCGCC
GGGATCCCCG CCAACTGGCG CGGCCTGCCC TCCCAGCCCT TCGTCCGCCA GCACGACGTC
GACGGCGACG AGACGAGGAG GACCACCACC GCCTACCGCA CCCTGAGGGG GTCCTACCTG
ACCGAACAGG CCGGTGTCCG CGTGCTCTCG GCCGCACCCG ACCGGGTGGT CCTGGAGGCG
GACGGCCTGC GCCGCGCCTT CGACGTCCAC CGCCCCGCCG CGGGCGGCGA CGTCCACGTC
GACTCCCCCC TGGGGTCGGT CACCCTCACC CCCGTCGACC CCCTGCCCGA ACCCGAGGCC
GCGGTCGACC CCGGCGCCCT GCCCGCGCCC ATGCCCGGCA CGGTCACCGC CGTGGAGGTC
GCCGTGGGCG AGCGGGTCGA ACCCGGGCAG ACCCTGCTGC GCATGGAAGC CATGAAGACG
GAGCACCGCG TCACCGCCCC CGCCGCCGGT GCCGTCCGGG AGATCCCGGT CGCGGCGGGG
CAGAGGGTGC CCGCCGGGGC GCCCCTCGCC GTCCTCGACT ACGAAGGGGC CACGCCGTGA
 
Protein sequence
MNHTPSAVPS GGSAEAGATA PGRPAPPRPV TRLLVANRGE IARRILRTCR SLGVATVAVH 
APGEEDAAHV AEADTAVRLP LPQGRGPVDA YLDPDAVVAA ARASGADAVH PGYGFLSENP
ALARAVTGAG LTWIGPPAEA VESMGAKTRA KRIARDAGVP VADALDPERV RPEHLPVLVK
AVSGGGGRGM RVVRDLADLP AEAAAARAEA ASAFGDPAVF CEPYVPHGRH VEVQVLADTH
GTVWALGERD CSLQRRHQKV VEETPAPGLP DDLRERLHEA ARRLARAIGY TGAGTAEFLV
PVGEEGFGEP VFLEMNTRLQ VEHPVTECVT GLDLVEWQIR VAEGEALPGG GPPAPRGHAV
QARLYAEDPR DGWRPRTGVL HAFDVPADTR FAAPPAFGVR LDSGVEPGDT VGADFDPLLA
KVVAHGRDRR DALRRLAAAL AAARVHGVGT NRDLLVRALR HPAFAGAETR PEGLHTGYLD
GDRLSALVQP LADPATERLA ALAASLSAAE AARADAATPA GIPANWRGLP SQPFVRQHDV
DGDETRRTTT AYRTLRGSYL TEQAGVRVLS AAPDRVVLEA DGLRRAFDVH RPAAGGDVHV
DSPLGSVTLT PVDPLPEPEA AVDPGALPAP MPGTVTAVEV AVGERVEPGQ TLLRMEAMKT
EHRVTAPAAG AVREIPVAAG QRVPAGAPLA VLDYEGATP