Gene Ndas_4213 details


Gene Information

Locus tag: Ndas_4213
Symbol:
ID: 9248087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5030581
End bp: 5031741
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: arginine biosynthesis bifunctional protein ArgJ
Protein accession: YP_003682111
Protein GI: 297563137
COG category:
COG ID:
TIGRFAM ID:
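
The listed gene and protein lengths follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (not stated in the record, but consistent with the listed 1161 bp) and that the CDS ends in a single stop codon; the function names are hypothetical:

```python
# Coordinates copied from the Gene Information record.
start_bp = 5030581
end_bp = 5031741


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1161 386
```

Both results agree with the Gene Length and Protein Length fields.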


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAGCC AGACCCCCGT GCCCGCAGGT TTCCGCGGCG TGGCCGGAAA CATCGGGATC 
AAGGACCCCA ACGACGACTT CTTCGCGGTG GTGTCGGAGG TCCCCGCCCG CGTGTCCGCC
GTGTTCACCC GGTCGCGGTT CGCCGGACCC AGCGTGCTGC TCAGCCGCCG TTCGGCGGCC
GACGGCAGCG CGCGGGGGGT CGCGGTGATC GCCCGCAACG CCAACGTGGC CACGGGCAGG
GTCGGCGACA GCCACGCACA CGAACTCCAG GAGCGCGTGG CTCGGGCCGC GGGGGTGCCG
CCCGAGGAGG TGCTGGTCGC CTCCACCGGC GTGATCGGCC GCCCGTACCC GATCGAGAAG
GTGCGCGCCT ACCTGGACGC GCTGCCCGAG GAGTTCCCCC CGGCGGACCT GGACCGCTCG
GCCGCGGCGA TGATGACCAC CGACACCCGG CCCAAGACCG CCTCGGCGAC GGTGGGCGGG
GCGACGCTGA CGGGCATCGC CAAGGGAGTC GGCATGATCG AGCCGAACAT GGCGACCATG
CTGGCCTGGT TCTTCACCGA CGCCGAACTG GAGCAGCCGG TGCTGGACGA GGTGTTCCGT
CGGGTGGTGG ACCGCACGTT CAACGCGCTG AGCATCGACA CCGACACCTC CACCAGCGAC
TCGGCGGCGG TGTTCGCCAA CGGGCTGGCC GGGCCGGTGG ACGTGGCGGA GTTCGAGGCG
GCGCTGCATG AGGTGGCCCT GAAACTGGTG CGGATGATCG CCTCCGACGG CGAGGGCGCC
AGCAAGCTGA TCGAGGTGCG CGTGACCGGC GCGCGCGACG ACGCCCAGGC CAAGCGGGTG
GCCAAGGCCG TGGTGAACTC CCCGCTGGTG AAGACCGCCG TGCACGGCGC GGACCCCAAC
TGGGGCCGGG TGACGATGGC GGTCGGCAAG TGCGAGGAGG AGACCGACAT CCTCCCCGAC
AACGTGCGGA TCTCCTTCGG TGACGTGGAG ACCTACCCGG CGGAGGCCAC GGACGAGGTG
CTGGAGCGCG CCGCCCAGCA CATGAAGGGC GACGAGGTGG TGATCGGCGT CGGCCTGGGC
ATCGCCGACG GCGCCTTCAC GGTCTACGGG TGCGACCTGA CCGAGGGGTA CATCCGGATC
AACGCCGACT ACACGACCTG A
 
Protein sequence
MSSQTPVPAG FRGVAGNIGI KDPNDDFFAV VSEVPARVSA VFTRSRFAGP SVLLSRRSAA 
DGSARGVAVI ARNANVATGR VGDSHAHELQ ERVARAAGVP PEEVLVASTG VIGRPYPIEK
VRAYLDALPE EFPPADLDRS AAAMMTTDTR PKTASATVGG ATLTGIAKGV GMIEPNMATM
LAWFFTDAEL EQPVLDEVFR RVVDRTFNAL SIDTDTSTSD SAAVFANGLA GPVDVAEFEA
ALHEVALKLV RMIASDGEGA SKLIEVRVTG ARDDAQAKRV AKAVVNSPLV KTAVHGADPN
WGRVTMAVGK CEEETDILPD NVRISFGDVE TYPAEATDEV LERAAQHMKG DEVVIGVGLG
IADGAFTVYG CDLTEGYIRI NADYTT
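
The gene and protein sequences can be checked against each other by translating the nucleotide blocks. The sketch below is an illustration only, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which are not handled here; this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGCAGCC AGACCCCCGT GCCCGCAGGT"
print(translate(first_blocks))  # MSSQTPVPAG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.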