Gene Ndas_3165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3165 
Symbol 
ID9247022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3784696 
End bp3786150 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content70% 
IMG OID 
Productglutamate synthase, NADH/NADPH, small subunit 
Protein accessionYP_003681079 
Protein GI297562105 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00040465 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.173868 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC CCAAGGGCTT CCTGAAGACC ACCGAGCGAG AGCTCCCCAA GCACCGACCC 
GTCGACGTGC GCATCCAGGA CTGGCGCGAG GTCTACGAGG ACTTCGACCG GGGAACCGTC
ACCAAGCAGG CATCACGCTG CATGGACTGC GGCATCCCCT TCTGCCACAA CGGCTGCCCC
CTCGGCAACC TCATCCCCGA GTGGAACCAC CTCGTCCACA CCCACGACTG GGCCGAGGCG
ATCGAACGCC TGCACGCCAC CAACAACTTC CCGGAGTTCA CCGGCAGGCT CTGCCCCGCC
CCGTGCGAAT CCGCGTGCGT GCTCGGCATC AACCAGCCCG CCGTCACCAT CAAGAACGTC
GAGGTCTCCA TCATCGACCG CGCGTGGGAG GAGGGCTGGG TCAAGCCCCT GCCCCCCACC
ACCCGCACCG GCAAGAAGGT CGCCGTCGTC GGCTCCGGCC CCGCCGGACT CGCCGCCGCC
CAGCAGCTCA CCCGCGCCGG ACACGACGTC ACCGTCTACG AACGCGCCGA CCGCATCGGC
GGCCTCCTGC GCTACGGCAT CCCCGAGTTC AAGATGGAGA AGCGGCACAT CGACCGCCGC
CTCGCCCAGA TGAGCGCCGA GGGCACCACC TTCCGCGCCG GAGTCGACGT CGGCACCGAC
ATCACCGCCG ACCAGCTGCG CGCCGACCAC GACGCCGTCG TCATCAGCGG CGGCGCCACC
GCCTGGCGCG ACCTGCCCGC CAAGGGCCGC GAACTCGCCG GTATCCACCA GGCCATGGAG
TACCTGCCCC TGGCCAACCG GGTCCAGGAG GGCGACTACG ACACGCCCGC CATCAGCGCC
AAGGGCAAGC ACGTCGTCGT CATCGGCGGC GGCGACACCG GCGCCGACTG CGTCGGCACC
GCCCACCGCC AGGGCGCCGC CTCGGTCACT CAGCTGGAGA TCATGCCCAA GCCGCCCGCC
ACGCGGCCCG ACAACCAGCC CTGGCCGACC ATGCCCATGC TCTACAAGGT CACCAGCGCC
CACGAGGAGG GCGGCAAGCG GATCTACTCC GTCAACACCG TCGAGTTCCT CGGCGACGAC
AACGGCCAGG TCCGCGCCCT CAAGCTCGTC GAGGTCAAGC GCACCGACAA GGGCTTCGAA
CCCGTCCAGG GCACCGAGCG CGAGATCCCC GCCGAACTCG TCACGCTCGC CATGGGCTTC
GTCGGACCCC AGAAGGAGGG CCTGCTCGAC CAGCTCGGCG TCGAACTCGA CGGACGCGGC
AACGTCGTCC GCGACACCGA CTACCGCACC ACCGTCGACG GCGTCTTCTG CGCCGGAGAC
ATGGGCCGCG GCCAGTCGCT CATCGTCTGG GCCATCGCCG AGGGCCGCTC CGCCGCCGCC
GGAGTCGACC GCTACCTCAC CGACGACAGC GCACTGCCGG TCACCATCCC GCCGACCGCA
CGACCGCTCG TCTAG
 
Protein sequence
MADPKGFLKT TERELPKHRP VDVRIQDWRE VYEDFDRGTV TKQASRCMDC GIPFCHNGCP 
LGNLIPEWNH LVHTHDWAEA IERLHATNNF PEFTGRLCPA PCESACVLGI NQPAVTIKNV
EVSIIDRAWE EGWVKPLPPT TRTGKKVAVV GSGPAGLAAA QQLTRAGHDV TVYERADRIG
GLLRYGIPEF KMEKRHIDRR LAQMSAEGTT FRAGVDVGTD ITADQLRADH DAVVISGGAT
AWRDLPAKGR ELAGIHQAME YLPLANRVQE GDYDTPAISA KGKHVVVIGG GDTGADCVGT
AHRQGAASVT QLEIMPKPPA TRPDNQPWPT MPMLYKVTSA HEEGGKRIYS VNTVEFLGDD
NGQVRALKLV EVKRTDKGFE PVQGTEREIP AELVTLAMGF VGPQKEGLLD QLGVELDGRG
NVVRDTDYRT TVDGVFCAGD MGRGQSLIVW AIAEGRSAAA GVDRYLTDDS ALPVTIPPTA
RPLV