Gene Ndas_3446 details


Gene Information

Locus tag: Ndas_3446
Symbol:
ID: 9247314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4126849
End bp: 4128327
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: integrase family protein
Protein accession: YP_003681357
Protein GI: 297562383
COG category:
COG ID:
TIGRFAM ID:
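
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4126849, 4128327)
print(length)                      # 1479
print(protein_length_aa(length))   # 492
```

Both values agree with the Gene Length and Protein Length fields of this record.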


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGCGC GTGAGGGGTC CACGTTCAAA CGGTGCGGAT GCCGTGATGA GAGCACCGGT 
AGGGCGTTGG GGGTGAGGTG TCCGCAGCTC AAGCGCCGGA CCGGGGGTTG GAATCCGGCT
CATGGGGCGT GGGCGTTCCA GTACGAGCTT CCCCCGACTG GTGAGGGTGG GCGCCGCCAG
GCCCGCCGGG TCGGCCTGGC CTCCCAGGAA CAAGCCCGTA AAGAGCTCGG GCATGTCAAG
GCGTTGATCG CCTTGGCGGA GAAGGACCCC ACGGTCGAAG CGCAGATCGC TGACCTGGTC
CAGGCGGCGT TGAAGCAACG CCGCCCACTC CCCGACCTGG AAGAGGTACG TAAACGGGTC
GCGATGGCGG TGGACGTGCG CACCGAACCA CCCACCGTTG CCGCGTGGTT GGGGGAGTGG
TTGAAGGGCA AGCCCGATCT GGCTGCGGGA ACCCGGACCT CCTACGCCGG GCACATCCGC
ACCTATCTCG CCCCGCACCT GGGCCGGCTC CGGGTGGATA AGCTCCAGGC TCGCCACGTC
GAGAGCATGT TCGCGGCCAT CGAGGAAACC AACGCCCACA TCCTCGAATG TCGCGAGTCC
GACGACACTC AGGTGCGGGT TTCGGTCAGG GGGCGGAGGG TGATCTCGCT GTCCACCAAG
CACCGTATCC GCGCCACGTT GCGCAGCGCA TTGTCGGAGG CGGTGCGCCG CCCGGACCTG
CCGGTGTCGG TGAACATCGC CTCGCACGTA CGCCTGCCCT CCTGCCCCAG AAAGCGTCCG
CTGGTGTGGA CACCCGATCG GGTGCGCCAA TGGGCCAAGG ACGGGACCGT GCCGGGAGAG
GTCATGGTGT GGACCCCGGA ACAGACCCGC GCGTTCCTGG TGCACGCGAG AAAGTATCCG
TGGCTGTACC CGATGTTCCA CCTGATCGCG GTCAAAGGCC TACGGCGAGG AGAAGCGGTT
GGCTTGCCCT GGTCCAACAC CCGGCTGACG GACGGGCAGA TCGACATCCG CGTCCAGGTC
GTCCAGTTGG CCTGGGAGAC GATTACCTCC ACCCCGAAGA GCGCGGCGGG ACAGCGCACC
ATCACCTTGG ACGCGGACAC CATCAAGGTG TTGCGGGTCT GGAAGCGGTT CCAGAACGAA
GCCCGTCTCA AAGCCGGACC AGCGTGGACG GACAGCGGCC TGGCCTGCAC CCGCCAGGAC
GGTTCGGGGT GGCATCCGGG GCAGGTCAGT GACTGGTTCC TGCGCATTGC CAGGGCCGCC
GGGTTGCCGC CGATCACGTT GCACGGGCTG CGCCACGGGG CCGCTTCGCT GGCGCTGGCC
GCCGGGACGG ACGTGAAGGT GGTCTCGGCC GAGCTTGGGC ACGCGACCAC GCACTTCACC
CAGGACACCT ACCAGTCGGT GTTCCCGGAC GTGGCCAAAG CGGCGGCCGA AGCCACCGCC
GCGCTGCTGC GTGGCCCGGC TCCGGTGAGG GCCGGGTAG
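
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. As a rough sketch, the GC content field could be recomputed as below. The helper names are hypothetical, and only the first block of the sequence is used as a worked example.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence: 4 G + 3 C out of 10 bases.
print(gc_percent("ATGGTCGCGC"))  # 70.0
```

Passing the full gene sequence text to `gc_percent` would give the value to compare against the GC content field.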
 
Protein sequence
MVAREGSTFK RCGCRDESTG RALGVRCPQL KRRTGGWNPA HGAWAFQYEL PPTGEGGRRQ 
ARRVGLASQE QARKELGHVK ALIALAEKDP TVEAQIADLV QAALKQRRPL PDLEEVRKRV
AMAVDVRTEP PTVAAWLGEW LKGKPDLAAG TRTSYAGHIR TYLAPHLGRL RVDKLQARHV
ESMFAAIEET NAHILECRES DDTQVRVSVR GRRVISLSTK HRIRATLRSA LSEAVRRPDL
PVSVNIASHV RLPSCPRKRP LVWTPDRVRQ WAKDGTVPGE VMVWTPEQTR AFLVHARKYP
WLYPMFHLIA VKGLRRGEAV GLPWSNTRLT DGQIDIRVQV VQLAWETITS TPKSAAGQRT
ITLDADTIKV LRVWKRFQNE ARLKAGPAWT DSGLACTRQD GSGWHPGQVS DWFLRIARAA
GLPPITLHGL RHGAASLALA AGTDVKVVSA ELGHATTHFT QDTYQSVFPD VAKAAAEATA
ALLRGPAPVR AG
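
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard code, and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (ten codons).
print(translate("ATGGTCGCGC GTGAGGGGTC CACGTTCAAA"))  # MVAREGSTFK
```

The output matches the first block of the protein sequence, and the final gene block `GCCGGGTAG` translates to `AG` followed by a stop, matching the last two residues.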