Gene Ndas_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0121 
Symbol 
ID9243952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp149539 
End bp150807 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003678077 
Protein GI297559103 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.534504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTGG TCGACACCTG GCACAAGACC GTCAAGCTCC CGGACGGCTC CGCCCGGCGC 
GAGAAGTCCG CCTCCTACGG CCGCGGCAAG CGCTGGGCCG CTCGCTACCG GGACAGCAAC
GGACAGCAGA AGTCCCCGAA GTTCAGGACC AAGCCCGAAG CCGAACGCCA CCTCAAGAAG
GTGGAGGGGG AACTTGCGCG GGGTCTCTAC GTCGATCCCA GCGCCGGACA GGTCACCCTC
AAGGAGTTCG CGGAAGACGT GGTCAAGAAC GCTTCCGTGG AGGAGTCCAG CCGCCACGAC
CTGGAGCGCC GCTTTCGCAA GCACGTGTAC CCGTTCCTCG GGGGCAGCCA GCTCCGCGCC
ATCCGGCCAT CGATGATCCA GGAGTGGATC AAGGGCCGCT CCGCCGAACT CGGAGACCAG
ACCGTGCGGA CCGTCTTCGA CAACCTGTCG ATGGTCTTCC AGGCCGCCGT GGACGACGAA
CTCATCGCCC GCAACCCGTG CCGCGCTGGC TCCGTCAAGC CTCCGTCAGT GACTCGCCGG
AAGGTGATCC CGTGGTCGGT CGAACTCGTC TCGGGGATCC GGTCCGCCCT GCCCGCTCGC
TATCAGGCCC TGGTGGTCCC CGGTGCGGGA TGCGGCCTGC GGCAGGGTGA AGTCCTGGGC
CTGGCCGTCG ATGACCTGTC CGCCTCGAAG CACACGCTGC ACGTCCGGCG ACAGGTGAAA
CTCATGGCGG GGAAGCCGGT GTTCGCTCCT CCCAAGGGCG GCAAGGAGCG AGAGGTTCCC
CTCCCCGGCC ACGTGCTGTC CGCTCTGGCC GCTCACATGG AGCGCTTCCC GCCCGTGGCG
GTGACGCTCC CCTGGAAGCA CTTCGGGGGC AAGCCGGTGA CGGTCAGCCT GATCTTCACC
AGCCGGGAGA GGAAGGCGCT GAACGCGACC TACGTCAACG CCTATCTGTG GAAACCGGCC
CTGGTGGCTG CTGGCGTCCT GCCCGCGCCT CCTGCCGGTG AGCGCATCCA GGCGGCCCAT
GACAAGGGCT TCCACCAACT GCGCCACCAC TACGCCAGTG TGATGCTGGA CTCAGGGGTG
AGCGTTCGGG CGCTGGCTGA CTTCCTCGGA CACCACGACC CCGGGTTCAC CCTGCGGACC
TACGCGCACA TGATGCCGAA GAACGAGGAA CGGGCTCGGG AAGCCATCGA CCGTGCCTGG
TCCGCAGTCG ATGGCCTCCG CTCCCCGTGT GCGCCCGATG TGCGCCCGTC CGACAGCCAG
GGTTCCTGA
 
Protein sequence
MPVVDTWHKT VKLPDGSARR EKSASYGRGK RWAARYRDSN GQQKSPKFRT KPEAERHLKK 
VEGELARGLY VDPSAGQVTL KEFAEDVVKN ASVEESSRHD LERRFRKHVY PFLGGSQLRA
IRPSMIQEWI KGRSAELGDQ TVRTVFDNLS MVFQAAVDDE LIARNPCRAG SVKPPSVTRR
KVIPWSVELV SGIRSALPAR YQALVVPGAG CGLRQGEVLG LAVDDLSASK HTLHVRRQVK
LMAGKPVFAP PKGGKEREVP LPGHVLSALA AHMERFPPVA VTLPWKHFGG KPVTVSLIFT
SRERKALNAT YVNAYLWKPA LVAAGVLPAP PAGERIQAAH DKGFHQLRHH YASVMLDSGV
SVRALADFLG HHDPGFTLRT YAHMMPKNEE RAREAIDRAW SAVDGLRSPC APDVRPSDSQ
GS