Gene Ndas_0604 details


Gene Information

Locus tag: Ndas_0604
Symbol:
ID: 9244446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 740028
End bp: 741242
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: integrase family protein
Protein accession: YP_003678557
Protein GI: 297559583
COG category:
COG ID:
TIGRFAM ID:
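
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 740028
end_bp = 741242
gene_length_bp = 1215
protein_length_aa = 404

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the computed span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.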


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGTGG TCGACACTTG GCACAAGACC GTCAAGCTCC CGGACGGCTC GCTTCGACGT 
GAGAAGTCCG CCTCCTACGG CCGCGGCAAG CGCTGGGCTG CCCGCTACCG GGACGACAGG
GGACAGCAGA AGTCCCCGAA GTTCAAGACC AAGCCCGAGG CCGAACGCCA TCTGAAGAGG
GTGGAAGGCG AACTCCTGCG AGGAACCTAT GTGGACCCTG CGGCGGGCAA GGTCACCTTG
CGGGAGTTCG CGGACCGATG GCGTGAGGAC GTTCTCCACC GGCCGCTGAC GGCCTCCCGC
GTCCTCGGGG AGCTGGAGAA CCACATCTAC CCGGAGTTCG GGCACCGCGG GCTGTCCAGC
ATCCTTCCCT CGGACGTGCA GCGCTGGGTG ACCCGGCTCG GCAAGTCACT GGCCCCGGCC
ACGGTGGAGA GCATCTACGC CACCCTGCGG GGTGTCTTCG AGGCCGCCGT GCGTGACGAC
CTACTGACCA AGACCCCCTG CCGGGGCATC CGCCTTCCGG AGAAGCGGAA GACGGCTGAG
CCCGTCATGC CGGTGGAGAC CCTGCGCGAG GTCGTGGACG GCGTTCCCGA ACGGTTCAAG
GCCGTCGTGC TCCTGGCCGC TGGCAGTGGC CTCCGGCAAG GGGAACTCTT CGGGCTGGAG
CTTCACCACC TGGACGTCGA GCACCTGGTG CTGCGCGTGG AACAGCAACT GGTCACCCCG
GTGGACAACG TCGCCTACCT GGCACAGCCG AAGAGCCCGG CCAGCTACCG GGACATCCCG
CTGACGCGGG AAACCGCGGA CATGATGCTC GTCCACCTCG ACTCCTTCCC GGCCGGGCCG
GTGGAGATCG AGGACCGGAC CGACCCCCAC AAGCCGCGGC GGCGGACAGC ACACCTGCTC
TTCACCCTGC CGAACGGTGG TCCCGTCCGC CGTTGGGAGT GGAACAGGAT CCTCACCCCG
GCGATGGCCA AAGCGGGGAT GCCGAAGCGT TCCGGCCTGC ACTCGGTACG GCGCTTCTAC
GCCTCCCTGC TGATCCGCTA CGGCGAGTCG GTCAAGACCG TGCAGACGCG CATGGGCCAC
AGCTCGGCCG CCATCACGCT GGACGTCTAC GCCGGGCTGT GGCCAGACAG TGAGGACCGC
ACCCGCGAAG CGGTCGCCAG CGGGCTCGGG AATCTCCACA GTGTGCGCCC TGTGTGCGCC
GACGACTCCG ACTGA
 
Protein sequence
MPVVDTWHKT VKLPDGSLRR EKSASYGRGK RWAARYRDDR GQQKSPKFKT KPEAERHLKR 
VEGELLRGTY VDPAAGKVTL REFADRWRED VLHRPLTASR VLGELENHIY PEFGHRGLSS
ILPSDVQRWV TRLGKSLAPA TVESIYATLR GVFEAAVRDD LLTKTPCRGI RLPEKRKTAE
PVMPVETLRE VVDGVPERFK AVVLLAAGSG LRQGELFGLE LHHLDVEHLV LRVEQQLVTP
VDNVAYLAQP KSPASYRDIP LTRETADMML VHLDSFPAGP VEIEDRTDPH KPRRRTAHLL
FTLPNGGPVR RWEWNRILTP AMAKAGMPKR SGLHSVRRFY ASLLIRYGES VKTVQTRMGH
SSAAITLDVY AGLWPDSEDR TREAVASGLG NLHSVRPVCA DDSD
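
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline: it builds a codon table in the conventional TCAG ordering, whose amino acid assignments are shared by the standard code and translation table 11 (the two differ in permitted start codons, not in these assignments). The function name `translate` and the two test fragments, taken from the first and last rows of the gene sequence, are illustrative choices.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence shown above.
first_bases = "ATGCCTGTGG TCGACACTTG GCACAAGACC"
last_bases = "GACGACTCCG ACTGA"

print(translate(first_bases))
print(translate(last_bases))
```

The two fragments translate to the first ten and last four residues of the listed protein sequence, with the terminal TGA read as a stop.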