Gene Ndas_3568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3568 
Symbol 
ID9247437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4279126 
End bp4280358 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content76% 
IMG OID 
ProductLycopene beta and epsilon cyclase 
Protein accessionYP_003681475 
Protein GI297562501 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.579585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACT ACGACGTGGC GATCATCGGC GGGGGAGCCG CTGGGCTCAC CCTGACCCAC 
CAGCTTCGCG GGGTCAATGA CCGACGTGGC CGACCGCTGC GGGTCGCCCT GGTGGAACCG
CCCCCCGGGC CGCACACCCC TCCGCCCCGC ACCTGGTGCT TCTGGGAGCC CGACGGCGGG
CCGTGGGACC ACCTGCTGGC CGCCCGCTGG AGGGACCTGT CCGTGGTGGG GCCGGACGGG
GCCGTCCACG ACTCCCCGGC GGCGCCCTAC GTGTACAAGA TGCTGCGGTC GGCCGACGTG
GACGCGCATG TGCGCGCTTC GGCCGGTGAA CACGTGGACC AGCTTCCGGT GCTGGTCACC
GAGGTCGTCG ACGGCGTCGA GCACGCCGTG GTACGGGGCA CCTGCCCCGG AGGCCCCGGG
GGAGGGGAGC GGGAGCTGAC CGCCTCATGG GTGTTCGACT CGCGTCCGCC CCGGCCCGCC
CCGCGCGGGC GCACGCACCT GCTCCAGCAC TTCCGCGGCT GGTTCGTGCG CACGCCCGAC
GACGCCTTCG ACCCCGCCTC GGCCGTGCTC ATGGACCTGC GCCCTCCCCA GCCGGCCAAC
GGCGTGGCCT TCGGCTACGT GCTGCCGCTG TCGCCGCGCG AGGCGCTGGT GGAGTACACC
GAGTTCGGGC GCGAGGCGCT CACGACCCCC GAGTACGAGC GCGCGCTCGA GGACTACTGC
GGCCTGCTCG GGCTGGGGGA CGTGGAGGTG ACCGCGGCCG AGCAGGGCGT CATCCCGATG
ACCGACGCGC GGTTCCGCCC CCGCGCGGGG CGGCGCGTGT TCCGGGTGGG GACGGCGGGC
GGCGCCACCC GGCCCTCGAC CGGGTACACG TTCAGCGGTG TGCGGCGCCA GACGGCCGCC
GTGGCGCGGG CGCTGGCCCA GGGGCGGGCC CCGGTACCGC CGGTGCCCCA CCGCCGCCGC
CACCTGGCGA TGGACGCGGT CATGCTGCGG GCCCTGGACA CGGGGCGGGT GCGGGGAGCG
GAGTTCTTCG CCGGGCTGTT CGCGGCCAAC CGCCTCGGGG ACGTGCTGGC CTTCCTGGAC
GGTGGCTCGC GCCTGCCCCG GGAACTGGCG ATGGGCCTGA GCACACCGGT CGCGGCCATG
TCGCTGACCA GCCTGGACCA GGCGTGGTAC GCGCTGCGCG GGGTCGGTGC GAGGAGCCTC
AGCCGGGGGC CAGGGCCCGC ACGGCGTCGG TGA
 
Protein sequence
MADYDVAIIG GGAAGLTLTH QLRGVNDRRG RPLRVALVEP PPGPHTPPPR TWCFWEPDGG 
PWDHLLAARW RDLSVVGPDG AVHDSPAAPY VYKMLRSADV DAHVRASAGE HVDQLPVLVT
EVVDGVEHAV VRGTCPGGPG GGERELTASW VFDSRPPRPA PRGRTHLLQH FRGWFVRTPD
DAFDPASAVL MDLRPPQPAN GVAFGYVLPL SPREALVEYT EFGREALTTP EYERALEDYC
GLLGLGDVEV TAAEQGVIPM TDARFRPRAG RRVFRVGTAG GATRPSTGYT FSGVRRQTAA
VARALAQGRA PVPPVPHRRR HLAMDAVMLR ALDTGRVRGA EFFAGLFAAN RLGDVLAFLD
GGSRLPRELA MGLSTPVAAM SLTSLDQAWY ALRGVGARSL SRGPGPARRR