Gene Ndas_4276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4276 
Symbol 
ID9248150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5089910 
End bp5091016 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content74% 
IMG OID 
Producttransport system permease protein 
Protein accessionYP_003682171 
Protein GI297563197 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACG ACGCGGCACC CGACGGCGCG GCGTCCGGGC GGACCGGCCC GGCCGGGGTC 
AGATGGCTGG CCACACGGGC GGGCACGGCC ACCGCCGCGG TCGTCCTCGG CGTCCTCCTC
GCGGTCTCCG CGGTGCTGGC CATCGGCCTG GGATCGGCGG TGGTCCCCCC GGCCGAGACG
GTGCGCTACC TGTGGGCGGC CCTGAGCGGC GGCGTGATCC AGGCAGACGA GGTGGTCCGC
TACCAGATCA TCTGGCAGAT CCGCACCCCC CGGGTCCTGC TCGCCGCCGT CGTCGGGGCC
GGTCTGGGGG TCGTGGGCGT GGCCGTCCAG GCCATGGTCC GCAACGCCCT CGCCGACCCC
TACATCCTGG GGGTCTCCTC GGGCGCCTCC GTCGGCGCGG TACTGGTCAG CGTGCTGGGC
GCACTGTCCG CGCTCGGTAT CCACGCGGTC TCCGCCGGGG CCTTCCTGGG CGCGCTGGGC
GCGACGCTGC TCGTCCACCT CGTCGCCCGC TCCCCCACGG GGGTCACCCC CCTGCGCCTG
GTGCTCACCG GGGTGGCGCT CTCCTTCGGC TTCCAGGCCG TGATGAGCGT GATCGTCTAC
CTGGCGCCGA GCAGCGAGGC CACCGCCACC GTGCTCTTCT GGACCATGGG GAGCTTCGGC
GCGGCCACCT GGAGCTCCCT GCCCGTGGTG GCCGTCGTGG TGGTCCTGGG CATCCTCGTG
CTGCGCGGCC TCGCGCGCCC GCTCGACGTG CTGGCCCTGG GCGACGAGAC CTCGGCGAGC
CTGGGAGTGG ACGCGGCCGG GCAGCGCAGG GTCCTGTTCG CGGTGACCGC GGTCATGACC
GGGGCCATGG TCGCCGTCAG CGGCGCGATC GGCTTCGTCG GCCTGGTCAT CCCCCACATC
GTGCGGATCG CGGTGGGCGC CGACCACCGG CGCGTGCTCA CCGTCGCCCC GCTGCTGGGC
GCGCTCCTCA TGGTCTGGGT GGACCTCTTC TCCCGCGTGG TGGTGGCCCC GCGCGAACTC
CCCCTGGGCG TCATCACCGC CCTGATCGGC GTACCGGTGT TCCTCGGTCT CATGCGCCGC
CGCGGCTACC TGTTCGGAGG ACGATGA
 
Protein sequence
MPDDAAPDGA ASGRTGPAGV RWLATRAGTA TAAVVLGVLL AVSAVLAIGL GSAVVPPAET 
VRYLWAALSG GVIQADEVVR YQIIWQIRTP RVLLAAVVGA GLGVVGVAVQ AMVRNALADP
YILGVSSGAS VGAVLVSVLG ALSALGIHAV SAGAFLGALG ATLLVHLVAR SPTGVTPLRL
VLTGVALSFG FQAVMSVIVY LAPSSEATAT VLFWTMGSFG AATWSSLPVV AVVVVLGILV
LRGLARPLDV LALGDETSAS LGVDAAGQRR VLFAVTAVMT GAMVAVSGAI GFVGLVIPHI
VRIAVGADHR RVLTVAPLLG ALLMVWVDLF SRVVVAPREL PLGVITALIG VPVFLGLMRR
RGYLFGGR