Gene Ndas_5231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5231 
Symbol 
ID9249124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp383046 
End bp384470 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content76% 
IMG OID 
Productphage shock protein C, PspC 
Protein accessionYP_003683117 
Protein GI297564144 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.118703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0028531 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCATGA CTGACGACCC GGTACCGGAG GGCGCCCCGG CCGCGTCCGC TTCCGGTACG 
GCGGAGGCGG GCGCCGGGGC CTCCCCGCGT GAGCTGCGCA AGGGGGACGA GGAGCGGGTC
CTGGCCGGTG TGTGCGCGGG CCTTGGCCGG TACACCGGGG TGGATCCGGT GGTGTGGCGT
GCGGCCTTCG TGCTGACCTC GTTCGCCGGG GCGACGGGCC TGCTGCTCTA CATCGCCGCG
TGGATGCTCA TGCGCGACGC CCAGGGTGTT CCGGCGACGT TCGAGCAGAT GCTCAACCGC
AGCATCCCGC CGCGGACGGT GCCCAAGCTG CTCGCGGTGG GGTTGGCGGT CGCGACGACG
TTCAGCCTGG TCGGCGGGTT GGGGTGGTCG ACGCTGGTGC TGGCGGTGCC CCTGGTGTTG
GGTCTCCTGG CCGCGCGCAA CCGGGGCGTG GACCTTCGGA CGGCGTTCAC CGAGCTGCGC
GAGGACCTGC GGGCCACGGA TCCGCCGCCC GCGACGCCGT CCCCGGAGCC GACCGCGACC
TACTACAACC CGGCCCAGCC GTGGGCGTCC GCGCCGCACG GGCCGGTGGA CCTGGCGGTG
GTGTCGGAGC GCAGCGCCGC GCGGGACGCC GGAGGGGACG AGGACGAGGA GGAGGGGCGT
TCCGGCGGGT CGGGCGAGCC CGGCGCGCGG GGGGCGGAGG ACGAGGGCCG CGCCCCTGGG
GGCAGGTGCC TGCCGCTGGC CTCCATGGCC CTGTGGACGG TCGTGGCCGG GGCGGTCGTG
GTGTCGGTGC TGGAGTTCGG CTGGTCGTCC TCGCTGTGGA GCGGGCGGAC GGCCGACCTG
CTGTTCGGGC CGGAGACCGG GGTGTTCTTC CTGGCCGGGG CCCTGGCGGT GGTCGGCGTG
TACGCGCTGG TCGGCGTCTG GGCGGGCAAC CCGCGGGGGC TGCTGCCGAT GGGCGCGGCG
GCCGTTCTGC TGCTGGTGCT GGCGTCGGTG ACCGACCTGA CCCGGGTGCG GATCGGCGAC
GAGACGTGGC GGCCGACCAC GGTGGCCGCG GCCGAGTCCG GCGACCACCG GCTCACCGTG
GGCTCGGGGA CCCTGGACCT GACCGGGCTG GAGGACCTGG AGCCCGGCGA GGCCGTCGAC
GTGTCGCTGT GGATGGGGGC GGGCCGCGTG GAGCTGGTCC TGCCCGAGGA CGCGGAGGTC
GCGGTGCACT CGAAGATCGG TCTGGGGGGC GTGGACTTCG CCGAGGACGG GCGGGACGAC
ATGTTCGGGG TCTCCCTGGA CCACGAGGAG GTCCACGGGG CCTCGGCGGA CGCCGGGGCC
CGGGACGGGC AGGCGCGGGG CGGCGACGGG GCCGGGGCCC CGCGGATCAA CGTGCGCACC
GACTCACTGG TCGGGGTCGT GGAGGTGAAG CGTGGCGGAG CGTAG
 
Protein sequence
MTMTDDPVPE GAPAASASGT AEAGAGASPR ELRKGDEERV LAGVCAGLGR YTGVDPVVWR 
AAFVLTSFAG ATGLLLYIAA WMLMRDAQGV PATFEQMLNR SIPPRTVPKL LAVGLAVATT
FSLVGGLGWS TLVLAVPLVL GLLAARNRGV DLRTAFTELR EDLRATDPPP ATPSPEPTAT
YYNPAQPWAS APHGPVDLAV VSERSAARDA GGDEDEEEGR SGGSGEPGAR GAEDEGRAPG
GRCLPLASMA LWTVVAGAVV VSVLEFGWSS SLWSGRTADL LFGPETGVFF LAGALAVVGV
YALVGVWAGN PRGLLPMGAA AVLLLVLASV TDLTRVRIGD ETWRPTTVAA AESGDHRLTV
GSGTLDLTGL EDLEPGEAVD VSLWMGAGRV ELVLPEDAEV AVHSKIGLGG VDFAEDGRDD
MFGVSLDHEE VHGASADAGA RDGQARGGDG AGAPRINVRT DSLVGVVEVK RGGA