Gene Ndas_5431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5431 
Symbol 
ID9249334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp617126 
End bp618460 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein of unknown function DUF571 
Protein accessionYP_003683316 
Protein GI297564343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.55205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGGAT ACTGCCGGGT CACGGTGACC GGACCGGAAC GCTGGGCCGA TCTGGCACTT 
CCCGGAACGG TTCCCGTGGC CACACTGATG CCCCGCATCC TGGAGGTCTG CGCACCCGAG
GAGGAGGGCA CCGAACCCGC CGCGTGGACG CTCACCACCG TGGAGGGCGA CCCCGTCCAC
CCCGACCAGC CGCTGGAGAG CGCGGGCGTC TACGACGGCG ACGTGCTGGT GCTGGACCGC
CGCACCGCGC CGGGCAGGCC CGCGCACGTG GACGACGTCC GCGGCGCGGT CGAGGACCGC
GTCGACGCCA CCGCGCACAT CTGGAACCCC ACGACCACGC TGTCCTTCGG CCTCCTCGTC
GCGGCCATCG GCCCCCTGCT GCTGCTCGGG CTGATGACGC GGCTGAGCCC GTCCGCCTGG
CACCTGGGGA TCGCCTCGGC GGGAACCCTG TTCACGGTCG CGGTCATGCT GCTCGCCGCG
CGCAGGCCGC TGCCCGCCGT CGCGCACGTG CTGTTCACGA CGGCCTGCGC GTGGGGCGCG
GTCACCGCCG TGCTCGCCGC GAACCTGCTG ACCGACGCGA ACTTCCTGGT GCAGGCGGCC
TTCGCGCTCT CCGGCGCCCT GCTGGTGGCG GTGATCGGCT GGACGATGCA CGAGACGGGG
CTGGCCTACA TCTGCGCGCT CGGCGTGCTC GCGGTGACGG CCGGAGTGCT CGTGGTGGTG
GGCGTCTTCG TGGAGCCGGT GCAGGGGGCG CGGTCCATGG GGCTGGTCCT GGCACTGTGC
GTGGGCGCGC TGCCGCGCGT GGCGATGGTG ATGGGCGGGC TGTCCGGGCT CGACTACGAG
GTGGGGCGCT CGGGGCAGGT CACCACGGAC CGGTTCGAGG ACACCTTCGG CAACACCGAC
CGCATCCTGT TGGGCGTCGT GCTGGGCGCG GCGGTGAGCG GCGGCGCGAC GACCGTGCTG
CTGGCCTACC TGGCCACGGG CCTGCCCGAC CTGCTGCTGT GCGCGCTGCT CTCGCTGCTG
CTGGTGCTGC GCTCGCGGCT GTTCGACCGG ATCCGGCACG TGCTGCCGCT GCGCCTGGCG
GGGGTGCTGG GCCTGGGCGC GGCGGGTGTC GCGACGGTCG GCGAGTACGC CTTCCTGGCG
CCGTGGCTGC CGCTGGTCGC GCTCGTGGCG GGGATCGCTC TGGGAGTGCT GAGCTGGGTG
CGGCTGACCG ACGTGCCGCG TGCCTCGCTG CGCCGCCTCC TCAACTGGAC GGAGATCCTG
GTGATCATCG CGATGTGCGC GGTGTTCGCC TGGGGGATGG GCCTGTTCGC GTTCGTGGAG
CGGATGACCT CGTAG
 
Protein sequence
MSGYCRVTVT GPERWADLAL PGTVPVATLM PRILEVCAPE EEGTEPAAWT LTTVEGDPVH 
PDQPLESAGV YDGDVLVLDR RTAPGRPAHV DDVRGAVEDR VDATAHIWNP TTTLSFGLLV
AAIGPLLLLG LMTRLSPSAW HLGIASAGTL FTVAVMLLAA RRPLPAVAHV LFTTACAWGA
VTAVLAANLL TDANFLVQAA FALSGALLVA VIGWTMHETG LAYICALGVL AVTAGVLVVV
GVFVEPVQGA RSMGLVLALC VGALPRVAMV MGGLSGLDYE VGRSGQVTTD RFEDTFGNTD
RILLGVVLGA AVSGGATTVL LAYLATGLPD LLLCALLSLL LVLRSRLFDR IRHVLPLRLA
GVLGLGAAGV ATVGEYAFLA PWLPLVALVA GIALGVLSWV RLTDVPRASL RRLLNWTEIL
VIIAMCAVFA WGMGLFAFVE RMTS