Gene Ndas_4539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4539 
Symbol 
ID9248419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5382898 
End bp5384127 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content69% 
IMG OID 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_003682432 
Protein GI297563458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.584645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACCCG TTGACTTTCC GCCGTCCCTG CCGATCGGTG ACGGCTTCGA GGCACTCAAC 
GCCTGGCTCA AGAGCACCTT CGGCCTCTTC TTCGACGCCG TAGGCGAGTT CATCCGCTGG
GCCGTGAGCG CCCTGTCCGG CTTCTTCACC GAGCCGGCCG CCGGACAGCT CGCCTTCGTG
GCCGCCGTGA TCATCGCCGT TCCGCTGCTG CGCCGCCGGC GGCTGCCGCT GGTGGTGATC
GTCGCCGCGG TCTCGCTGCT CTACGTGCTG CACGCGCAGT TCGCGCTCGG CAACACCATC
GTCAACAGCC TGCCGCCGCA GCTCTTCCTG TGGGCGATGA ACCTGTTCGA CCAGCAGTAC
GTGGACGCCT ACTTCTGCAT CTCGCTGGTC TTCCTGGTCG CGGTCACCGT CCTGGGCGTG
CTCGACCGGG GCACCCGGGT GCGCACCGGC GTCGTCGCGG GCGGCTGGCT CGTGCTGGTC
CTGCTCGGCT GGCTCGTGCT CCCGATGCTG GTGACCCGGC CCGGGGCACT GCTGATGATC
CTGCTGCTGA GCCTTCTGGC GCTCTCGGTG GCCGGGTGGC GCATGGGCCT GTTCGGCCTG
ATCGCCCTCA CCCTGGTGGC CTCGGTCGGC CAGTGGACCA ACGCCATGGA CACGCTCGGC
CTCGTCCTCG TGGCCAGCGC CATCGCCGTG GTCATCGCGA TCCCGATCGG CGTCCTGGCC
GCCTACAACG ACCTGGTCAG CAAGATCGTC AAGCCGGTCC TGGACCTGAT GCAGACGCTG
CCCGCGTTCG TGTACCTGAT CCCGGCGATC TTCTTCTTCT CCATCGGCGC CGTCCCCGGC
GTGGTCGCCA CCATCGTGTT CGCGATGCCG CCCGGCGTCC GCCTGACCGA GCTGGGCATC
CGCCAGATCG ACAAGGAGCT GGTGGAGGCG GGCGAGTCCT TCGGCGCCTC CCCGACCAAG
GTCCTCACCG GCATCCAGCT GCCGCTGGCC CTGCCCACCA TCATGGCCGG GGTCAACCAG
GTCATCATGC TCGGACTGTC GATGGTCGTC ATCGCGGGCA TGGTCGGCGC GGGCGGCCTG
GGCAGCGAGG TCTACCAGGG CATCACCCGC AACGACGGCG CCCTCGGTTT CGAGGCCGGT
ATCGCCGTGG TCATCCTGGC GATCTTCCTC GACCGCCTGA CCGCGGCCGT CACCCGCAAC
AGCCCGCGCG CGGCCGCGTC GGCCGCCTGA
 
Protein sequence
MTPVDFPPSL PIGDGFEALN AWLKSTFGLF FDAVGEFIRW AVSALSGFFT EPAAGQLAFV 
AAVIIAVPLL RRRRLPLVVI VAAVSLLYVL HAQFALGNTI VNSLPPQLFL WAMNLFDQQY
VDAYFCISLV FLVAVTVLGV LDRGTRVRTG VVAGGWLVLV LLGWLVLPML VTRPGALLMI
LLLSLLALSV AGWRMGLFGL IALTLVASVG QWTNAMDTLG LVLVASAIAV VIAIPIGVLA
AYNDLVSKIV KPVLDLMQTL PAFVYLIPAI FFFSIGAVPG VVATIVFAMP PGVRLTELGI
RQIDKELVEA GESFGASPTK VLTGIQLPLA LPTIMAGVNQ VIMLGLSMVV IAGMVGAGGL
GSEVYQGITR NDGALGFEAG IAVVILAIFL DRLTAAVTRN SPRAAASAA