Gene Ndas_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0631 
Symbol 
ID9244473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp776259 
End bp778154 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content71% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003678583 
Protein GI297559609 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00116155 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.250691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCC ACCACCCTCC CCCCGGTTCC CCCGTGCGCC GCACCGTCCC TCCCCACGCC 
CCTTCCACCG GCGCGTTCCC GCGCCGGGCC CCGCGCCGTA TCGCCGCCAC GGCCGCCGCG
CTGCTCCTCC TGGCGGGCAC CGCCGCGGCC CCCGCCGCCG CGGACACCTC CGACGGCCAG
ACCCTCAGCA TCGCCACCTC CCAGCAGGTG GACTCCTTCA ACCCCTTCAC CGCGCAGCTC
GCGATCACCA CCAACGTCCT GCGCCACGTC TACGACTCCC TCGTCACGGT CGACCCCCAG
ACGAACCAGC CCGCCCCCTC CCTCGCCGAG TCCTGGGAGT CCAGCGACGA CGGCCTCACC
TGGACCTTCC ACCTGCGCGA GGGCGTCCGG TTCTCCGACG ACGAGCCCCT GACCGCCGAC
GACGTGGTCT GGACCTTCAC CACCATGATG GAGAACGAGG CCGCGGCCGT CGCCAACGGC
AACTACGTCT CCGGCTTCGA CACGGTCACC GCCGAGGACG ACCACACCGT GGTCATCGAA
CTCGACGAGC CGCAGGCCAC CATGACCTCC CTCAACGTCC CGATCGTCCC CAGGCACGTC
TGGGAGCCGA TCCTGGAGCG CGAGGGCGAC GCCTTCGCCG ACTACGGCAA CGAGGACTTC
CCGACCGTCG GCAGCGGCCC CTTCGTCCTC ACCGGCCACG ACCGGGGCCG CTCCATCACC
CTGGAGGCCA ACCCCGACCA CTGGCGCGGC GCACCCGCCT TCGAGCGGGT CGTCCTGCGC
TACTACTCCG AGAAGGACGC CGCGGTGGAG GCGCTGCGCA GCGGTGAGGT CTCCCTGGTC
TACGAACTCA CCCAGGCGCA GGCCGCAGCG CTGGAGTCGG CGCAGGACGT CCGGGTCAAC
ATCGCCGACG GCAAGCGCTT CCAGGCCTTC ACCATCAACC CCGGCGCGGT CACCCAGGAC
GGGGAGGAGT TCGGCGACGG GCACCCCGCC CTCGCCGACC GCACCCTGCG CCAGGCCATC
GTCATGGCCA TCGACAACCA GGAGATCGTC GACAAGGCGC ACGGCGGCGA GGCCGTGGCC
GCGGGCGGCT ACATCCCGCC CCGCTACGAG GACTTCCACT GGGCGCCCGA GGGCGAGGAG
GCCGTCCTCG ACTTCGACCC CGAGGCGGCC AACGCCATGC TCGACGAGGC CGGGTACGAG
CGGGGTGAGG ACGGCGTGCG CGTCTCACCC GAGGGCGACC GCCTCGAACT GCGGATGCAC
GTCCACCAGG ACCGGCCCGA CAACGTCAAC ACCGGGTTGG TCATCGTCGA GCGCCTGGCC
GACATCGGCA TCGAGGTGGA GAACCTCACC GTCGACCCCG GCGTGCTCAG CGACGCCCTC
TTCGCGGGCG AGTACGACCT CATCTTCACC GGCTGGACGG TCAACCCCGA CCCCGACTAC
GTGCTGAGCA TCCACACCTG CGGCGCCCTG CCCACCGAGC CGGGCACCAT GCAGGGCGAC
GCCTACTTCT GCGACGAGGA GTACGACGAG CTCTACGAGG CCCAGCTCGC CGAGTACGAC
CGCCAGGCCC GCGCGGAGAT CATCCACCAG CTCCAGGAGG TCCTCTACCG CGAGGCCGTC
GTGAACGTGC TGGCCTACCC CAACATCATG GAGGCCTACC GCACCGACCA CATCGCCTCC
ATCCAGTACG AGCCCGCCGA GGGCGGCAAC ATCTGGGGAC AGGACGGCTA CTGGGCCTGG
TGGTCGGCCG AACCCGCCGC CGAGCGGACC GCGGGCGCGG CCTCCGGTCC CTCCGCCGGG
GTCTGGATCG GCGTCGGGGC CGTCGTGCTC GTCCTCGCCG CGGTCGGGGG CTTCCTGCTG
CTGCGCCGAC GTTCCACCAT GGAGGACCGC GAGTGA
 
Protein sequence
MNAHHPPPGS PVRRTVPPHA PSTGAFPRRA PRRIAATAAA LLLLAGTAAA PAAADTSDGQ 
TLSIATSQQV DSFNPFTAQL AITTNVLRHV YDSLVTVDPQ TNQPAPSLAE SWESSDDGLT
WTFHLREGVR FSDDEPLTAD DVVWTFTTMM ENEAAAVANG NYVSGFDTVT AEDDHTVVIE
LDEPQATMTS LNVPIVPRHV WEPILEREGD AFADYGNEDF PTVGSGPFVL TGHDRGRSIT
LEANPDHWRG APAFERVVLR YYSEKDAAVE ALRSGEVSLV YELTQAQAAA LESAQDVRVN
IADGKRFQAF TINPGAVTQD GEEFGDGHPA LADRTLRQAI VMAIDNQEIV DKAHGGEAVA
AGGYIPPRYE DFHWAPEGEE AVLDFDPEAA NAMLDEAGYE RGEDGVRVSP EGDRLELRMH
VHQDRPDNVN TGLVIVERLA DIGIEVENLT VDPGVLSDAL FAGEYDLIFT GWTVNPDPDY
VLSIHTCGAL PTEPGTMQGD AYFCDEEYDE LYEAQLAEYD RQARAEIIHQ LQEVLYREAV
VNVLAYPNIM EAYRTDHIAS IQYEPAEGGN IWGQDGYWAW WSAEPAAERT AGAASGPSAG
VWIGVGAVVL VLAAVGGFLL LRRRSTMEDR E