Gene Ndas_2365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2365 
Symbol 
ID9246215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2813315 
End bp2814565 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003680293 
Protein GI297561319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.618021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGAC AGAGACCGCG GGCCACCGCC GTGGCCGCCG CGCTGTCGGC GGCCGCGCTG 
CTCGCCGCGG GGTGCGGTGG GGACGGGGGC GGGGACCCGA ACACCATCGA GCTGGTCGTG
GCGCAGTACA CCGAGGGGAC CCAGCCCTAC TGGACCGACC TGATCCAGGA CTTCGAGGCC
GACCACCCCG GCACGAGCGT CCGGCTGCGG GTGATCGGCT GGGACGACCT CCAGAACCAG
GTCAACACGA TGGTGCAGAC CCGGCAGTTC CCCGACATCC TCAACACGAA CCTCTTCGCC
GACTACGCCG AGGCGGGCCT GCTGCACCCG GCGCGGGACG TGCTGCCCGA GGACAAGTTC
ACCGACTTCG TCCCGGTCCT GGCCGAGAAC GCCTCGCTGG AGGGTGAGCA GTACGCCCTG
CCGTTCGTCG CGACGGTGAA CGCGATGTAC TACAACCGGA CCATCTTCGC CGAGGCCGGG
ATCAGCGAGC CCCCGCGGAC CTGGGACGAG TTCCTGGAGG CGGCGGAGCG CGTCAAGGCG
CTCCCCGGCG ACCACGTCCC CTACGCGCTG GCGCTGGGCT TCGACGGCGG CGACTACGAG
TTCGGCACGT GGGCGCGCTC CAACGGCGGC GGCTGGAAGC AGGACGGCGA GTGGACGGTC
AACAGCGACC GCAACGTCGC CACGCTGGAG TTCCTCCGGG ACCTGGTGGT GGAGCACGAA
GCCACCCAGC CCAACCCCGG GCAGACCAAC CGCCCCGACG GCACGTGGCC GCTCTTCGCC
CAGGGCAGGG CCGCCATGGT GTACGCGCCG CTGGGCGGCA GCGCGTTCCT GGACCCGGTG
CACGAGGCGG GCGTGGACTA CGGCACGACG ACCCACCCGA CCAACGGCGG CGCCGAGCCC
TCCACCCACG GCATCCAGGA CTACCTGGTG GCCTTCGACA ACCCCGGCAA CCAGGAGCTG
GTCACCGAGT TCCTGGACTA CTTCTACGAA CCGGAGAACT ACACCGCCTA CCTGGAGGTC
GAGGGGCTGC TGCCGACCAC CGAGTCCGGC GTCGAGGAGT TCCGCGACGA CCCCGACGTG
GGGCAGTACG TCGAGCAGAT CCCCGAGGCA CGGCTGGACC CCACCTACGA ACCGGTCTGG
GCCCAGCTGC GCGGCACGAT GGCCGGGGAG CTGGGCACGG CCGTGGCCCC GGACGGGGAC
CCGCGCGCCG TCCTGGACAG GGGCCAGGAG ATCGCCGCCT CCGGCCCGTG A
 
Protein sequence
MRRQRPRATA VAAALSAAAL LAAGCGGDGG GDPNTIELVV AQYTEGTQPY WTDLIQDFEA 
DHPGTSVRLR VIGWDDLQNQ VNTMVQTRQF PDILNTNLFA DYAEAGLLHP ARDVLPEDKF
TDFVPVLAEN ASLEGEQYAL PFVATVNAMY YNRTIFAEAG ISEPPRTWDE FLEAAERVKA
LPGDHVPYAL ALGFDGGDYE FGTWARSNGG GWKQDGEWTV NSDRNVATLE FLRDLVVEHE
ATQPNPGQTN RPDGTWPLFA QGRAAMVYAP LGGSAFLDPV HEAGVDYGTT THPTNGGAEP
STHGIQDYLV AFDNPGNQEL VTEFLDYFYE PENYTAYLEV EGLLPTTESG VEEFRDDPDV
GQYVEQIPEA RLDPTYEPVW AQLRGTMAGE LGTAVAPDGD PRAVLDRGQE IAASGP