Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2615 |
Symbol | |
ID | 9246466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3118745 |
End bp | 3120229 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | sodium/glutamate symporter |
Protein accession | YP_003680538 |
Protein GI | 297561564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00493587 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.013038 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCTG AGACCGTCGG CCTCGCACTC CTGCTGCTGG GAGTGCTGCT GCTGACAGCG AAGCTGGTAC GGGTCAGGTG GAAGCTGACG CAGAGGCTCT ACCTTCCCGC GTCGATCATC GGCGGTGGTG TCGCCCTGCT CCTGGGCCCC GACGTGTTCG GGCGCCTCAT GGGCTGGCTG GCCGACCGGG GCGTCGGTGG GGGCTTCGCT GAGCGCGCGG CCGAGGGCGG CCTCTTCGGT ACCGACGTGA TGGCGGTCTG GTCGGCCCTG CCCGGCCTGC TGATCTCGGT GGTCTTCGCC AGCCTCTTCC TCGGCAAGCC CCTGCCGCGC TTGCGCGAGG CCGCCTCCCT GGCGGGCCCC AACCTCGCCT TCGGCGTCAC CATGGCCAGC GGCCAGTACG TGATCGGCCT GGTGCTCGCC CTGCTGGTGC TCGCACCCCT GTTCGGCGTG CCGGTCATCT CCGGGGCCCT GATCGAGATC GGCTTCCTCG GCGGGCACGG GACCGCGGCG GGGCTGGGCG ACACCTTCGA GCAGGCGGGC TGGGCCGCGG GGCAGGACCT GGCCCTGGGC ATGGCCACCG TCGGACTGCT GTCGGGGATC ATCGTCGGCA TCGTCCTCAT CAACTGGGGC GTGCGCCGCG GCAAGTCCAA CGTGATCAGC GACGACGCGC GCGGCAGCGA CACCGAGCAG GCCGGGCTGG TCGAACCCGG CAAGCGCTCG GCGGGCTCGA CCATGACCGT CCACCCCTCG TCCATGGACC CGATGACGCT CCACTTCGGG CTGGTCGCCC TCGCCGTGCT GATCGGTCAG CTCATCCTCA TGGGGTTGCA GTGGCTGGAG CGGACTCTGT GGGCCGACAC CATCGAGATC ATGGCCTACG TGCCGCTGTT CCCGCTGGCC ATGATCGGCG GCATCCTCCT CCAGGTGTTC ATCGACCGCT TCGACCGCAA CGACGTGGTG GACCACGGGA CCGTGGAGCG GATCCAGGGC CTGTCGCTGG ACGTGCTCAT CATCGCCGCG ATGGCCACCC TCTCGCTCCA GGCCATCGCC GACAACATCG CCGCCTTCAC CATCCTCGCC GTGGCGGGCG TCCTGTGGTC GCTGTGCGGC TTCCTGTTCC TGGCGCCGCG GATGATGCCC AGCCACTGGT TCGAACGGGG CATCGGCGAG TTCGGCCAGT CCCTGGGCGT CACCGCCACG GGCCTGGTGC TGATGCGCGT CGTCGACCCG GAGATGAAGT CGCCCGCCTA CCCGGCGTTC GGCTACAAGC AGCTGATCTT CGAACCGTTC TTCGGGGGCG GCCTGGTCAC CGCGGCCGCC ATCCCGCTGA TCATCAGCCC GCAGGTGGGA GCGGTGGGCT TCCTGGCGAT CATGGCCGTC GTCCTGGCGG CCAGCCTCGC CACGGGCCTG TTCGTGCTCC GTCGCGGGTC GCGCGGGGCC GCAGCCACCG AACGGGAGGC CGAGGAGGCC GAGACGGTGT CCTGA
|
Protein sequence | MSPETVGLAL LLLGVLLLTA KLVRVRWKLT QRLYLPASII GGGVALLLGP DVFGRLMGWL ADRGVGGGFA ERAAEGGLFG TDVMAVWSAL PGLLISVVFA SLFLGKPLPR LREAASLAGP NLAFGVTMAS GQYVIGLVLA LLVLAPLFGV PVISGALIEI GFLGGHGTAA GLGDTFEQAG WAAGQDLALG MATVGLLSGI IVGIVLINWG VRRGKSNVIS DDARGSDTEQ AGLVEPGKRS AGSTMTVHPS SMDPMTLHFG LVALAVLIGQ LILMGLQWLE RTLWADTIEI MAYVPLFPLA MIGGILLQVF IDRFDRNDVV DHGTVERIQG LSLDVLIIAA MATLSLQAIA DNIAAFTILA VAGVLWSLCG FLFLAPRMMP SHWFERGIGE FGQSLGVTAT GLVLMRVVDP EMKSPAYPAF GYKQLIFEPF FGGGLVTAAA IPLIISPQVG AVGFLAIMAV VLAASLATGL FVLRRGSRGA AATEREAEEA ETVS
|
| |