Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2188 |
Symbol | |
ID | 9246038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2614894 |
End bp | 2616369 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | amino acid permease-associated region |
Protein accession | YP_003680116 |
Protein GI | 297561142 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000662916 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000260404 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAAGCA GTGCGACGCC GCCCCCGAGC GCGGCCAACC AAACGCAGAC CCCCGGTGCC GGACTGGCCC ACTCGCTCCG GCGCCGCCAC ATGGCGCTGA TCGCGATCGG CGGTTCGGTG GGCGCCGGGC TCTTCATCGG CTCCGGGTCG GTCATCCAGA TGGCGGGCCC GGCGGCGGTG CTCTCCTACG TCCTGGCGGG GGCGCTGGTG TTCTTCACCC TGCGCGCCCT GGGTGAGATG GTGGTGGCGG TCCCGGCCGG GGGCTCCTTC TCCGACTACG CCCGCCTGGC CTTCGGTCCC CGTGCCGGTT TCACCATCGG GTGGGTGTAC TGGTGGATGT ACGCGGTGCT GGTGGCCGCG GAGTCGGTGG CCGGTGCGGC GATCCTGGGC CAGTGGGTGC CGGGGGTGCC CGGGTGGGCG CTGGCGCTGC TGGTGCTGTT GTCGATGACG GTGGCCAACC TGGTGTCGGT GCGGGTGTTC GCCGAGACCG AGTCGTTCTT CTCGTTGGTG AAGGTGGCCA CGATCGTGGC GTTCCTGTTG ATCGGCGGCC TGTGGGCGGT GGGTCTGTGG AGCGGCGCGG ACGGCTCCAG TGTGGCCAAC CTGTGGGAGC ACGGCGGGGT GGCGCCCAAC GGGTGGGTGG CGGTGCTGGC CGCCACGGTG GTGGTGCTGT TCGCCTTCGG CGGGGTGGAG ATCATCACCG TGGCCGCGGG CGAGAGCTCC GAGCCCGAGC GCGGTGTGGC CTCGGCGGTG ACCAACGTGC TCTGGCGGAT CGGGCTGTTC TATGTGGCCT CGATCGTGGT GGTGGTGATG GTGCTGCCGT GGAACAGCGT GGATCCGGGG CGCAGCCCGT TCGTGGCGGT GATGGAGCAC GTGGGTGTGC CCGGGGCGGC GCTGATCATG GAGATCGTGG TCCTCATCGC GGTGCTGAGC GTGTTGAACG CGGCGATGTA CACCTCCTCG CGGATGCTGT TCACGCTGAC CCGGCAGGGG GACGCCCCGC GGGTGCTGCG CGGGACCAAC CGGCGGGGTG TGCCGGTGCG GGCGATCCTG CTCGGCACCG TGGTGGGGTA CGGGGCGGCG GTGGCCGACT ACCTGTGGCC CGACCGGGTG TTCCCGTTCC TGGTGGCCTC CATCGGCGCG ATCCTGCTGG TGCTGTTCCT GACCATCTGC GCCTCGCAGC TGGTGGTGGG GGCGCGGGTG CGTCGGCGTG AGCCGCAGCG GCTGACCCTG CGGATGTGGG CGTTCCCGTA CCTGACGTGG GTGGTGCTGG GGGGTCTGGT GACGATCTTC GTGGCGATGG TGGTCATCCC CGACCAGCGT CAGGCGCTGC TGGCGTCGGT GGGGTCGGTG GTGGTGGCGC TGGTGGCCTA TGAGTTCCGG CGCCGGTGGG GGCGTACGCC GCCCACGGAC CGGGTGCTGG CGGTGCCCGA CCGGCCCGAC GCGGAGGCGC TGCGCCGTCA CCCGGGCGCC GACTAG
|
Protein sequence | MTSSATPPPS AANQTQTPGA GLAHSLRRRH MALIAIGGSV GAGLFIGSGS VIQMAGPAAV LSYVLAGALV FFTLRALGEM VVAVPAGGSF SDYARLAFGP RAGFTIGWVY WWMYAVLVAA ESVAGAAILG QWVPGVPGWA LALLVLLSMT VANLVSVRVF AETESFFSLV KVATIVAFLL IGGLWAVGLW SGADGSSVAN LWEHGGVAPN GWVAVLAATV VVLFAFGGVE IITVAAGESS EPERGVASAV TNVLWRIGLF YVASIVVVVM VLPWNSVDPG RSPFVAVMEH VGVPGAALIM EIVVLIAVLS VLNAAMYTSS RMLFTLTRQG DAPRVLRGTN RRGVPVRAIL LGTVVGYGAA VADYLWPDRV FPFLVASIGA ILLVLFLTIC ASQLVVGARV RRREPQRLTL RMWAFPYLTW VVLGGLVTIF VAMVVIPDQR QALLASVGSV VVALVAYEFR RRWGRTPPTD RVLAVPDRPD AEALRRHPGA D
|
| |