Gene Information |
Locus tag | Ndas_4424 |
Symbol | |
ID | 9248299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5265163 |
End bp | 5266656 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | amino acid carrier protein |
Protein accession | YP_003682319 |
Protein GI | 297563345 |
COG category | |
COG ID | |
TIGRFAM ID | |
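The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 5265163
end_bp = 5266656

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1494 497
```

Under these assumptions the computed values agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields.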
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.107897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.578311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGCCTG CCGACCAAGC GCCAACCCTG GACGCGCGTC TGACAGACGC CATCGCGACG TTCAACAACG ACTTCTTCTG GTCCTGGATC CTGATCCCGC TCCTCGTGGT GGTGAGCCTC TACCTCACCG TCCGGACGGG CGCCGTCCAG ATCCGGATGC TCCCGGAGAT GTTCCGGGTG CTCAAGAGCA AGCCCGAGAC CGCTCCTGAC GGGGGCAAGG CGGTCTCCTC CATCCAGGCG TTCATGATCT CGGCCGCGGC CCGCATCGGC ACGGGCAACA TCGTCGGCGT GGCCGTCGCC ATCTCCCTGG GCGGCCCGGG CGCCGTCTTC TGGATGTGGA CGATGGCGAT CGTGGTCTGC GCGGCCAGCT TCGTCGAGTC CACCCTGGCC CAGCTCTACA AGGTCAGGGA CTCCACCGGC TACCGGGGCG GACCGGCCTA CTACATGGAG AAGGGCCTGG GCCAGCGCTG GATGGGCGTG CTCTTCGCCA TCGTCCTCAT TCTGACCTTC CCGATGGTCT TCAACGCCGT GCAGAGCAAC ACCATCGCCG GGGCCGTCAC CAACTCCGCC CAGAACATGA ACGTGGACGC GGGCCTGGGG CTGTCGGCCG CGGTCGGGGT CGTGCTCGTG GCGGTCACCG CGATGGTGAT CTTCGGCGGT GTGCGCAGGA TCGCCAACAC CGCGCAGGCG CTGATCCCGG CCTTCGCGCT GATCTACCTG ATCATGGGCG TCATCATCGT GGGGATGAAC TTCGAGCGCG TCCCCGGGAT GTTCGCGCTC ATCTTCGAGC ACGCCTTCGG CATCCGCGAG TTCGCCGCCG CCGGTCTGGG CACGGTGATC ATCCAGGGCG TGCGCCGGGG CATGTTCTCC AACGAGGCGG GCCTGGGATC GGCTCCCAAC GCCGCGGCCA GCGCCGCCGT CACCCACCCC GCCAAGCAGG GCCTGGTGCA GACCCTCGGC GTCTACTTCG ACACCCTCGT GGTCTGCTCC ATCACGGCCT TCATCATCCT CATCGGCTAC GAGGGCGGCT CCGAGCGCGA GCTGGAGGGC GACCTCACCC AGATCGCGGT CACCGAGGCC CTGGGCCCCT GGGCGCTGCA CCTGATGACG CTGATCATCC TCCTGGTCGC GTTCACGTCG GTGCTGGGCA ACTACTACTA CGGCGAATCG AACCTGGAGT ACCTGACCGC CGACCGGCGC GTCATGCTCG GCTACAAGAT CGTCTTCCTC GTGGCCAGCT TCCTCGGCTC GCTGGGCTCG ATCGACCTGG TCTGGACGCT GGCGGACACG ACCATGGGCA TGATGGCGCT GATCAACCTC GTGGCCATCA CACCGCTGGC CGCGATCGCG GCACGGGTGC TCAAGGACTA CAACGACCAG CGGCGCCAGG GCATCGACCC CGTGTTCACG CGGGACCGGC TGCCCGACCT GCGCGGAGTG GAGTGCTGGG AGCCCAGGGA GAGGGAGGCT GCGGCCGAGA CCGACAAGGT CTGA
|
Protein sequence | MLPADQAPTL DARLTDAIAT FNNDFFWSWI LIPLLVVVSL YLTVRTGAVQ IRMLPEMFRV LKSKPETAPD GGKAVSSIQA FMISAAARIG TGNIVGVAVA ISLGGPGAVF WMWTMAIVVC AASFVESTLA QLYKVRDSTG YRGGPAYYME KGLGQRWMGV LFAIVLILTF PMVFNAVQSN TIAGAVTNSA QNMNVDAGLG LSAAVGVVLV AVTAMVIFGG VRRIANTAQA LIPAFALIYL IMGVIIVGMN FERVPGMFAL IFEHAFGIRE FAAAGLGTVI IQGVRRGMFS NEAGLGSAPN AAASAAVTHP AKQGLVQTLG VYFDTLVVCS ITAFIILIGY EGGSERELEG DLTQIAVTEA LGPWALHLMT LIILLVAFTS VLGNYYYGES NLEYLTADRR VMLGYKIVFL VASFLGSLGS IDLVWTLADT TMGMMALINL VAITPLAAIA ARVLKDYNDQ RRQGIDPVFT RDRLPDLRGV ECWEPREREA AAETDKV
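The relationship between the two sequences can be sketched in code. The example below is a minimal illustration, not the database's own pipeline: it builds the amino acid assignments of translation table 11 (which match the standard code), assumes the first codon is a valid initiator and therefore reads it as M (the gene starts with GTG), and drops a single trailing stop codon. The function names `translate_cds` and `gc_percent` are hypothetical helpers introduced here.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS given in the coding direction.

    Assumption: the first codon is an initiator and is translated as M,
    whatever amino acid it would encode internally (e.g. GTG, normally V).
    """
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if not codons:
        return ""
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop one trailing stop codon, if present.
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-letter blocks of the gene sequence listed above.
head = "GTGTTGCCTG CCGACCAAGC GCCAACCCTG"
print(translate_cds(head))  # MLPADQAPTL
```

The ten residues obtained from the first 30 bases match the start of the listed protein sequence, which suggests the gene sequence is already given in the coding direction even though the gene lies on the minus strand. Passing the full gene sequence to `gc_percent` would give the figure to compare against the GC content field.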
|
| |