Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2654 |
Symbol | |
ID | 9246505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3161870 |
End bp | 3163018 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transport system permease protein |
Protein accession | YP_003680577 |
Protein GI | 297561603 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00408489 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.22547 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAC TCACCGACCC TTCGCGGACC GGTCCGCCCG ACGCGGACGC TCTCCCCGAC GCCGACCCGC TGGCGGCGCG GGCGGCGGCG GGCGGCGCCG AGGTCCCTCC CGAACGCACA CCGGGCAGGC TGCGCTCGTC CCTCGTCGTC GCCGCCTTCG TCCTCGCGCT GTGCGTGACG ATCGTCGTCG CGGCCTTCGT CGGCACGGCG AACATCGGGG CGCTCGACGT GCTCGGGATC ATCCTGCGCA ACATCGGGCT GGGCGCCCTC GCCCCCGTCC CGGCGGCGCC GCCGCTCATC GACGCCCTCA TCTGGGAGTC GCGTCTGCCG CGCGTGCTCC TGGCCGCCGT GGTCGGTCTC GGGCTCTCGG TCTCGGGCGC GGTGCTGCAG TCGATCACCC GCAACCCGCT CGCCGAACCG TACCTGCTCG GGGTCTCCTC GGGGGCGTCC ACCGGGGCGG TCGCGATCAT GGTGCTCGGG CTCGGCTCGG GCGCGGTGAC CCTCTCCACG GGCGCCTTCG CCGGGGCGCT GGCCGCCTTC GCGATCGTGC TGGTCCTCAT CGGGGGCGGA CGCGTCTCCA ACCCCGCCCG CGTGGTGCTC ACCGGTGTGC TGGTGTCGCA GTTCTTCTCC GCGATCACCT CGCTCGTGCT GATGCTCGAC GGTGACGCGG ACGCCACCCG GGGCTTCACG TACTGGCTGC TCGGCTCGCT CGGAGGCGCA CGCTGGGAGC CGCTGCTCGT GGCGTCTGCC GTCATCGTGC TCGGCGCCGT CGGCTGCCTG TTCTTCGCCC CGGCCCTGGA CGCGTTCACC TTCGGCTGGG ACACCGCCTC CTCGCTCGGG ATCAACGTGA CCCTGGCACG GGTGACGCTC ATGGTCCTCA CCGCGCTCGT CACGGCGGCG GCCGTCGCGG CCTCCGGGGC GATCGGGTTC ATCGGGCTGC TCGTACCGCA CGTCGTGCGT CTGCTGGCGG GGCCCGCGCA CCGCCTGCTG CTCCCCCTCA GCGGGCTCGG GGGCGCGATT TTCCTGGTGT GGGTCGACAC CTTCGCCCGC TCGGCGTTCT CGCCGCACGA GATCCCGGTG GGAGTGATCA CGGCGCTGCT CGGCGCACCG GTGTTCGCGG TCGTCCTGGG AAGGGCGGCC CGGCAATGA
|
Protein sequence | MTRLTDPSRT GPPDADALPD ADPLAARAAA GGAEVPPERT PGRLRSSLVV AAFVLALCVT IVVAAFVGTA NIGALDVLGI ILRNIGLGAL APVPAAPPLI DALIWESRLP RVLLAAVVGL GLSVSGAVLQ SITRNPLAEP YLLGVSSGAS TGAVAIMVLG LGSGAVTLST GAFAGALAAF AIVLVLIGGG RVSNPARVVL TGVLVSQFFS AITSLVLMLD GDADATRGFT YWLLGSLGGA RWEPLLVASA VIVLGAVGCL FFAPALDAFT FGWDTASSLG INVTLARVTL MVLTALVTAA AVAASGAIGF IGLLVPHVVR LLAGPAHRLL LPLSGLGGAI FLVWVDTFAR SAFSPHEIPV GVITALLGAP VFAVVLGRAA RQ
|
| |