Gene Information |
Locus tag | Ndas_1977 |
Symbol | |
ID | 9245827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2400179 |
End bp | 2401360 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | monooxygenase FAD-binding protein |
Protein accession | YP_003679910 |
Protein GI | 297560936 |
COG category | |
COG ID | |
TIGRFAM ID | |
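The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2400179, 2401360)
    print(length)                  # 1182
    print(protein_length(length))  # 393
```

Both values agree with the Gene Length and Protein Length fields listed in the record.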
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000420901 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000044053 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAGAGCGA TCGTCTGCGG AGCCGGAATC GCCGGACTGG CCGCGGCCCA CCGCCTGCAC ACCCACGGCT GGCAGGTCAC CGTCCTCGAA CAGGCACCCG GCCCCCGCAC CCAGGGCTAC ATGATCGACT TCTTCGGCCC CGGCTACCGG GCCGCCGAAC TCATGGGCCT ACTGCCCCGC CTGCGCGAAC TCGGCTACGA CATCGACCAC GCCACCTTCG TCGACCGCGA CGGCGCCCCC CGCGCCACGC TGGGCATCCA ACAGTTCTCC CACCTGGGCG TGGTCTCCCT CATGCGCCCC GACCTCGAAC GCGCCCTGCG CGAAACCCTG CCCCCCGACG TGCCGGTGCG CTACGACGCG GCCCTGCACC GCGTCCAGGA CCACGGCGAC ACCGTCGAGG TCACCCTCGC GGACGGCCGC ACCCTGACCG GCGACCTGCT CGTGGGCGCC GACGGCATCC ACTCCACCGT GCGCGCCCAG GTCTTCGGCT CCGCGACCGA CCACCTGCGC TACCTGGGCT TCCACACCGC AGCCTTCGTC TTCACCGACC CCGACATCCA CGCCCGGATC GACGGCGGCT TCTACCTCAC CGACACCACC GGCGCCCAGA TGGGCCTGTA CGGGCTGCGC GACGGCCGCG TAGCCGCCTT CACCGTGCAC CGCAGCCCCA CCCCCGACCG GCCCGCCGAC CCGCGCGCCG AACTGCGCCG CACCCACGCC GACCTGGGCT GGCTGGTGCC CCGCGCCCTC CAGCACTGCC CCGACCCCGA ACACGTCTAC TACGACCAGG TCTCCCAGAG CGTGGTGCCC CGCTGGAGCC GGGGACGCAC CGTCCTGCTC GGCGACGCCT GCGGCGCGGT CTCCCTGCTC GCGGGCCAGG GCGCCTCCCT GGCGGTGGGC GGCGCGTTCG TCCTGGCCGA ACACCTGGCC TCGGCCCCCA CCGTCGAGGA GGGACTGAGC CGCTACGAGC ACCAGTGGCG CCCCGAGGTC GAACAGCGCC AGCGCGCCGC CCGCCGCACC GCCCGCTGGT TCCTGCCCCC CACCCGCGCC CAACTGCTCC TGCGCCGCCT GGCCCTGCGC CTGACCCGCC TGCCCGGGGC GAGCTGCCTG CTGGCGGGGG CGCTCACCGG CAAGCCCACC CCCGCCGTAC ACGAGCACAC TCCCTTGACC GTCCGCGACT GA
|
Protein sequence | MRAIVCGAGI AGLAAAHRLH THGWQVTVLE QAPGPRTQGY MIDFFGPGYR AAELMGLLPR LRELGYDIDH ATFVDRDGAP RATLGIQQFS HLGVVSLMRP DLERALRETL PPDVPVRYDA ALHRVQDHGD TVEVTLADGR TLTGDLLVGA DGIHSTVRAQ VFGSATDHLR YLGFHTAAFV FTDPDIHARI DGGFYLTDTT GAQMGLYGLR DGRVAAFTVH RSPTPDRPAD PRAELRRTHA DLGWLVPRAL QHCPDPEHVY YDQVSQSVVP RWSRGRTVLL GDACGAVSLL AGQGASLAVG GAFVLAEHLA SAPTVEEGLS RYEHQWRPEV EQRQRAARRT ARWFLPPTRA QLLLRRLALR LTRLPGASCL LAGALTGKPT PAVHEHTPLT VRD
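The gene and protein sequences above can be compared programmatically. The following is a minimal Python sketch that uses only the standard library. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA), and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names (clean, gc_percent, translate) are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases of the gene sequence listed above.
    print(translate("ATGAGAGCGA TCGTCTGCGG A"))  # MRAIVCG
```

Passing the full gene sequence to translate and comparing the result with the cleaned protein sequence would confirm that the two fields agree. Calling gc_percent on the full gene sequence gives a value to compare against the GC content field.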