Gene Ndas_1977 details


Gene Information

Locus tag: Ndas_1977
Symbol:
ID: 9245827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2400179
End bp: 2401360
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: monooxygenase FAD-binding protein
Protein accession: YP_003679910
Protein GI: 297560936
COG category:
COG ID:
TIGRFAM ID:
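
The reported gene and protein lengths can be checked against the start and end coordinates. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2400179
end_bp = 2401360

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1182 393
```

Under these assumptions the result agrees with the listed 1182 bp and 393 aa.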


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000420901
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000044053
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGAGAGCGA TCGTCTGCGG AGCCGGAATC GCCGGACTGG CCGCGGCCCA CCGCCTGCAC 
ACCCACGGCT GGCAGGTCAC CGTCCTCGAA CAGGCACCCG GCCCCCGCAC CCAGGGCTAC
ATGATCGACT TCTTCGGCCC CGGCTACCGG GCCGCCGAAC TCATGGGCCT ACTGCCCCGC
CTGCGCGAAC TCGGCTACGA CATCGACCAC GCCACCTTCG TCGACCGCGA CGGCGCCCCC
CGCGCCACGC TGGGCATCCA ACAGTTCTCC CACCTGGGCG TGGTCTCCCT CATGCGCCCC
GACCTCGAAC GCGCCCTGCG CGAAACCCTG CCCCCCGACG TGCCGGTGCG CTACGACGCG
GCCCTGCACC GCGTCCAGGA CCACGGCGAC ACCGTCGAGG TCACCCTCGC GGACGGCCGC
ACCCTGACCG GCGACCTGCT CGTGGGCGCC GACGGCATCC ACTCCACCGT GCGCGCCCAG
GTCTTCGGCT CCGCGACCGA CCACCTGCGC TACCTGGGCT TCCACACCGC AGCCTTCGTC
TTCACCGACC CCGACATCCA CGCCCGGATC GACGGCGGCT TCTACCTCAC CGACACCACC
GGCGCCCAGA TGGGCCTGTA CGGGCTGCGC GACGGCCGCG TAGCCGCCTT CACCGTGCAC
CGCAGCCCCA CCCCCGACCG GCCCGCCGAC CCGCGCGCCG AACTGCGCCG CACCCACGCC
GACCTGGGCT GGCTGGTGCC CCGCGCCCTC CAGCACTGCC CCGACCCCGA ACACGTCTAC
TACGACCAGG TCTCCCAGAG CGTGGTGCCC CGCTGGAGCC GGGGACGCAC CGTCCTGCTC
GGCGACGCCT GCGGCGCGGT CTCCCTGCTC GCGGGCCAGG GCGCCTCCCT GGCGGTGGGC
GGCGCGTTCG TCCTGGCCGA ACACCTGGCC TCGGCCCCCA CCGTCGAGGA GGGACTGAGC
CGCTACGAGC ACCAGTGGCG CCCCGAGGTC GAACAGCGCC AGCGCGCCGC CCGCCGCACC
GCCCGCTGGT TCCTGCCCCC CACCCGCGCC CAACTGCTCC TGCGCCGCCT GGCCCTGCGC
CTGACCCGCC TGCCCGGGGC GAGCTGCCTG CTGGCGGGGG CGCTCACCGG CAAGCCCACC
CCCGCCGTAC ACGAGCACAC TCCCTTGACC GTCCGCGACT GA
 
Protein sequence
MRAIVCGAGI AGLAAAHRLH THGWQVTVLE QAPGPRTQGY MIDFFGPGYR AAELMGLLPR 
LRELGYDIDH ATFVDRDGAP RATLGIQQFS HLGVVSLMRP DLERALRETL PPDVPVRYDA
ALHRVQDHGD TVEVTLADGR TLTGDLLVGA DGIHSTVRAQ VFGSATDHLR YLGFHTAAFV
FTDPDIHARI DGGFYLTDTT GAQMGLYGLR DGRVAAFTVH RSPTPDRPAD PRAELRRTHA
DLGWLVPRAL QHCPDPEHVY YDQVSQSVVP RWSRGRTVLL GDACGAVSLL AGQGASLAVG
GAFVLAEHLA SAPTVEEGLS RYEHQWRPEV EQRQRAARRT ARWFLPPTRA QLLLRRLALR
LTRLPGASCL LAGALTGKPT PAVHEHTPLT VRD
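
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. The sketch below is illustrative only: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and it assumes the sequence is given on the coding strand and begins with an ATG start codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard codon-to-amino-acid mapping is used here.

```python
# Codon table in the conventional TCAG ordering. Translation table 11
# shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGAGCGA TCGTCTGCGG AGCCGGAATC"))  # MRAIVCGAGI
```

Passing the full gene sequence to `translate` should give the listed protein sequence with the spaces removed, and `gc_fraction` on the same input can be compared with the reported GC content.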