Gene Ndas_4448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4448 
Symbol 
ID9248324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5286630 
End bp5288681 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682343 
Protein GI297563369 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCA ACGAGCTGGT GTACCGGGGC CGGGACGAGG ACACCGACAA GAAGCGCAAG 
GGCGCGACGT GGGGCGACTA CCTGCGGCCG TGGGCCGCCG CCCCGGCGCT CATCCCGGCC
GGGTTCCTCA CACACTGGAT GTGGGGCGAC CTCGGATGGG GCACCGGCCT GGCCGCCGCC
GCCATCGCGG CCGCGGGCGG GGTGGTCACC TACGCCGCGC ACCGCCTGAC CGGAGCGCGC
ACCTGGTACG CCCACCACAT CTCCACCGCC ATGACCGGCG GGGCCGGGGC GTGGCTGGCA
CTGGCGACCG CGTTCGGCCC CGGACGTCCG CTCATGGACG TGCTGCTCAT CGGCGGAGCG
GCCGGGGCCG CGATCGCGAA CGTGCACCTG TGGGCCCGCT CGCAGGGGGC CGGGGAGACC
GCGCCAAAGA AGGGCGCCGG CCGGATCCTG GCGACCTTCG AGGAGGTCGC GGCCAAGCTC
GACTTGCGCA ACATCAGGGC GCGCCTGCTC AGCGACACCG AGATGCAGCA GCGCCACGAG
CTGACCCTGG AGAACGGCGA GACCGCCGAG TCCTTGCAGA ACCGCGCGAA GGAGCTGGCG
TCCGCGTACG GGGTCGCCCC CGGCGCGATC CGGGTGATCG AGGACCAGGG CCGCGCCGAC
AAGGCGGAGC TGGTCATCAC CAAGCAGGAC GTCATGGGCA AGCTCATCCC GTGGCCGGGG
CTGAACCCGG CGCACGTGGG CACCTCGATC GCGGACCACC CGCTGCACCT GGGGACCTAC
GAGGACGGCG AGCCCTTCCA CAACCAGATC ACCAACCGGC ACTCGCTCAC GGTCGGCATG
GCCGGGGCCG GGAAGTCGGT CTACGGCAAG GTCAAGATGG TGCAGGTCGC CGCGAGGAAG
GACACCTTCA CGCTCGCCAT CGACCTGGCC AAAGGCCGCC AGACGCTCGG ACCGATCGAG
GGCGCGATCG GCTGGCCCGC CTACGACAAG AAGGCCGCCC GCGGCCAACT GGCCGCGATC
AAGCGCGCGA TCAAGGCCCG CGCGAACCAC CTGGCCGACC AGGGGAACTC GCAGTGGGTG
CCCGGGTGCG GGCTGACCTT CCTGCACATC CTGGTAGAGG AAGCCGCTGA GGTGGTCGAC
TTCGACGAGA TCGTGGAGGT GGCCCGGGTC TCCCGGTCGG TCGGCATGCA CCTGGACCTG
TCGTTGCAGC GGGCCACGTG GGGCAACCTC GACACGGACA CGCGCGCGAA CCTGGGTGAT
GGGCTGTGCT TCGGCGTCCG CGACTTCGCC GACGCCAGCT TCGTCCTGCC GGACTACGTC
ACCGACGCCG GGTGCGACCC CTCGCGGTGG CGCAAGTCCA AGCCGGGCGC CGCGTACGCC
GCGATCGAGC ACGTGGACCC CGACCGGCAC GTCGTGGCCG CCAAGATGTT CGGTCCGCCG
ACCACCGACC CCAAAGACGA GAACAAGGTC CTCACAGCCG AAGCCAACTC GCTCCCATCC
CAGGACGAGA AGCTGGACGC CATCACGCGG GCCGCGTTCG GTGCCGAGTA CGCCGAGTAC
CTCGCGACCC GCCCTGGTGG GTCCTCCACT CCGGCCGCGC CCGCGGTTCA GGTGCCTTCC
CCCGCCGTGA CGGTGGCCAC GACCACCGAC ATGGACGACA CCGTGACCGA GGAGGAGCTG
ATCGTGGACC CCGACATCGA GCCGGTCGTG CTCACCACCC CGGACGACGA CCCCGAGATC
CAGGGCGACA TCGACGCGGA GATCCCGCCC CTGGACCCTG AGGACGATTT CGTTCTGCCG
ACCCGCAAGA AGGGCACCAA GGCCAGCGCG GAGGAAGCGC GCGCTGCCCT GGAGGGCGTC
CTGGAGGGGT GGGGGCCGGG CCACCAGTTC ACGGTCTGGG ACGCCACGGT GGCCATGCAG
GAGCGGGGCG CGGAGCGCAA GAAGAGCTGG TTCTACAACA AGCTCACGGC CCTGGCGGAA
GACGGACGCC TGCGCCACGA GGATGACGGC TCCTGGACGA TCCTGGAGTC CCGCGAGTTG
ATCGACGCCT GA
 
Protein sequence
MARNELVYRG RDEDTDKKRK GATWGDYLRP WAAAPALIPA GFLTHWMWGD LGWGTGLAAA 
AIAAAGGVVT YAAHRLTGAR TWYAHHISTA MTGGAGAWLA LATAFGPGRP LMDVLLIGGA
AGAAIANVHL WARSQGAGET APKKGAGRIL ATFEEVAAKL DLRNIRARLL SDTEMQQRHE
LTLENGETAE SLQNRAKELA SAYGVAPGAI RVIEDQGRAD KAELVITKQD VMGKLIPWPG
LNPAHVGTSI ADHPLHLGTY EDGEPFHNQI TNRHSLTVGM AGAGKSVYGK VKMVQVAARK
DTFTLAIDLA KGRQTLGPIE GAIGWPAYDK KAARGQLAAI KRAIKARANH LADQGNSQWV
PGCGLTFLHI LVEEAAEVVD FDEIVEVARV SRSVGMHLDL SLQRATWGNL DTDTRANLGD
GLCFGVRDFA DASFVLPDYV TDAGCDPSRW RKSKPGAAYA AIEHVDPDRH VVAAKMFGPP
TTDPKDENKV LTAEANSLPS QDEKLDAITR AAFGAEYAEY LATRPGGSST PAAPAVQVPS
PAVTVATTTD MDDTVTEEEL IVDPDIEPVV LTTPDDDPEI QGDIDAEIPP LDPEDDFVLP
TRKKGTKASA EEARAALEGV LEGWGPGHQF TVWDATVAMQ ERGAERKKSW FYNKLTALAE
DGRLRHEDDG SWTILESREL IDA