Gene Ndas_5537 details

Gene Information

Locus tag: Ndas_5537
Symbol:
ID: 9249440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 731025
End bp: 733259
Gene Length: 2235 bp
Protein Length: 744 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: Fibronectin type III domain protein
Protein accession: YP_003683422
Protein GI: 297564449
COG category:
COG ID:
TIGRFAM ID:
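
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene information above.
start_bp = 731025
end_bp = 733259

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2235
print(protein_length)  # 744
```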


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.320231
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGACC AGCCCACCGC ACCCCCGACG GATCCCGCGC CGAAGCCCCG CCCCTCCCCC 
GTGCGGCGCG CGGCCTCGGC GCTCTGGCGC CGCACCCGCT CCAGCGCCCC GGGCCTGATG
ATCTCGCTCC TGGCCGCGGG CCTGCTCAGC ACCGCGCTCG GCGCCGGGGC CATGGGCCGT
GCGGACGAGA TGTCCGACGG CGCCGTGTGG CTCTGGGACA GCCCCGCGGG CGAGAGCTTC
CGCGTCAACG GCGACAACGC GCGGATCGAC CTCGTCGCCG CGCTGCCCGG CTCCGCCGGA
CGCCCCGTCC AGGTCACCCA GAACGACGAC TACCTCCTGC TCCACGACCC CGAGACCGGC
CGGGTGACTT CGGTGGACCT GCGGGAGATG GGGTTCTCCG GTGTGCTCGA ACTCGGCACC
GGCGGCGACT TCGGCCTCGC GCTGGGCGAG GAGGCCGCGG TGGTCATCGA CCGCGCCAGC
GGTGAGGTCA AGGCGGTGGA CCCCGCGACC CTCCAGCCGA CCGGACCGTC CCTGAGGATC
CCGGCCCCGC TGGTCGGCGG GGCCTTCGAC GACTCCGACA CCCTCTGGCT GGGGGTGCCC
ACCCAGGGCA CGGTCGTCGG AATCCGGGTC GAGGCCGAGG AGGCCGTCAT CACGCAGACC
GCGTCGGTGG CCGACCCCGG CGCCGACATC GCCGTCACGG TCCTCGACGA CGGCGTGCTC
GCGGTCGACC GGAACGGCGA CCGCATGGTC GCCGTCCGCA ACGGTGGCGA GTCCCGGACC
ATCACCTCTC CCGTCCCGCT GGAGGGCGCC GAGGTACCGC CGCGCACCCG GGGCGACCTG
GCCGCCGTGA CGCTGCCCGG CTCCGGCGAC GTCGTCACCG TGTCCGACCC CACCGGGTCG
GCGGGCGTGG ACCACTTCTC CACCGGCCGT GAGGGCGGCG GCACCGCCGT CCCCTACGAG
GGCCGCTTCT ACGTGCCCTT CCCCGAGGAG GGCGCGGTGC GCGTCTTCGG CCCCTCCGGG
GACGAGCTCA ACCCCATCAC CCTGCCCGGT GCCGAGGGGC CGCTGGAGCT GGAGGCGCGC
GAGGGCAGCC TGTACATCAA CTCACCCGAC ACCGGGGTGG CGGCGGTCGT CGACCCCGGG
GGCCGGGCCA CCGTCATCGA CAAGACCGCC CCGCCGCCCG GCCCCGGCGA GACCGACGAG
GACGAGCCCG CGCCCCGGGA ACCGGCCCCG GACGGGACCA CCGCGCCCGA GCCCGGTGAC
GCCCCCGTGC CCGACGCGGG TGCGCCCGAC GCCGGTGACA CCACCGCCCC GGAGTCCGGC
GGCACCGAGG GCGGCGCCGC CCCCAGGGCC CCGGCCGTGG AGAGCCCCCG GGACGACGGC
GAGGAGGAGG ACGAGGGCAC GGCGCCGGGC GCTCCGACAC CCGTCTCCTT CACCGCCGGG
GACGGATCGG TCACGCTGTC CTGGCCGGAG GCCTACTCCC CGGACTCCCC CGTGGAGACC
TACGACATCA CGTGGCAGGG CGGCAGCACG ACCGTCGACG GCTCGGAGCT GGAGGCCACG
ATCACCGGAC TGGAGAACGG CACCTCCTAC CGCTTCCGGG TGCGGGCATC CAACGCCTTC
GGCACCGGTC CCGCGGCGCA GACCGAGGAG GTCACTCCCA GCCCCCGGGC CCCCGGCGCG
CCGAGCGGCG TCGCCGTCGC CGCGGCGGGA AGCGACAGCG TGACCGTGTC CTGGGAGGCC
GCCGAGGGGG CCGCGGACTA CCTCGTCTCC GCCTCCTCCG ACTCCGACCC GGTGAGCGAC
CGCACGTCCA CGGGCACGTC GGTGGAGGTC GCCGGGCTGG CGCCGGGCGG CACCTACACC
TTCACCGTGA CGGCGCGCGG CGCGGGGGGC GTCAGCGGCG AGTCCGCCAC GAGCGCCCCG
TTCACCATGC CCACCCAGGA GATCGGAGCG CCCGCCGGCG TCTCCTTCAG CGCTTCCGGA
GACACGGTGA CGGTCACCTG GTCACAGGTG GAGGGGGCGA CCCAGTACAC CATCACGCCG
CACGGCGACG GCTCCAGGTC GCTGAGCGAG GTGAGCACGC CCGGCACCGC GGCCGGGGGC
GGACAGCTCT CCTACACCTA CCAGCCGCGC GGCTCGGGGC GCTGCTACTC CTTCACCGTG
CTGGCGGCGT CCGAGAGCGC CACCGCCGAC AGCGGCACGA CCAGTACCGC GAGCTGCTCA
CGGGAGTTCA GATGA
 
Protein sequence
MVDQPTAPPT DPAPKPRPSP VRRAASALWR RTRSSAPGLM ISLLAAGLLS TALGAGAMGR 
ADEMSDGAVW LWDSPAGESF RVNGDNARID LVAALPGSAG RPVQVTQNDD YLLLHDPETG
RVTSVDLREM GFSGVLELGT GGDFGLALGE EAAVVIDRAS GEVKAVDPAT LQPTGPSLRI
PAPLVGGAFD DSDTLWLGVP TQGTVVGIRV EAEEAVITQT ASVADPGADI AVTVLDDGVL
AVDRNGDRMV AVRNGGESRT ITSPVPLEGA EVPPRTRGDL AAVTLPGSGD VVTVSDPTGS
AGVDHFSTGR EGGGTAVPYE GRFYVPFPEE GAVRVFGPSG DELNPITLPG AEGPLELEAR
EGSLYINSPD TGVAAVVDPG GRATVIDKTA PPPGPGETDE DEPAPREPAP DGTTAPEPGD
APVPDAGAPD AGDTTAPESG GTEGGAAPRA PAVESPRDDG EEEDEGTAPG APTPVSFTAG
DGSVTLSWPE AYSPDSPVET YDITWQGGST TVDGSELEAT ITGLENGTSY RFRVRASNAF
GTGPAAQTEE VTPSPRAPGA PSGVAVAAAG SDSVTVSWEA AEGAADYLVS ASSDSDPVSD
RTSTGTSVEV AGLAPGGTYT FTVTARGAGG VSGESATSAP FTMPTQEIGA PAGVSFSASG
DTVTVTWSQV EGATQYTITP HGDGSRSLSE VSTPGTAAGG GQLSYTYQPR GSGRCYSFTV
LAASESATAD SGTTSTASCS REFR
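
The correspondence between the gene and protein sequences can be spot-checked with a minimal sketch. It translates the first 60 nucleotides of the gene sequence and compares the result with the first 20 residues of the protein sequence. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is written out by hand under the assumption that translation table 11 assigns sense codons the same amino acids as the standard code.

```python
# First 60 nt of the gene sequence and first 20 aa of the protein sequence,
# copied from the record (spaces separate blocks of 10).
gene_head = "ATGGTGGACC AGCCCACCGC ACCCCCGACG GATCCCGCGC CGAAGCCCCG CCCCTCCCCC"
protein_head = "MVDQPTAPPT DPAPKPRPSP"

def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()

# Codon table in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(nt):
    """Translate complete codons in frame 1; a trailing partial codon is ignored."""
    nt = clean(nt)
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(nt):
    """Fraction of G and C bases in a nucleotide string."""
    nt = clean(nt)
    return (nt.count("G") + nt.count("C")) / len(nt)

print(translate(gene_head) == clean(protein_head))  # True
print(translate("TTCAGATGA"))                       # FR* (last three codons of the gene)
```

Pasting the full gene sequence into `gc_fraction` would give the GC content of the whole gene for comparison with the value reported in the record.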