Gene Ndas_0543 details


Gene Information

Locus tag: Ndas_0543
Symbol:
ID: 9244384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 668808
End bp: 671594
Gene Length: 2787 bp
Protein Length: 928 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: Fibronectin type III domain protein
Protein accession: YP_003678496
Protein GI: 297559522
COG category:
COG ID:
TIGRFAM ID:
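
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Coordinates copied from the record above.
START_BP = 668808
END_BP = 671594


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2787 928
```

Both values agree with the Gene Length and Protein Length fields of the record.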


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTGGG ACGCACAGGA ACGCACACGC TACGAGGACG AGGTGCTGGT GGCCGCGCGC 
GCCCACGGGC TGCCCGCGGA CCTGTTCACG CGCTACGGTG TCGAACCCCG CCTGGAGCTG
CGGCTGTGCG CCGATCCGGC CGCGTTCACC GAGCACGTCA ACGAGGTGTG CGCCCACTGG
CGCGCGCTCC GGAGCAGCCG CCGCTCCCTG GGGGAGGTGC TGGACGAGCT CATCGCCGAG
CACGAGCGGC TGGAGGGCGA GGGCGCGCTC ACGCACGCCC ACTTCCGGCA GGTCCGCGCC
GACGAGGCCC GACGGGTCCT GAGCGCGTGG TCCGAGATCG CCGGGGGTCT GACCACCTCC
CTCCTGGACC GCGACACCCT CCGGTCGATG ATCGCGCCGG TGGGCGTCAC CGAGGCCGAC
GCCGAGCGCG TCCTGCTCGA ACACGACGTG CGCGTCGTGG ACCGGCTTCC CGAACTGGAC
GACGCCCCGT CCGAGGACAC CCTGCGCGCC CTGCGCGGAC ACCTGCGCGC CCGGGGCGTG
CCCTTCTCCC CCATGGAGGT CTTCGGGGAG CAGCGCCTGG CCGAGGGGTT CACCGTGCTG
GACGGCTTCC GGCTCCACGA CGGCACCGCC CTGGACAAGG CCGCCCTCGA CGAGGCGGTC
CAGCGCGTAC GGCTGGAACC GGAGTCCGAG GGCAGGGCGG CGGCCGAGAA CGTGCTGGAG
ATCCTGCGCG CCGAGGGCGA CCGGGCCGAC GCCCTCGTGC TGGGGGAGAT CGTCGGCGAC
CTGCGGGAGC GCCCGGAGTC CCTGAACGAG TCCGGGCTGG CCCGGCACTG GGTCAGCCGG
GGGCTGGCCG AGGAGGAGGC GCTGCTGCTG TCCGCGGCCG TGCGGCGGTC CGCCTCCGGC
GCCGACGCGC CCGCACGGGC GGAGCGCGAC GTGCACGACC TGCTCGCGGA CAACCGGCTG
CGCCAGGCGC AGGAGGCCGC CGAGGACCTG CCCGAGGACC ACGACCTGCA CGAGCGCCTG
CGCGCGCGGA TGCAGCAGGT GGAGGAGCTG ACCGCGCAGG CGGAGCGGGC GCTGCGCGCG
GGCGACCGGG AGGAGGCCGC CCGGGCCCTG GCCGCGGCGG CGGACGCCGC GGCCGACGAC
GAGGCGCTCG CCGCCCGGCT CGGCGAGGTG GCGACGCCGC CGCCGCGCGG TGTGGAGGCC
CGGGTCGACG GCCGGAACGT GGTGGTCTCC TGGCAGCCCG GCCCCGCTCC GGACGAGTCG
GTCACCTACC GGGTGACGCG GCGCACCGGA CGGGACGGGA GCGCCCGCGA GGCGCTGGTG
GGCGAGTCGT CGGGAAACGA GGTCACCGAC ACCGGAGCAC CCGTGGGAGC TGAGGCGCGC
TACGCGGTGG TGGCGGTGCG CGACGGCCTC GGGGTGTCGG AGGAGGCGTC GTCCGCGCCG
GTGATGATCG CGCCGGAGGT CTCCTCGCTG CGGGTGCGCG CCGGGGAGCG GTCGGTGTCC
GGCTCCTGGC AGGCCCCGGC GGAGGCGGTG CGCGTGGAGG TGCTGCGCGG TGTGGGGGCG
CCGCCGCGCG GCGCGGGCGA CGGCGTGCGC GTGGAGACGG ACGGGTCGGG GTTCTGCGAC
ACCGACGTCG AGATGGACGT GGAGTACCAC TACCGGATCC GCGCGGTCTA CGTGACCTCG
AACGGGCACG CGCGGGGGTC GGCGGGGCTC GTGCGCCGGG CCAGCCCGGG TCCGTGCCCG
GAAGCGGTGC GGGACCTGTC CGTGGTGCCG GACGGCGGCG CGGACTTCCG GGCCTCGTGG
ACACGGCCCT CCCGGGGGAG GGTCGTGCTG CGGGTCGGGG AGGAGCCGCC CGAGTGGCCT
CTGGGCACGG TGCTCGGCCC CGACGACCTG GAGTCCTACG GCCGTGAGGT GGCCCAGACC
CCGGTGGCCG ACGACGAGGG GTCCTCTGAG GGCAGGGAAG CCGACAGGAG CAGCGGAACG
GGTGTTCGCG CGGGCGCGGG CCGTGCCTCG GGGCCGCCCC CGGGGTCGGC GAACGGCCAC
GCGGACCCCG GGCGCGGGGG GACGCCCGGC GAGGAGAGGG AGGGCGTGCG CGTACTCGGG
CCCGGGGAGA GAAGCACGGG CCACACCCGC ACCGCCCCCG TCCAGGACGC CCGCGCCCCG
CACGCGCCGC GCGCCGGCGC CGCGGCGCCC GAGGGGGGCA GGGCCCGTCC CCGTGAGGGC
GCGTTCCGCG ACACGGGCCA CCCGCGCACC ATGCCATCCG AGGAGACAGG GGCCAGCCCC
CGTGAGGGCG CGTCCCACGT AACGGACCAG CCGAACACAG TGCCCTCGGA GGAGGGCGGG
ATCCGCTCCT CTGAGGGGGC GTTCCACGCC GCGGGCCACC CGAGCACAGC GCCCTCCGAG
GAGGACGGGC ACGGGGGGAC CGAGGGCATC CGCGTCCTGG GTCCGGGCGA GGGGAGCCCC
GCCCGGAGCC GGGACGCCGA CCACGGCACG GGTGCCCGAC GCGGAGGCTT CGCGGACGAG
GGGGGAGAGG GGGTCCGGAT CCTGGCCCCC GTTGAGCGGC CCCCGGGGCG GACCCGGACC
GTTCCCGGCC AGGACGCCCG CACACCGCAC GCGCCCTCCA GCGGTTCCGC GGCGCCCGGA
GGTACCGCGG GGGACCGTGC TCCCGCCCCC GCCGAGGGGC CCGGTGAGCG CATCCGCAGT
GTCGGAAGGC ACGCCAGGCT CTCCCCCGCC GCGGAGTCCC CGGCACCGGC GGCCACCGGG
GTGGGGGACG GTGCCCTCGG CACTTAG
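
The gene sequence above is printed in groups of ten bases, so the grouping has to be removed before any calculation. The following is a minimal sketch of how the sequence could be cleaned and its GC content estimated. The helper names are hypothetical, and only the first line of the sequence is used as demonstration input, so the printed percentage applies to that fragment and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a sequence printed in 10-letter groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short demonstration input.
first_line = "ATGCCCTGGG ACGCACAGGA ACGCACACGC TACGAGGACG AGGTGCTGGT GGCCGCGCGC"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}% GC")
```

Passing the full 2787 bp sequence to the same functions gives a value that can be compared with the GC content field of the record.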
 
Protein sequence
MPWDAQERTR YEDEVLVAAR AHGLPADLFT RYGVEPRLEL RLCADPAAFT EHVNEVCAHW 
RALRSSRRSL GEVLDELIAE HERLEGEGAL THAHFRQVRA DEARRVLSAW SEIAGGLTTS
LLDRDTLRSM IAPVGVTEAD AERVLLEHDV RVVDRLPELD DAPSEDTLRA LRGHLRARGV
PFSPMEVFGE QRLAEGFTVL DGFRLHDGTA LDKAALDEAV QRVRLEPESE GRAAAENVLE
ILRAEGDRAD ALVLGEIVGD LRERPESLNE SGLARHWVSR GLAEEEALLL SAAVRRSASG
ADAPARAERD VHDLLADNRL RQAQEAAEDL PEDHDLHERL RARMQQVEEL TAQAERALRA
GDREEAARAL AAAADAAADD EALAARLGEV ATPPPRGVEA RVDGRNVVVS WQPGPAPDES
VTYRVTRRTG RDGSAREALV GESSGNEVTD TGAPVGAEAR YAVVAVRDGL GVSEEASSAP
VMIAPEVSSL RVRAGERSVS GSWQAPAEAV RVEVLRGVGA PPRGAGDGVR VETDGSGFCD
TDVEMDVEYH YRIRAVYVTS NGHARGSAGL VRRASPGPCP EAVRDLSVVP DGGADFRASW
TRPSRGRVVL RVGEEPPEWP LGTVLGPDDL ESYGREVAQT PVADDEGSSE GREADRSSGT
GVRAGAGRAS GPPPGSANGH ADPGRGGTPG EEREGVRVLG PGERSTGHTR TAPVQDARAP
HAPRAGAAAP EGGRARPREG AFRDTGHPRT MPSEETGASP REGASHVTDQ PNTVPSEEGG
IRSSEGAFHA AGHPSTAPSE EDGHGGTEGI RVLGPGEGSP ARSRDADHGT GARRGGFADE
GGEGVRILAP VERPPGRTRT VPGQDARTPH APSSGSAAPG GTAGDRAPAP AEGPGERIRS
VGRHARLSPA AESPAPAATG VGDGALGT
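
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table for translation table 11 from the usual compact amino-acid string (codons ordered with bases T, C, A, G) and translates the first 30 bases of the gene. The function name is hypothetical. One caveat: table 11 allows alternative start codons, which are read as M at the first position; that case is not handled here because this gene starts with ATG.

```python
from itertools import product

# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGCCCTGGG ACGCACAGGA ACGCACACGC"))  # MPWDAQERTR
```

The result matches the first ten residues of the protein sequence listed above.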