Gene Ndas_2892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2892 
Symbol 
ID9246743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3453646 
End bp3455103 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content77% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680809 
Protein GI297561835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.499146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGAGC TTCGCCTCGT GGCCGTGAGC GAGGACGGCA CCTACCTGGT GCTCGCCAGC 
GCCGGCCGTG GAACCCGTTT CACGCTGCCC GTCGACGACC GCCTCCGCGC CGCAGTACGC
GGCCAGTTCT CCCGGCTCGG CCAGTACGAG ATCGAAGTGG AGAATCCGTT GCGCCCCAAG
GAAATCCAGG CCCGTATCCG GTCCGGCGAG ACCGCGGAGG CCATCTCCGA GATCTCCGGT
ATCCCCATCG AGAGGGTCCG CTGGTTCGAG GGCCCCGTCC TCCAGGAACG CGAGTACATC
GCCCAGCAGG CCCAGCGCGC GAGCGTGCGC GCGCACGGCG ACGCCGCCCC GGGCCCCTCC
CTGGAGGAGT TGGTCACCAA GCGGATCGGC GCGCACCAGT TGGAGACCGG TGACGCGGCG
TGGGACTCGT GGAAGCGCGA GGACCGGTCC TGGCAGCTCA AGCTCGTGTT CCTGCACGGC
GGCGAGGAGC GCGTCGCCCA CTGGCTGTAC GAGCCCAGAC ACAACAGCGT GGCCCCCGCC
GACGAGGAGG CCGCCCGGTT CTCCTCACCC GACCCCGAAC CCGTCTCCTC CCCCGGCGCG
ACCGTGACGC CCTTCGCCCC GCGCCGCACC GAGACCGCCT CGGACCTGCC CCGGTCCGAG
TCCCAGCCGT CCTCCGAGCG CGAACGCCCC GCTCCCCTGC GCCCGTCCTC CGCCGCGCAC
CGGGACGAGC CGCCGCGCCC GGCGCCGCCC GCGCGTCCGG CCGCCGCCGA GCGCCACCCC
GCCGAACGCC GTCCCGTCGA GCGGGAGCGG GCTCCGGAGC GGCCCGCCGA GCGCGAGCGG
GCCACCGGAC GCCAGCCACT GGAGTGGGAC CGCTACCCGG AGCCGCAGGC GGAGCGGTAC
GAGCGGCCGC GCAGGGAACA GCCCGTCGAG CGCCCCGCCG CGCGGGCGGA ACCCGAGCGG
TGGCCCGCGT ACGAGCGCGG GGCCGAGCCC GAGCGGCGGA CGCGGAGCGA ACGCTGGTCC
GAGTCCGAGC GCTGGCCCGA GACCGAGCGC TGGCCCGAGA CCGCCGCCCG CGACGAGCCC
CCGGCACAGG AGCGCCACCG CCCCGAGCGG CGGACCGAAC CGGCCGCCTA CGACGAGCCG
GCGGTCGAGG AACAGCGCCG CCCCGAACGC CGGGCGGACT GGCCGGAGGC CCGTCCCGCC
AGGCCGGTCG AGCCCGAGGC CGCCCCGCCC GCCGCGGAGG GCCCCGCCCA GCAGCGTCCG
GCCCCGCGCC GCCGCCGCGC CACCATCGAC CTCAACGACA ACGGCGTCCC CCGCCGCGCC
GCCGAGCAGC CGCCGGCGGC CGCGGTCCCG GCCGCCGCCA ACAGCGGCCC TTCCGCGGCC
AAGCGCAAGG GACGCGGCCG ACGCGCCTCG GTCCCGTCGT GGGACGAGAT CATGTTCGGC
TCCAAGAAGG ACGAGTAG
 
Protein sequence
MHELRLVAVS EDGTYLVLAS AGRGTRFTLP VDDRLRAAVR GQFSRLGQYE IEVENPLRPK 
EIQARIRSGE TAEAISEISG IPIERVRWFE GPVLQEREYI AQQAQRASVR AHGDAAPGPS
LEELVTKRIG AHQLETGDAA WDSWKREDRS WQLKLVFLHG GEERVAHWLY EPRHNSVAPA
DEEAARFSSP DPEPVSSPGA TVTPFAPRRT ETASDLPRSE SQPSSERERP APLRPSSAAH
RDEPPRPAPP ARPAAAERHP AERRPVERER APERPAERER ATGRQPLEWD RYPEPQAERY
ERPRREQPVE RPAARAEPER WPAYERGAEP ERRTRSERWS ESERWPETER WPETAARDEP
PAQERHRPER RTEPAAYDEP AVEEQRRPER RADWPEARPA RPVEPEAAPP AAEGPAQQRP
APRRRRATID LNDNGVPRRA AEQPPAAAVP AAANSGPSAA KRKGRGRRAS VPSWDEIMFG
SKKDE