Gene Ndas_4354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4354 
Symbol 
ID9248229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5188605 
End bp5189747 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content72% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682249 
Protein GI297563275 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.501601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACGG TAGGACTGGC ATTCATAAGC GCGTTGGTGG GCTGGGGCGC CACCGCCGCG 
CTCGCCGCGT ACGCCCAGCG CCGTCCCAGT GTGGCCACGG TCGGATGGCT CGTCGCCGCA
CTGGGTACGA CGGTCGGCCT CAGCGCGGCC GTGATCGGCG CGATGTTCGA CTTCGGCGGG
ATCACCTTCC GGCTGCTGCA TATCGGCATC AGCCTCCTGG GCCCGCTCTA CATGGCCTGG
GGCGCGCTGG AGTACGGCGT CCGCTCGCCC CGCGGCCGGT TCACCTCGCG GCTGTTCCTC
AGCGCGTTCA CCATCGTCCC CCTGGTGGTG CTGTCCGTCG ACCGGGTGGG CACCCGTTTC
GACGCCTCCT ACCCCGCCAT GGGCGACCAC TACGACCTCA TCCCGCGCTC GCTCGTCAAC
CTCGCGCAGG TCCTCGTCGC GGTCTTCCTC GTCACGGCGC TCGTCGCGGT CGCCCGGCGC
GCCTCCGACC ACCGCAGCAC CGACCTGACG GTGCTGGGGC TGGTCGGCTT GGCCGCGCTG
CTCTCCGTCG TGGTCGGCCG CTTCGGACTC GGCTTCGGGG GGCCGCTGCT GATGCTCGGC
TCCGTCGCCG CGCTGTGGGG GGCCGTCGCC ATGGCCGCGC GCCCGCGCCG CGACCCCTAC
GACGATGACG ACTACTACGA CGACTACGAC GACGGGGGCG CGGACCCCGA CGACGAACTC
CCGGAGGAGC CCGTGCGCCG CAAGCGTCGC CAAGCCGACC CCTACGACGA CTACTACGAC
GGTCCGCCGG TCCGCCAGGC GCCGCCCAGC AAACTGCGCG GCGTCATCAC CATCTACACG
CTCGCCGACG GCCAGGGCCC GGGTTTCGAC CGCATCGCCG ACGCCCTCGT CGCCCAGGTC
TCCCACAGCG AGCCGGACAC CCTGCTGTTC GCCTGCCACA CGGTGCCCAG CGCGCCGCTC
CAGCGCATCG TCTACGCGAT GTACCGCGAC GAGCTGGCGC AGGAGGAGCA CGAGCAGCAG
CCGCACGTCC TGGAGTTCGC CCGCCTCAGC CCCCAGCACG TGGTCGCCAC CAACGTGATC
GAACTCTCCC TCGCGGGCGC GGCGGCCAGC GACGGCCTCG CCGCCATGCT GATGCCCCGC
TGA
 
Protein sequence
MVTVGLAFIS ALVGWGATAA LAAYAQRRPS VATVGWLVAA LGTTVGLSAA VIGAMFDFGG 
ITFRLLHIGI SLLGPLYMAW GALEYGVRSP RGRFTSRLFL SAFTIVPLVV LSVDRVGTRF
DASYPAMGDH YDLIPRSLVN LAQVLVAVFL VTALVAVARR ASDHRSTDLT VLGLVGLAAL
LSVVVGRFGL GFGGPLLMLG SVAALWGAVA MAARPRRDPY DDDDYYDDYD DGGADPDDEL
PEEPVRRKRR QADPYDDYYD GPPVRQAPPS KLRGVITIYT LADGQGPGFD RIADALVAQV
SHSEPDTLLF ACHTVPSAPL QRIVYAMYRD ELAQEEHEQQ PHVLEFARLS PQHVVATNVI
ELSLAGAAAS DGLAAMLMPR