Gene Ndas_0244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0244 
Symbol 
ID9244078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp299364 
End bp300791 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content71% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionYP_003678199 
Protein GI297559225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.912876 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGACA ACAGCCAGAG TTTCGTCCGG GTGCTGGGAA GGGGTGACGT CCTCGCCCTC 
GGCTTCGGCG CGATGATCGG CTTCGGCTGG ATCGTGCTGG TGGGCGACTT CGTGAGCGCG
GCGGGCAGCG CGGGCGCGGC GCTGGCCTTC GTGATCGGCG GCGTCATCAT GGCCTTCGTC
GGCCTCGCCT ACGCCGAGCT CGTCGCCGCG ATGCCGCACG CGGGCGGGGA GCACCACTAC
GCCATGCGCG CCATCGGCCC CAAGGGGGCC TTCACCGCCT CCTGGGCCAT GATCCTGGGC
TACGTCTCCG TCGTCGCCTT CGAGGCCGTG GCCGTCCCCC AGACCCTCGT CTACCTCTTC
CCCGACATGG CGGTCGGCCG CCTGTGGACC GTCGCCGGGT ACGACGTCCA CGCCGGTCTG
GTCGCCGTCG GCGTGGCCGC GGCCGTGCTG ATGACCGCCG TCAACTACGT CGGCATCAAG
CCCGCGAGCG TCTTCCAGAC CATCGCGGTG CTGTTCCTGC TGCTCACCGG GGCCGCCATG
GTCACCGGCG CGTTCGTCGG CGGTTCCGTG GACAACATGC GGCCGCTGTT CACCGGCGGC
GCCCCGGGCA TGTTCGTGGT CCTGGTCGCG GTCCCGTTCC TCTTCGTCGG CTTCGACGTC
ATCCCGCAGT CGGCCTCCGA GATCAGACTC CCGTACCGCG TGGTCGGCAC CCTGCTGGTG
CTGTCGGTGT TCTGCGCGAC CGCCTGGTAC GTGATGGTCA TGCTGACGGC GGGCTCGGGT
CTGGGCCCGG CCGAACTGGC GGACTCGGAA CTGGCCTCCG CCGACGCCGT CGCCGCCATG
TGGGACAGCG CGACCATGGG CAACCTGCTC GTCCTCGGCG GCGTCGCGGG CCTGCTCACC
AGCTGGAACG CCTTCCTCAT CGGCGGCAGC CGCCTCATCT ACGCCATGGC CGCCTCGCGG
ATGCTCCCGG CCTGGTTCGG CCGCCTGCAC CCGCGCTTTC GCACCCCGTC CAACGCCGTG
CTGTTCGTCG GAGCGCTCTC CCTGGTGTCC CCGTTCTTCG GCGAGCCGAT GCTGGGCTGG
CTGGTCAACG CGGGCGGCCT GAACATCGTG GTCGGTTTCG TCACGGTGGT GGTCAGCTTC
CTCGTACTGC GACGCCGCGA ACCCGCCATG GAGCGGCCCT TCACCGTTCC GGCGGGCGTC
CCCGTCGGCG TGGTGGCCCT CGTGCTGTCC GTGGGGCTGC TCCTGCTCTA CCTGCCCGGC
ATGCCCGCGG CGCTGAGCTG GCCCAACGAG TGGCTGATGG TGCTCGTGTG GTGGGTGGCC
GGGGCCGCCC TGATGTGGCG TCTGCCCCGG ATCACCGCGG GCCCCGACGC CGAGCGGCGC
CTGGTCGAGG TGATGGACGC GCGCTCGCGC CCGGACCCGC GGGCCTGA
 
Protein sequence
MSDNSQSFVR VLGRGDVLAL GFGAMIGFGW IVLVGDFVSA AGSAGAALAF VIGGVIMAFV 
GLAYAELVAA MPHAGGEHHY AMRAIGPKGA FTASWAMILG YVSVVAFEAV AVPQTLVYLF
PDMAVGRLWT VAGYDVHAGL VAVGVAAAVL MTAVNYVGIK PASVFQTIAV LFLLLTGAAM
VTGAFVGGSV DNMRPLFTGG APGMFVVLVA VPFLFVGFDV IPQSASEIRL PYRVVGTLLV
LSVFCATAWY VMVMLTAGSG LGPAELADSE LASADAVAAM WDSATMGNLL VLGGVAGLLT
SWNAFLIGGS RLIYAMAASR MLPAWFGRLH PRFRTPSNAV LFVGALSLVS PFFGEPMLGW
LVNAGGLNIV VGFVTVVVSF LVLRRREPAM ERPFTVPAGV PVGVVALVLS VGLLLLYLPG
MPAALSWPNE WLMVLVWWVA GAALMWRLPR ITAGPDAERR LVEVMDARSR PDPRA