Gene Ndas_4511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4511 
Symbol 
ID9248391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5348801 
End bp5350351 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content69% 
IMG OID 
Productanion transporter 
Protein accessionYP_003682405 
Protein GI297563431 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCCG AGAAGGGTGT CGACGCGCCC GACACCGCAT CCCCCACGCA GGACGGCAGA 
GTCGCCAGGG GCACCCCGCC CTCGCGCGTG ATCGGCCTGA TCGCGGGCCC GCTGGTCGGG
CTCCTGGTCT ACTTCCTCAT GCCGGACATG CCGATGCCCG TCGTCGACGG CGAGGACGGC
GAGACGGTCC TGTCGGCCAA CGGCCGCGCG GTGGCCGCGG TGACCGCCTT CATCGCCATC
TGGTGGGCCA CCGAGGCGAT CCCGATCCCG GTCGCCTCCC TGCTGCCGCT GGTGCTCTTC
CCCCTCTTGC TGGAGGGCAC TCCCGTCGGG GACGTCGCCT CCTCCTACGG GTCGGACACC
ATCTTCCTGT TCATGGGCGG CTTCATGCTC GCCCTGGCGA TGCAGAAGTG GAACCTGCAC
AAGCGGATCG CCCTGGTCAT CGTCTCCAAG GTGGGCTCCA ACACCGCGGG GCTGGTCGGC
GGCTTCATGA TCGCCACCGG GTTCATCACG ATGTGGGTGT CCAACACCGC CACCGCCGTG
ATGATGCTCC CCGTGGGCCT GTCGGTGATC ACGCTGATCA CCCAGTTCCG CGACGGCAGG
ACCGACGCGA ACTTCGCCAC CGCGCTCATG CTCGGCATCG CCTACTCGTC CTCCATCGGC
TCGGTGGCCA CCATCATCGG CACGCCGCCC AACGTGCTGA TGGTCGGCTA CCTCGCCGAC
GCCCACGACA TCCACATCGG CTTCGGCGAG TGGATGCTGG TCGGCCTGCC GCTGGCCGCC
GTCTTCCTGC TCATCGCCTG GTTCGTGCTG ATCAAGATCT TCCCGCCGCA GGTCAAGAAG
GTGGAGGGCG CCCAGGCCCT CATCCGCTCC GAGCTCGCCG AGATGGGCCC GATGGCGCGC
GGCGAGAAGC TCGTGCTCGC GGTCTTCGCC TTCGCGGCGC TCTCCTGGAT CTTCGTCCCG
GTCCTGGCCG ACAACTTCCT CCCGTGGCTG TCCAGCGTCA GCGACGCCGG GATCGCGATG
ACCGTGGCGG TCCTGCTGTT CCTGATCCCG GTCGAGCGCG GGACCCGCCT GCTGGACTGG
GAGACCGCCG TGCAGCTGCC GTGGGGCGTC CTGCTCCTGT TCGGCGGCGG TCTGGCCATC
TCGGGGCAGT TCACCGCGTC CGGCCTGAGC ACCTGGATCG GCGGCCAGGT CGCCGTCCTG
GAGGGCGTGC CCACCTGGGT GCTGATCCTG GTCGCGGCCG GGCTGGTCCT CTTCCTGACC
GAACTCACCA GCAACACCGC CACCGCGGCC ACCTTCCTGC CCATCCTCGG CGGCGTCGCG
GTGGGCATGC AGATCGACAT CCTGTCCCTG GTGATCCCGG TGGCCCTGGC CGCGACCATG
GCCTTCATGC TCCCGGTGGC CACACCGCCC AACGCGATCG TCTTCGGCTC CGGCCACGTG
CGGATCGGCC AGATGATGCG CGGCGGCGTG TGGCTCAACC TCATCGCGCT GTTCCTGATC
CTGGCCGCGA TGTACACCGT GGTGACCTGG TTCCTGGGAG TCAGCGTGTA G
 
Protein sequence
MASEKGVDAP DTASPTQDGR VARGTPPSRV IGLIAGPLVG LLVYFLMPDM PMPVVDGEDG 
ETVLSANGRA VAAVTAFIAI WWATEAIPIP VASLLPLVLF PLLLEGTPVG DVASSYGSDT
IFLFMGGFML ALAMQKWNLH KRIALVIVSK VGSNTAGLVG GFMIATGFIT MWVSNTATAV
MMLPVGLSVI TLITQFRDGR TDANFATALM LGIAYSSSIG SVATIIGTPP NVLMVGYLAD
AHDIHIGFGE WMLVGLPLAA VFLLIAWFVL IKIFPPQVKK VEGAQALIRS ELAEMGPMAR
GEKLVLAVFA FAALSWIFVP VLADNFLPWL SSVSDAGIAM TVAVLLFLIP VERGTRLLDW
ETAVQLPWGV LLLFGGGLAI SGQFTASGLS TWIGGQVAVL EGVPTWVLIL VAAGLVLFLT
ELTSNTATAA TFLPILGGVA VGMQIDILSL VIPVALAATM AFMLPVATPP NAIVFGSGHV
RIGQMMRGGV WLNLIALFLI LAAMYTVVTW FLGVSV