Gene Ndas_5070 details


Gene Information

Locus tag: Ndas_5070
Symbol:
ID: 9248959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 214483
End bp: 215538
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: RarD protein, DMT superfamily transporter
Protein accession: YP_003682957
Protein GI: 297563984
COG category:
COG ID:
TIGRFAM ID:
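
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the function name `cds_lengths` is hypothetical, introduced only for illustration):

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(214483, 215538)
print(gene_len, protein_len)  # 1056 351
```

Under these assumptions the result agrees with the Gene Length (1056 bp) and Protein Length (351 aa) fields of this record.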


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGGAAT CCAAGCGTGG CGTCGTCCTC GGGGCGACCG CCTTCCTCCT CTGGGGTGTG 
GCGGCCCTGT ACTGGCCGCT CCTCTCCTCC TCCGAACCCA GCGAGATCCT CGCCCACCGC
ATGGTGTGGG CCCTGCTCGC GATGTGCGTC GTCATCCTGG TCACCCGCCG GGGCTGGTCC
TGGTTCCCCT CGGTGCTGCG CAGCCCGCGC CGCCTGCTGC CGGTGGCCGC AGCGGCGGTG
CTGATCTCCG TCAACTGGTG GGGGTTCATC TACGCGGTCT CCATCGAGCA GACGCTCCAG
GCCTCGCTCG CGTACTTCAT CAACCCGCTG ATGTCGGTGT GTCTGGGGGT GCTGCTGTTC
TCCGAACGGC TGCGGGCCCT CCAGTGGGCG GCGGTCGGCC TGGGCGTCCT GGCCGTGGCC
GTGATGACCG TCGCCTACGG CGTGACCCCG TGGCTGGCGC TGCTCATGGC CGCCACCTTC
GCCGCCTACG GCGCGGTGAA GAAGTACGTG GACCTGGACG GCGTGCGGAG CCTGACCGTC
GAGACCATGG TGATGTTCCT GCCCGCGCTG GGCTTCGTGG TCCACCTGGA GGCCACCGGC
GCGGGAACCA TGTTCTCCGT GTCCCCGGGC CACACCGCCC TGCTGGTCGG CAGCGGTTTC
GTGACCGCCC TGCCGCTGCT GCTGTTCGGC GTGGCCGCGC GCCAGGTCCC CCTGAGCGTC
ATCGGCATCC TCCAGTACAT CGCCCCGGTC ATCATGTTCT TCGTCGGCTG GCTGGTGCAG
GGCGAGGAGA TGCCGCCCGC ACGCTGGCTG GGCTTCGCGC TGGTGTGGCT GGCGCTGTGC
GCGTTCGTCG TCGACCAGGT CCGCGACGCC TGTTCCCGGC CCCGGTCACG CGCCTCCTCC
CGGGGTGGGG AACAGGCCGC GGAGCAGGTC GGGGAAGAGG TCGGGGAAGA GGTCGGGGAA
CAGGACCGGA CCCACGACGA ACAGCGGCGG CGCGAACTCG GACAGCCCGG GGATCCGCTC
CCCGGGGGAC CCGCCCGGCC CGAATCCGCG GACTGA
 
Protein sequence
MPESKRGVVL GATAFLLWGV AALYWPLLSS SEPSEILAHR MVWALLAMCV VILVTRRGWS 
WFPSVLRSPR RLLPVAAAAV LISVNWWGFI YAVSIEQTLQ ASLAYFINPL MSVCLGVLLF
SERLRALQWA AVGLGVLAVA VMTVAYGVTP WLALLMAATF AAYGAVKKYV DLDGVRSLTV
ETMVMFLPAL GFVVHLEATG AGTMFSVSPG HTALLVGSGF VTALPLLLFG VAARQVPLSV
IGILQYIAPV IMFFVGWLVQ GEEMPPARWL GFALVWLALC AFVVDQVRDA CSRPRSRASS
RGGEQAAEQV GEEVGEEVGE QDRTHDEQRR RELGQPGDPL PGGPARPESA D
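
The protein sequence begins with M although the gene sequence begins with GTG, which fits an alternative start codon under translation table 11. The sketch below is one way to illustrate this, not the database's own method: it builds the codon table from the usual TCAG ordering and treats ATG, GTG and TTG as start codons that are read as methionine in the first position. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the start codon set is a simplifying assumption.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only these three start codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS, reading a recognised start codon as M and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is translated as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCCGGAAT CCAAGCGTGG CGTCGTCCTC"))  # MPESKRGVVL
```

The output matches the first ten residues of the protein sequence listed in this record.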