Gene Ndas_4430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4430 
Symbol 
ID9248305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5274483 
End bp5275679 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content73% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003682325 
Protein GI297563351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.559401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACGG AATCGGGAAA AAGGGTGTCG CGCCGCGGCA CGGCCGGCGG CAACCTCGCG 
TTGGCCACGG CGGCGTTCAC GCTCACCTTC TGGGCGTGGA ACCTCATCGG CCCCCTGTCG
CGCACCTACA CCGAGAACCT GGGGCTGACA CCCACGCAGA CCTCGATCCT GGTGGCCTTC
CCGGTGCTGG TGGGGGCGTT GGGGCGCATC CCCGTGGGCG CGCTGACCGA CCGCTACGGC
GGGCGGCTGA TGTTCACGGC CCTGTGCTTC GCGAGCATCG TGCCGACGTT CCTGGTGGGG
GTCTCGGGCG ACTCCTACGG GATGCTGCTG CTGTGGGGAT TCGTCCTCGG CGTCGCCGGG
ACGTCGTTCG CGGTCGGCAT CCCGTTCGTG AACGCGTGGT ACCCGGCGAA CCGGCGCGGG
TTCGCGACGG GCGTGTTCGG CGCGGGGATG GGCGGCACCG CCCTGTCGTC CTTCCTCACC
CCCCGGCTGG TGGGCGCGGT GGGGCTGTTC ACCACCCACC TGCTGCTGTG CGCGGCGCTG
GCGGTCATGG GCGTGGTGAT GTGGCTGATG TGCCGGGACT CGCCGGACTG GCGGCCGCGC
ACGGAGCCCG CGCTGCCGCG CATGGGGGAG GCCGTGCGGC TGCGGGCGAC CTGGCAGGCG
TCGCTGCTGT ACGCGGTCGC CTTCGGCGGT TTCGTGGCCT TCTCGACCTA CCTGCCGACG
CTGCTGACGC TGGCCTACGA CTACGCGCAG ACCGCCGCGG GTCTGCGCAC CGCCGGGTTC
GCGGTGGCGG CCGTGGCGGC CCGGCCGGTC GGCGGCGTCC TGTCGGACCG GATCGGCCCG
GTGCGGGTGT GCCTGATCTC GTTCCTCGGC ACCTGCGTGT TCGCGGTCGT GCTGGCCCTG
CACCCGCCCG CGGAGTTCCC GGCCGGTGCC GCGTTCGTGC TGATCGCGCT GTCGCTGGGT
TTGGGCACCG GTGCCATGTT CGCGCTGGTG GCCAAGCTGG TGGAGCCGTC GAGGGTGGGC
ACGGTCACGG GCCTGGTCGG TGCGGCCGGC GGGCTGGGCG GCTACTTCCC GCCGCTGCTC
ATGGGGGCGG TGTACCAGGC GACCGGGGCC TACACGCTGG GCTTCGTACT GTTGGCGGCG
GTCGCCCTGG CGGTGGCCCT GTACACCTGG CGGGCCTTCG CCCACGTGCG GGGATGA
 
Protein sequence
MRTESGKRVS RRGTAGGNLA LATAAFTLTF WAWNLIGPLS RTYTENLGLT PTQTSILVAF 
PVLVGALGRI PVGALTDRYG GRLMFTALCF ASIVPTFLVG VSGDSYGMLL LWGFVLGVAG
TSFAVGIPFV NAWYPANRRG FATGVFGAGM GGTALSSFLT PRLVGAVGLF TTHLLLCAAL
AVMGVVMWLM CRDSPDWRPR TEPALPRMGE AVRLRATWQA SLLYAVAFGG FVAFSTYLPT
LLTLAYDYAQ TAAGLRTAGF AVAAVAARPV GGVLSDRIGP VRVCLISFLG TCVFAVVLAL
HPPAEFPAGA AFVLIALSLG LGTGAMFALV AKLVEPSRVG TVTGLVGAAG GLGGYFPPLL
MGAVYQATGA YTLGFVLLAA VALAVALYTW RAFAHVRG