Gene Ndas_4056 details


Gene Information

Locus tag: Ndas_4056
Symbol:
ID: 9247928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4852021
End bp: 4853130
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: inner-membrane translocator
Protein accession: YP_003681958
Protein GI: 297562984
COG category:
COG ID:
TIGRFAM ID:
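
The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4852021, 4853130)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```

Both results agree with the Gene Length (1110 bp) and Protein Length (369 aa) values listed in the record.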


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0407655
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACACCCC CACCGACCTC CGTCCTCCGG AGCCTGCGCG CCCGGACGGG CACCACCGGG 
CTGGTCTACC TGGCCCTGGT GCTGCTGCTC GCGGTCAGCG CGGCCTTCGT GGCCGCCCGG
GGCGGCAACC TGTTCACCAC CGCCAACACC GTCGACCTGC TCACCCGCAG CAGCCTGCTG
GGCTTCCTGG CCGTCGGGAT GACCCTCGTC ATCCTGTGCC GCTCCCTGGA CCTGTCGGTC
GGCTACGTGG CCGCCCTGTC CACCGTGGTC GCGGCCACCA CCATGGCGGG CGACCCCTCC
CGGATCGTCC TCGGCGTGGC CGCGGCGCTC GGCCTGGCCG CGCTGATCGG CCTGGTCAAC
GGGCTGGTGG TCACGGGGCT GCGGGTCAAC CCCTTCATCG CCACCCTGGG CATGGGGCTG
GTGATCAAGG GCTACCTGGA CACGAACTTC CAGGGCCCGG CCGGGGCGGT GCCCGCCGCC
TTCCAGACCT TCGGCTACAC CCGGATCGGT GTGCTGCCCG TCTCCACCCT GGTCATGCTG
GGCGTGGCGG TGGCGGCGGT GCTGTTCCTG AGCCGCACAC GGATGGGCTA CCACATCTAC
GCCGTCGGCG GCGACGCCGA CGTGGCCCGG CTCTCCGGGG TCCGCTCCGG GGTGCCCACG
GTCACCGCGC ACGTGTTGTG CTCGGTCACC GCCGGTGTGG CCGGTCTGCT GCTGGCCGCC
CGGTTCGGGA CCGGCAGCGC CACCGTCTAC TCCGGGGGCT ACGAACTGGA GGCCATCGCG
GCCGTGGTGC TGGGCGGGAC CTACCTGCTC GGCGGGCGCG GCGGCGTGGC CGGGACGGTG
GCGGGGGTGC TCATCCTCGC CACGCTCGAC ACCGTGTTCA ACGTGCTGGC GGTCGACCCG
TTCGTCAAGG ACGTCCTGCG CGGCGTCATC GTCATCGCCG CCGTGGCCGT CTACGCCCGC
GGCGGGCGCT CCGCCGTGCG GACGCGCTTC CCCTCCGGCG GCGCGCCGCC GTCCTCCCCC
GTGCCGCGGC CCGCCCCGGA CACCGGGACC GCCCCCGATC CGGACCCCGG GACGGGATCC
GCACCGCAAC CCCTCGGAGG CCGCCGATGA
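
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to clean such text and compute a GC fraction for comparison with the GC content field (74%). The helper names are hypothetical, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(blocked_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two blocks of the gene sequence, used here only as a short sample.
sample = clean_sequence("GTGACACCCC CACCGACCTC")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.7
```

Passing the full gene sequence through the same two functions gives a value that can be checked against the reported GC content.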
 
Protein sequence
MTPPPTSVLR SLRARTGTTG LVYLALVLLL AVSAAFVAAR GGNLFTTANT VDLLTRSSLL 
GFLAVGMTLV ILCRSLDLSV GYVAALSTVV AATTMAGDPS RIVLGVAAAL GLAALIGLVN
GLVVTGLRVN PFIATLGMGL VIKGYLDTNF QGPAGAVPAA FQTFGYTRIG VLPVSTLVML
GVAVAAVLFL SRTRMGYHIY AVGGDADVAR LSGVRSGVPT VTAHVLCSVT AGVAGLLLAA
RFGTGSATVY SGGYELEAIA AVVLGGTYLL GGRGGVAGTV AGVLILATLD TVFNVLAVDP
FVKDVLRGVI VIAAVAVYAR GGRSAVRTRF PSGGAPPSSP VPRPAPDTGT APDPDPGTGS
APQPLGGRR
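
The protein begins with M even though the gene begins with GTG, which is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon. The following is a minimal sketch of that translation, not the pipeline used to produce this record. It assumes the input is a complete CDS in frame, uses the standard codon assignments shared by table 11, and treats the start codon set as listed for table 11 by NCBI; the function name is hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS; a valid first codon is read as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate_cds("GTGACACCCC CACCGACCTC CGTCCTCCGG"))  # MTPPPTSVLR
```

The output matches the first ten residues of the protein sequence listed above.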