Gene Ndas_5334 details


Gene Information

Locus tag: Ndas_5334
Symbol:
ID: 9249234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 501525
End bp: 503978
Gene Length: 2454 bp
Protein Length: 817 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: V-type H(+)-translocating pyrophosphatase
Protein accession: YP_003683220
Protein GI: 297564247
COG category:
COG ID:
TIGRFAM ID:
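
The start, end, gene length and protein length values in this record are mutually consistent if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon. Both are assumptions made for this check rather than statements from the record. The function names below are hypothetical, and the sketch is only a quick sanity check:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one amino acid.
    return gene_len_bp // 3 - 1


length_bp = gene_length(501525, 503978)
print(length_bp, "bp")                   # compare with the Gene Length field
print(protein_length(length_bp), "aa")   # compare with the Protein Length field
```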


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0178305
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCTGGGC TCAACCTCGC CGCTGAAGGC GGCATGGCCC TATCACTGGC AGGCACCGAC 
TTCACCCTCG TCATCATCGT CATGGTGGTG GCCCTTCTGG CACTCGCCGT CGCGGGCGTG
CTGGTACGTG AGGTCCTCGC CGCAGGGCAG GGCACAGAGC GAATGCGCGG CATCGCGCTC
GCGGTGCAGG AAGGAGCGGC GGCCTACCTC AAACGACAGT TCCGCACCCT CGCGGTCTTC
GTCGTCCTCA TCCCGCTCCT GCTGCTCCTG CTCCCCGCCG ACTCGTGGGC CATCGCGATC
GGCCGGTCGG TCTTCTTCGC CCTCGGCGCC CTGCTGTCGG CGGCGACCGG ATTCATCGGC
ATGTGGCTCG CCGTGCGCGG CAACGTCCGG GTCGCCGCGG CCGCCCGGGG CGGTGACGAC
GCCTCCAATC GCACCGCCAT GCGCATCGCC TTCCGGACCG GCGGAGTGGC GGGCATGATC
ACCGTCGGCC TCGGCCTGTT CGGCGCGGCC CTCGTCGTCC TCCTCTACCG GGGCGACGCC
CCCATCGTCC TGGAGGGCTT CGGCTTCGGC GCGGCCCTCC TCGCCATGTT CATGCGCGTG
GGCGGCGGCA TCTTCACCAA GGCCGCCGAC GTCGGCGCCG ACCTCGTCGG CAAGGTCGAG
CAGGGCATCC CCGAGGACGA CCCCCGCAAC GCCGCCACCA TCGCCGACAA CGTCGGCGAC
AACGTGGGTG ACTGCGCCGG CATGGCCGCG GACCTGTTCG AGTCCTACGC GGTCGTGCTC
GTCGCCTCGC TGATCCTCGG CCGCGTCGCC TTCGGCACCG AGGGACTGGT GTTCCCGCTC
CTCGTCCCCA TGATCGGCGT GCTCACCGCC ATCGTCGGCA TCTTCCTGGT CGCCCCCCGC
GCCAGGGACA AGACGGCGAT GTCGGCGATC AACCGCGGGT TCTTCATCTC CGCGGCGATC
TCCGCCGTGC TCGTCGTCGG CACCGCCTTC TGGTACCTGC CCTCCAGCTT CGACCAGCTC
TCCGGCGTCA GCCCGCAGGT CCTGGCCGAG ATCGAGGAGG CCGGGACCAA CCCGGACCCG
AGGATCCTCG CGGTCGCGGC GGTCCTCATC GGCCTCGTGC TCGCCGCCGC CATCCAGCTG
CTCACCGGCT ACTTCACCGA GACCGACCGG CGCCCCGTCC GCGAGATCGG CGACAGCTCC
GAGACCGGCG CGGCGACCGT CATCCTCTCC GGCATCTCCG TCGGCCTGGA GTCGGCCGTC
TACTCCGCCC TCCTCATCGC CGGAGCCGTC TACGCCGCGT TCCTGCTCGG CGGCGGCTCC
ATCACCCTGA GCCTGTTCGC CGTCGCCCTC GCCGGAACCG GCCTGCTCAC CACCGTCGGC
ATCATCGTGG CGATGGACAC CTTCGGCCCC GTCTCCGACA ACGCCCAGGG CATCGCGGAG
ATGTCCGGCG ACGTGGAGGG CCCGGGCGCG GAGATCCTCA CCGGGCTCGA CGCCGTCGGC
AACACCACCA AGGCCATCAC CAAGGGCATC GCGATCGCGA CGGCGGTCCT GGCCGCGACC
GCGCTCTTCG GCGCCTTCCG CACCTCCGTG CAGGCCCAGC TCGGCGACGC CGACGAGGCG
TTCTCGCTCT CACTGGACCA GCCCGACGTG CTCGTCGGCG TCATCATCGG CGCCAGCGTG
GTGTTCTTCT TCTCCGGGCT CGCCATCATG GCGGTCGGCC GGGCGGCGGG CCGCGTGGTC
ATGGAGGTGC GCGACCAGTT CCGCACCCGT CCGGGGATCA TGGACGGCAC GGAGAAGCCC
GAGTACGCGC GCGTGGTCGA CATCTGCACC AAGGACTCGC TGCGGGAGCT GGTCACCCCG
GGCCTGCTGG CCGTCCTCGC GCCGATCGCG GTCGGCTTCG CCCTCGGCTA CGCCCCGCTC
GGCGCGTTCC TGGGCGGCGC CATCGCCGCG GGAGTGCTCA TGGCCGTCTT CCTGTCCAAC
TCGGGCGGGG CCTGGGACAA CGCCAAGAAG CTCGTCGAGG ACGGCCACCA CGGCGGCAAG
GGCTCGGAGG CCCACGCCGC CACGGTCATC GGCGACACGG TCGGCGACCC CTTCAAGGAC
ACCGCGGGTC CCGCCATCAA CCCCCTGCTC AAGGTGATGA ACCTCGTCGC CCTGATCGTG
GCGCCCAGCG TGGTGATCTA CGCCGACAAC GTCGTCCTGC GGGCGGGGAT CGCGGTGGCG
GCCGTCGCCG TGCTCGTCGG CGCCATCCTG TGGTCCAAGC GCCGCGGGGG AGGCACCGAG
ACCGACCTCG CCGAGACCGG CGGCCGGTCC GCGCCCAAGG GCGCCCCGGA GACCGGAGGC
GGAGCCCCGC CGAAGGGTGA GGTCTCCCCT GCCGGGGACG GCGGTGTCGG CGGGGACGCC
GCGGACGGCG CCGACGACCG CAAGGAGGAG GCCCGCGCCG GAGAGGGGAA GTAG
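
The GC content field can be recomputed from the gene sequence by counting G and C bases. The sketch below assumes the sequence is pasted as plain text with the 10-base blocks separated by whitespace, as it is shown here. The helper name `gc_fraction` is hypothetical, and only the first line of the sequence is used for illustration:

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence; paste the full sequence to check the record's value.
first_line = "TTGTCTGGGC TCAACCTCGC CGCTGAAGGC GGCATGGCCC TATCACTGGC AGGCACCGAC"
print(f"{gc_fraction(first_line):.0%}")
```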
 
Protein sequence
MSGLNLAAEG GMALSLAGTD FTLVIIVMVV ALLALAVAGV LVREVLAAGQ GTERMRGIAL 
AVQEGAAAYL KRQFRTLAVF VVLIPLLLLL LPADSWAIAI GRSVFFALGA LLSAATGFIG
MWLAVRGNVR VAAAARGGDD ASNRTAMRIA FRTGGVAGMI TVGLGLFGAA LVVLLYRGDA
PIVLEGFGFG AALLAMFMRV GGGIFTKAAD VGADLVGKVE QGIPEDDPRN AATIADNVGD
NVGDCAGMAA DLFESYAVVL VASLILGRVA FGTEGLVFPL LVPMIGVLTA IVGIFLVAPR
ARDKTAMSAI NRGFFISAAI SAVLVVGTAF WYLPSSFDQL SGVSPQVLAE IEEAGTNPDP
RILAVAAVLI GLVLAAAIQL LTGYFTETDR RPVREIGDSS ETGAATVILS GISVGLESAV
YSALLIAGAV YAAFLLGGGS ITLSLFAVAL AGTGLLTTVG IIVAMDTFGP VSDNAQGIAE
MSGDVEGPGA EILTGLDAVG NTTKAITKGI AIATAVLAAT ALFGAFRTSV QAQLGDADEA
FSLSLDQPDV LVGVIIGASV VFFFSGLAIM AVGRAAGRVV MEVRDQFRTR PGIMDGTEKP
EYARVVDICT KDSLRELVTP GLLAVLAPIA VGFALGYAPL GAFLGGAIAA GVLMAVFLSN
SGGAWDNAKK LVEDGHHGGK GSEAHAATVI GDTVGDPFKD TAGPAINPLL KVMNLVALIV
APSVVIYADN VVLRAGIAVA AVAVLVGAIL WSKRRGGGTE TDLAETGGRS APKGAPETGG
GAPPKGEVSP AGDGGVGGDA ADGADDRKEE ARAGEGK
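
The gene sequence begins with TTG, while the protein sequence begins with M. This fits translation table 11, in which TTG is one of the alternative start codons and an initiating codon is read as methionine. Table 11 otherwise uses the same codon-to-amino-acid assignments as the standard code. The following is a minimal sketch of that translation. The function name `translate_cds` is hypothetical, and only the first line of the gene sequence is translated for illustration:

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 shares them.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in TABLE_11_STARTS:
            protein.append("M")  # initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "TTGTCTGGGC TCAACCTCGC CGCTGAAGGC GGCATGGCCC TATCACTGGC AGGCACCGAC"
print(translate_cds(first_line))  # compare with the start of the protein sequence
```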